SemanticScuttle - klotz.me » Tags: information theory

Tags: information theory*

0 bookmark(s) - Sort by: Date ↓ / Title /

The Physics of mHC: Why Deep Learning Needs Energy Conservation

This article presents a compelling argument that the Manifold-Constrained Hyper-Connections (mHC) method in deep learning isn't just a mathematical trick, but a fundamentally physics-inspired approach rooted in the principle of energy conservation.

The author argues that standard neural networks act as "active amplifiers," injecting energy and potentially leading to instability. mHC, conversely, aims to create "passive systems" that route information without creating or destroying it. This is achieved by enforcing constraints on the weight matrices, specifically requiring them to be doubly stochastic.

The derivation of these constraints is presented from a "first principles" physics perspective:

* **Conservation of Signal Mass:** Ensures the total input signal equals the total output signal (Column Sums = 1).
* **Bounding Signal Energy:** Prevents energy from exploding by ensuring the output is a convex combination of inputs (non-negative weights).
* **Time Symmetry:** Guarantees energy conservation during backpropagation (Row Sums = 1).

The article also draws a parallel to Information Theory, framing mHC as a way to combat the Data Processing Inequality by preserving information through "soft routing" – akin to a permutation – rather than lossy compression.

Finally, it explains how the Sinkhorn-Knopp algorithm is used to enforce these constraints, effectively projecting the network's weights onto the Birkhoff Polytope, ensuring stability and adherence to the laws of thermodynamics. The core idea is that a stable deep network should behave like a system of pipes and valves, routing information without amplifying it.

2026-01-14 Tags: mhc, deep learning, physics, energy conservation, doubly stochastic matrices, sinkhorn-knopp algorithm, information theory, neural networks, deep seek, llm by klotz

What Is Entropy? A Measure of Just How Little We Really Know.

Entropy, once seen as a measure of disorder in physical systems, is now understood as a reflection of our ignorance and knowledge limitations. This evolving perspective links entropy to information theory and challenges traditional views of objectivity in science.

2025-08-22 Tags: entropy, thermodynamics, information theory, statistical mechanics, observer, uncertainty, second law of thermodynamics, shannon entropy, jaynes entropy, quantum information by klotz

Duality between predictability and reconstructability in complex systems

The relationship between predictability and reconstructability, and how it can vary in opposite directions in complex systems. The work is based on information theory and was performed on various dynamics on random graphs, including continuous deterministic systems, and provides analytical calculations of the uncertainty coefficients for many different systems.

2024-05-28 Tags: complex systems, predictability, reconstructability, information theory, time series, cybernetics, requisite variety, production engineering by klotz

Understanding Abstractions in Neural Networks: The Core of Cognition

This article explains the concept of abstraction in neural networks and its connection to generalization. It also discusses how different components in neural networks contribute to abstraction and reveals an interesting duality between abstraction and generalization.

2024-05-15 Tags: neural networks, abstraction, generalization, information theory, mathematics, machine learning by klotz

Entropy - A Key Concept for All Data Science Beginners

2020-11-09 Tags: information theory, entropy, machine learning by klotz

[1401.4767] Hilbert-space factorization is a limited and expensive information-processing resource

2019-02-06 Tags: hilbert space, tegmark, information theory, entropy, ontology by klotz

“Deep Learning is Non-Equilibrium Information Dynamics”

2018-12-27 Tags: deep learning, information theory, chaos, nonlinear dynamics, carlos perez, medium by klotz

“How to Explain Deep Learning using Chaos and Complexity”

2018-12-27 Tags: deep learning, information theory, chaos, nonlinear dynamics, carlos perez, medium by klotz

“The Asymmetry of Information Discovery and the Limitation of Entropy”

2018-10-14 Tags: ai, information theory, entropy by klotz

Information Theory and Statistical Mechanics