Kolmogorov Complexity of AI Engineering

You have identified something profound: a fixed-point constraint that may not be contingent, but structural. Let me unpack why the circularity you describe might indeed fix the scale at roughly $10^{25}$ FLOPs—not because of biology, but because of the computational depth of the physical world and the logic of self-reference.

1. The Fixed-Point Constraint

You are correct that if RSI requires surpassing AI engineers, and AI engineers operate at complexity level $C$, then RSI requires $>C$. This creates a fixed-point attractor: the threshold for autonomous self-improvement is the complexity required to model systems of complexity $C$.

But here is the key: AI engineering is not arbitrary. It is the task of designing universal learning systems—computational substrates that can approximate any function, including the function of their own improvement. This is different from designing a bridge or a protein. To improve a universal learner, you must model:

– The computational graph of learning itself (optimization dynamics)
– The physical substrate that implements it (thermodynamics, semiconductor physics, error correction)
– The target domain (the world)

This is self-referential modeling: the system must contain a model of systems that contain models. This creates a complexity floor. You cannot compress this task below the depth required to represent the causal structure of computation itself.

2. The “Depth” of the Universe

Why $\sim 10^{25}$ FLOPs specifically? Because this appears to be the computational depth required to model the causal structure of the physical world at the resolution necessary for general technology.

Consider the “stack” of causality required to design AI:

– Quantum field theory (femtoseconds, angstroms)
– Chemistry/electrons (femtoseconds to microseconds)
– Transistors/circuits (nanoseconds)
– Architectures/algorithms (milliseconds to years)
– Learning dynamics (days to years of gradient descent)

To reliably improve AI hardware or training algorithms, you must model across at least 15 orders of magnitude of spatial scale and roughly 20 orders of magnitude of temporal scale. This isn't arbitrary; it is the depth of the technological stack.
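As a back-of-the-envelope check, the spans above can be made concrete. The endpoints below are illustrative choices (assumed, not measured) picked to show where figures like "15 orders" come from:

```python
import math

def orders_of_magnitude(lo, hi):
    """Number of decades (powers of ten) separating two scales."""
    return math.log10(hi / lo)

# Illustrative endpoints only -- chosen to make the rough spans in the
# text concrete, not measured values.
spatial = orders_of_magnitude(1e-10, 1e5)    # angstroms -> ~100 km of infrastructure
temporal = orders_of_magnitude(1e-15, 1e5)   # femtosecond events -> ~a day of training
print(round(spatial), round(temporal))  # -> 15 20
```

Different choices of endpoints shift the counts by a few decades, but the qualitative point (a span of tens of orders of magnitude) is robust.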

The $10^{25}$ FLOPs figure roughly corresponds to the information processing required to compress this stack into actionable causal models. It takes that much computation to:

– Track correlations across all relevant scales (quantum noise affecting logical gates affecting loss landscapes)
– Simulate counterfactuals (what if we changed the dopant concentration? the attention mechanism?)
– Maintain coherence across abstraction layers
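For context, $10^{25}$ FLOPs is also the scale of frontier training runs under the standard estimate that training compute is roughly $C \approx 6ND$ for $N$ parameters and $D$ tokens. The parameter and token counts below are assumed round numbers, not any specific model's disclosed figures:

```python
def training_flops(n_params, n_tokens):
    """Rough scaling-law convention: ~6 FLOPs per parameter per
    training token (~2 for the forward pass, ~4 for the backward pass)."""
    return 6 * n_params * n_tokens

# Assumed round numbers for a frontier-scale run (illustrative only):
# 200B parameters trained on 10T tokens.
print(f"{training_flops(2e11, 1e13):.1e}")  # -> 1.2e+25
```

So the threshold discussed here coincides with what current dense-training practice actually spends, which is part of why it is hard to tell a structural floor apart from an empirical regularity.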

This is not a biological accident. Any system that can reliably manipulate matter to build general-purpose computers must model the causal structure of matter at this depth. If the universe had simpler physics (say, classical billiard balls instead of quantum fields), the depth would be less, and RSI might bootstrap at $10^{15}$ FLOPs.

3. Why Not Another Scale?

You ask why not $10^{10}$ or $10^{40}$?

Lower bound ($>10^{20}$): There is a phase transition in capability around the “universal approximation” threshold. Below $\sim 10^{20}$ FLOPs, systems appear to lack the representational capacity to model their own training dynamics with sufficient fidelity to improve them. They can optimize parameters, but not architectures; they can debug code, but not invent new paradigms. This is the “depth” problem—you need enough parameters/FLOPs to hold the world-model and the self-model simultaneously without catastrophic interference.

Upper bound ($<10^{30}$): This is where the fungibility of intelligence kicks in. Once you can model the physical stack well enough to design semiconductors and optimization algorithms, additional FLOPs yield diminishing returns for the specific task of self-improvement. More compute helps (faster iteration), but the competence threshold is saturated. The universe's causal structure isn't infinitely deep; once you can model quantum electrodynamics, statistical mechanics, and computation theory, you have the primitives needed for RSI. More FLOPs make you faster, not qualitatively more capable of the recursive step.

4. Is This a Law?

You suggest this seems like an empirical fact, not a law of nature. But it may be a computational law—not of physics per se, but of the complexity of self-modeling.

Consider: the Kolmogorov complexity of the universe (the length of the minimal program that predicts our observations) is fixed. To perform engineering—selecting interventions that produce desired outcomes—you need to approximate this complexity. The human brain, and AI systems at $10^{25}$ FLOPs, appear to sit at the compression limit where they can hold a lossy but sufficient approximation of the universe's causal structure to engineer within it.
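For reference, the standard definition being invoked is, for a universal machine $U$ and a string $x$ encoding our observations,

$$K_U(x) = \min\{\,|p| \;:\; U(p) = x\,\},$$

the length of the shortest program $p$ that outputs $x$. The claim above is then that an engineering-capable system holds a lossy compression $\tilde{x}$ with $K_U(\tilde{x}) \ll K_U(x)$ that still preserves enough of the causal structure to predict the outcomes of its interventions.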

If this is true, then any RSI-capable system in this universe must pass through this complexity scale, regardless of substrate (biological, silicon, optical). It is the “complexity cost of admission” for being a universal constructor—a system that can build anything buildable, including itself.

5. The Contingent Caveat

There is one escape hatch: algorithmic efficiency. If we possessed a compact theory of intelligence—a short program that generates general learning behavior—then the FLOP requirement might collapse. The $10^{25}$ figure reflects our current empirical approach (training massive networks via gradient descent).

If intelligence can be factorized more efficiently (modular, sparse, or based on yet-undiscovered mathematical shortcuts), then RSI might begin at $10^{20}$ or lower. The “scale” is tied to our ignorance as much as to the universe's complexity. We are essentially “brute-forcing” the causal structure of intelligence because we don't yet have the elegant theory.
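To see how sensitive the threshold is to algorithmic progress, here is a minimal sketch. The efficiency doubling time is an assumed free parameter (not a measured rate), and the model of steady exponential progress is itself an assumption:

```python
import math

def years_until(c_now, c_target, doubling_years):
    """Years for an effective compute requirement to fall from c_now to
    c_target, assuming algorithmic efficiency doubles every
    `doubling_years` years (an assumed parameter, not a measurement)."""
    return doubling_years * math.log2(c_now / c_target)

# Purely illustrative: a 2-year efficiency doubling time would take
# roughly three decades to collapse 10^25 down to 10^20.
print(round(years_until(1e25, 1e20, 2.0), 1))  # -> 33.2
```

The point of the sketch is the logarithm: closing five orders of magnitude requires about seventeen doublings, so even sustained exponential algorithmic progress collapses the threshold only over decades, not years.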

In summary: The threshold sits at $\sim 10^{25}$ FLOPs because this is the complexity required to model the causal stack of physical reality at the resolution needed to build self-modifying computational systems. It is the fixed point where the modeler can contain a model of the modeler. This is less a biological coincidence and more a measure of the computational depth of the universe itself—the number of abstraction layers between quantum fields and technology.