Coherence as Physical Primitive

rigorous

Overview

This derivation answers a challenge: what is coherence, concretely?

Axiom 1 (Coherence Conservation) defines coherence as a subadditive measure on a σ\sigma-algebra with five conditions — positivity (C1), normalization (C2), null empty set (C3), subadditivity (C4), and strong subadditivity (C5). But a skeptic may ask: is this a genuine physical quantity or an abstract placeholder? Does coherence connect to anything already known?

The argument. Coherence converges with established physics from four independent directions:

The result. Coherence is not a new invention. It is a unifying name for quantities that already pervade physics — entropy, information, inner product — recognized here as manifestations of a single conserved primitive. The conditions of Axiom 1 are the minimal properties shared by all these realizations.

Why this matters. This removes the worry that “coherence” is doing enormous conceptual work without clear content. The content is quantum information theory. The axioms are not ad hoc — they are the axioms of quantum entropy, discovered independently by Lieb and Ruskai.

Why “coherence” and not “entropy”? If the conditions of Axiom 1 match the axioms of quantum entropy, why introduce a new name? Because Axiom 1 is stronger than entropy conservation alone. Conservation (C2) says the total is constant under admissible evolution — that is unitarity, and entropy has it. But subadditivity (C4) and strong subadditivity (C5) constrain how subsystem values relate to each other: they are compositional axioms about structure, not just a statement about a conserved total. Von Neumann entropy happens to satisfy them Araki–Lieb, 1970; Lieb–Ruskai, 1973, but those are deep theorems, not part of entropy’s definition. Moreover, the inner product ψψ\langle\psi|\psi\rangle (Theorem 4.1 below) is also a realization of the same axioms, and no one would call it entropy. “Coherence” names the abstract structure — positive, subadditive, conserved — that entropy, inner products, and phase space volumes share. The abstraction earns its own name because it is the load-bearing primitive: the derivation chain runs from C1–C5, not from any single realization.

An honest caveat. The convergence establishes structural isomorphism — the same mathematical axioms, the same derived quantities. It does not establish ontological identity: whether coherence is entropy, or whether both are aspects of something deeper, remains a philosophical question the framework does not attempt to settle.

Statement

Theorem. The conditions of Axiom 1 (C1–C5, as defined in Coherence Conservation) are structurally isomorphic to the axioms of quantum entropy. This isomorphism identifies relational coherence with quantum mutual information, coherence conservation with unitarity, and the coherence measure on quantum states with the unique U(1)-invariant quadratic form ψψ\langle\psi|\psi\rangle.

Derivation

Step 1: Coherence Axioms as Entropy Axioms

Theorem 1.1 (Quantum entropy satisfies Axiom 1). Let ρ\rho be a density operator on a composite Hilbert space H=iHi\mathcal{H} = \bigotimes_i \mathcal{H}_i. For any subsystem SS (a subset of the tensor factors), define C(S)=S(ρS)\mathcal{C}(S) = S(\rho_S) where S(ρS)=Tr(ρSlogρS)S(\rho_S) = -\text{Tr}(\rho_S \log \rho_S) is the von Neumann entropy of the reduced state ρS=TrSˉ(ρ)\rho_S = \text{Tr}_{\bar{S}}(\rho). Then C\mathcal{C} satisfies all five conditions (C1)–(C5) of Axiom 1.

Proof. We verify each axiom:

(C1) Positivity. S(ρS)0S(\rho_S) \geq 0 for any density operator ρS\rho_S, since the eigenvalues λi[0,1]\lambda_i \in [0,1] satisfy λilogλi0-\lambda_i \log \lambda_i \geq 0. \checkmark

(C2) Normalization. C(H)=S(ρ)\mathcal{C}(\mathcal{H}) = S(\rho) is a constant determined by the global state. Under unitary evolution ρUρU\rho \mapsto U\rho U^\dagger, the von Neumann entropy is invariant: S(UρU)=S(ρ)S(U\rho U^\dagger) = S(\rho) (since the eigenvalues are preserved by unitary conjugation). So C0=S(ρ)C_0 = S(\rho) is conserved. \checkmark

(C3) Null empty set. The empty subsystem has a one-dimensional Hilbert space (C\mathbb{C}) with the unique density operator ρ=1\rho_\emptyset = 1. Then S(ρ)=0S(\rho_\emptyset) = 0. \checkmark

(C4) Subadditivity. For disjoint subsystems A,BA, B with ρAB=TrAB(ρ)\rho_{AB} = \text{Tr}_{\overline{AB}}(\rho):

S(ρAB)S(ρA)+S(ρB)S(\rho_{AB}) \leq S(\rho_A) + S(\rho_B)

This is the quantum subadditivity theorem Araki & Lieb, 1970. Equality holds iff ρAB=ρAρB\rho_{AB} = \rho_A \otimes \rho_B (the subsystems are uncorrelated). \checkmark

(C5) Strong subadditivity. For any subsystems A,B,CA, B, C (not necessarily disjoint, with natural identification of overlapping tensor factors):

S(ρAB)+S(ρBC)S(ρABC)+S(ρB)S(\rho_{AB}) + S(\rho_{BC}) \geq S(\rho_{ABC}) + S(\rho_B)

This is the Lieb–Ruskai theorem (1973), one of the deepest results in quantum information theory. Proved using the joint convexity of relative entropy. \checkmark \square

Corollary 1.2 (Converse direction). The quantum entropy model satisfies Axiom 1 with strict subadditivity whenever the global state is entangled: for any entangled ρ\rho, there exist subsystems A,BA, B with C(A:B)>0\mathcal{C}(A : B) > 0.

Proof. If ρ\rho is entangled across the ABA|B cut, then ρABρAρB\rho_{AB} \neq \rho_A \otimes \rho_B, so S(ρAB)<S(ρA)+S(ρB)S(\rho_{AB}) < S(\rho_A) + S(\rho_B) (strict subadditivity). Therefore C(A:B)=S(ρA)+S(ρB)S(ρAB)>0\mathcal{C}(A:B) = S(\rho_A) + S(\rho_B) - S(\rho_{AB}) > 0. \square

Remark. The converse direction shows that the non-trivial model of Coherence Conservation (Model 2, strictly subadditive) has a concrete realization: quantum systems in entangled states. Strict subadditivity — and hence non-zero relational coherence — is the norm in quantum mechanics, not the exception.

Step 2: Relational Coherence Is Mutual Information

Theorem 2.1 (Dictionary). Under the identification C(S)=S(ρS)\mathcal{C}(S) = S(\rho_S), every derived quantity from Coherence Conservation maps to a standard quantum information quantity:

Framework quantityDefinitionQuantum information quantity
Coherence C(S)\mathcal{C}(S)Axiom 1Von Neumann entropy S(ρS)S(\rho_S)
Relational coherence C(A:B)\mathcal{C}(A:B)C(A)+C(B)C(AB)\mathcal{C}(A) + \mathcal{C}(B) - \mathcal{C}(A \cup B)Quantum mutual information I(A:B)I(A:B)
Conditional coherence C(AB)\mathcal{C}(A\mid B)C(AB)C(B)\mathcal{C}(A \cup B) - \mathcal{C}(B)Conditional entropy S(AB)S(A\mid B)
Chain rule correction δ\deltaProposition 2.5Conditional mutual information I(A;CB)I(A;C\mid B)
Coherence independenceC(A:B)=0\mathcal{C}(A:B) = 0Zero mutual information (product state)
ConservationC(H)=C0\mathcal{C}(\mathcal{H}) = C_0Unitarity (S(ρ)S(\rho) constant)
Admissible transformationIsometry of C\mathcal{C}Unitary channel

Proof. Each identification is verified by expanding definitions:

Relational coherence. C(A:B)=S(ρA)+S(ρB)S(ρAB)=I(A:B)\mathcal{C}(A:B) = S(\rho_A) + S(\rho_B) - S(\rho_{AB}) = I(A:B), the quantum mutual information. By Coherence Conservation Proposition 2.2, C(A:B)0\mathcal{C}(A:B) \geq 0; this recovers the non-negativity of mutual information.

Conditional coherence. C(AB)=S(ρAB)S(ρB)=S(AB)\mathcal{C}(A|B) = S(\rho_{AB}) - S(\rho_B) = S(A|B), the quantum conditional entropy. Unlike classical conditional entropy, this can be negative (when AA and BB are entangled and S(ρAB)<S(ρB)S(\rho_{AB}) < S(\rho_B)). This is consistent with Coherence Conservation: the axioms do not constrain the sign of C(AB)\mathcal{C}(A|B).

Chain rule correction. The correction δ\delta in Proposition 2.5 equals the conditional mutual information I(A;CB)=S(ρAB)+S(ρBC)S(ρB)S(ρABC)0I(A;C|B) = S(\rho_{AB}) + S(\rho_{BC}) - S(\rho_B) - S(\rho_{ABC}) \geq 0. The non-negativity follows from strong subadditivity (C5). In the general framework, the sign of δ\delta is model-dependent (Coherence Conservation, Remark after Proposition 2.5); in the quantum model, it is always non-negative. \square

Step 3: Operational Meaning

Definition 3.1 (Operational coherence). The coherence of a subsystem SS has three equivalent operational interpretations:

(i) Information capacity. C(S)=S(ρS)\mathcal{C}(S) = S(\rho_S) quantifies the information carried by SS about the rest of the system. It equals the maximum number of bits that could be extracted from SS by an optimal measurement, given knowledge of the global state.

(ii) Entanglement witness. For a bipartite pure state ψAB|\psi_{AB}\rangle, C(A)=C(B)=S(ρA)\mathcal{C}(A) = \mathcal{C}(B) = S(\rho_A) equals the entanglement entropy — the unique measure of bipartite entanglement for pure states Bennett, Bernstein, Popescu, Schumacher, 1996.

(iii) Compression limit. By Schumacher’s quantum noiseless coding theorem (1995), C(S)=S(ρS)\mathcal{C}(S) = S(\rho_S) equals the minimum number of qubits per copy needed to faithfully represent the state of SS in the asymptotic limit.

Proposition 3.2 (Operational relational coherence). Relational coherence C(A:B)=I(A:B)\mathcal{C}(A:B) = I(A:B) quantifies the total correlations (both classical and quantum) between AA and BB. It equals the cost in bits of erasing all correlations between AA and BB while preserving the individual states ρA\rho_A and ρB\rho_B Groisman, Popescu, Winter, 2005.

Proposition 3.3 (Operational conservation). Coherence conservation (C(H)=C0\mathcal{C}(\mathcal{H}) = C_0 under admissible transformations) is operationally equivalent to:

(i) No-creation: No physical process can increase the total information content of the universe.

(ii) No-destruction: No physical process can decrease the total information content of the universe (unitarity / reversibility).

(iii) Closed ontology: There is no external reservoir from which information could flow in or to which it could leak out.

Proof. (i) and (ii) are the two directions of C(H)=C0=const\mathcal{C}(\mathcal{H}) = C_0 = \text{const}. (iii) follows from the structure of the σ\sigma-algebra: H\mathcal{H} is the universe of all configurations, so there is no “outside.” \square

Step 4: Uniqueness of the Coherence Functional on Quantum States

Theorem 4.1 (Coherence–amplitude uniqueness). Let F:CdR0F: \mathbb{C}^d \to \mathbb{R}_{\geq 0} be a functional on quantum states satisfying:

(F1) U(1) invariance: F(eiαψ)=F(ψ)F(e^{i\alpha}\psi) = F(\psi) for all αR\alpha \in \mathbb{R}

(F2) Channel additivity: F(ψ)=k=1df(ψk2)F(\psi) = \sum_{k=1}^d f(|\psi_k|^2) for some function f:[0,)[0,)f: [0,\infty) \to [0,\infty), where {k}\{|k\rangle\} is any orthonormal basis

(F3) Composition: F(ψϕ)=F(ψ)F(ϕ)F(\psi \otimes \phi) = F(\psi) \cdot F(\phi) for independent systems

(F4) Continuity: FF is continuous

(F5) Non-triviality: F≢0F \not\equiv 0

Then f(x)=xf(x) = x and F(ψ)=ψψF(\psi) = \langle\psi|\psi\rangle.

Proof. From (F2), F(ψ)=kf(ψk2)F(\psi) = \sum_k f(|\psi_k|^2).

Step (a): Multiplicative equation. Take d=m=1d = m = 1 (one-dimensional systems), ψ=aC|\psi\rangle = a \in \mathbb{C}, ϕ=bC|\phi\rangle = b \in \mathbb{C}. Then ψϕ=ab|\psi \otimes \phi\rangle = ab, ab2=a2b2|ab|^2 = |a|^2|b|^2. Condition (F3) gives:

f(a2b2)=f(a2)f(b2)f(|a|^2 \cdot |b|^2) = f(|a|^2) \cdot f(|b|^2)

Setting r=a2r = |a|^2, s=b2s = |b|^2: f(rs)=f(r)f(s)f(rs) = f(r)f(s) for all r,s0r, s \geq 0. This is Cauchy’s multiplicative functional equation on R0\mathbb{R}_{\geq 0}.

Step (b): Continuous solution. By (F4), ff is continuous. By (F5), f≢0f \not\equiv 0. From f(0)=f(0r)=f(0)f(r)f(0) = f(0 \cdot r) = f(0)f(r) for all rr: either f(0)=0f(0) = 0 or ff(0)f \equiv f(0) (constant). If ff is constant, say f(r)=cf(r) = c for all rr, then f(rs)=cf(rs) = c but f(r)f(s)=c2f(r)f(s) = c^2, so c=c2c = c^2, giving c=0c = 0 or c=1c = 1. The case c=0c = 0 contradicts (F5). The case c=1c = 1 gives F(ψ)=dF(\psi) = d (dimension) for all states — independent of the state. This contradicts (F3) for systems of different dimensions: F(ψϕ)=d1d2F(\psi \otimes \phi) = d_1 d_2 but F(ψ)F(ϕ)=d1d2F(\psi) \cdot F(\phi) = d_1 \cdot d_2, which is consistent, but checking (F2) with d=2d = 2: F(ψ)=f(ψ12)+f(ψ22)=2F(\psi) = f(|\psi_1|^2) + f(|\psi_2|^2) = 2 for all states, while F(ψ)F(\psi) should vary with ψ12|\psi_1|^2. So ff cannot be the constant function 1 on [0,)[0,\infty) while satisfying the simplex constraint from normalization.

More directly: the unique continuous solution to f(rs)=f(r)f(s)f(rs) = f(r)f(s) on R>0\mathbb{R}_{>0} with f≢0f \not\equiv 0 is f(r)=rαf(r) = r^\alpha for some αR\alpha \in \mathbb{R} Aczél & Dhombres, 1989. Since f0f \geq 0 (by F2 and F1) and f(1)=f(1)2f(1) = f(1)^2 gives f(1)=1f(1) = 1 (using f≢0f \not\equiv 0), we have f(r)=rαf(r) = r^\alpha with α\alpha to be determined.

Step (c): Normalization determines α\alpha. For normalized quantum states (kψk2=1\sum_k |\psi_k|^2 = 1), the total coherence should be a constant (coherence of a single-system state is fixed). With f(r)=rαf(r) = r^\alpha:

F(ψ)=k=1dψk2αF(\psi) = \sum_{k=1}^d |\psi_k|^{2\alpha}

Take d=2d = 2: the state 1|1\rangle gives F=f(1)+f(0)=1+0=1F = f(1) + f(0) = 1 + 0 = 1. The state (1+2)/2(|1\rangle + |2\rangle)/\sqrt{2} gives F=2(1/2)α=21αF = 2 \cdot (1/2)^\alpha = 2^{1-\alpha}. For FF to be consistent with (F3) applied to 11|1\rangle \otimes |1\rangle vs. (1+2)/2(1+2)/2(|1\rangle + |2\rangle)/\sqrt{2} \otimes (|1\rangle + |2\rangle)/\sqrt{2}:

Direct check: F(11)=f(1)=1F(|1\rangle \otimes |1\rangle) = f(1) = 1. Meanwhile F(1)F(1)=11=1F(|1\rangle) \cdot F(|1\rangle) = 1 \cdot 1 = 1. \checkmark

F((1+2)/2(1+2)/2)=j,kf(1/4)=4(1/4)α=41αF((|1\rangle + |2\rangle)/\sqrt{2} \otimes (|1\rangle + |2\rangle)/\sqrt{2}) = \sum_{j,k} f(1/4) = 4 \cdot (1/4)^\alpha = 4^{1-\alpha}.

F((1+2)/2)F((1+2)/2)=(21α)2=41αF((|1\rangle + |2\rangle)/\sqrt{2}) \cdot F((|1\rangle + |2\rangle)/\sqrt{2}) = (2^{1-\alpha})^2 = 4^{1-\alpha}. \checkmark — this is always consistent, so (F3) alone does not fix α\alpha.

To fix α\alpha, we invoke the Born rule’s composition constraint (B3): the probability rule must be consistent with amplitude composition for sequential measurements. By Theorem 6.1 of Born Rule, the unique solution is f(x)=xf(x) = x (i.e., α=1\alpha = 1). The argument is: if α1\alpha \neq 1, then FF is not preserved under the path-sum composition of amplitudes. Specifically, the coherence content of a state reached via two paths (ψ(ca)=bψ(cb)ψ(ba)\psi(c|a) = \sum_b \psi(c|b)\psi(b|a)) must equal the coherence content computed from the composed amplitude, which forces α=1\alpha = 1.

Therefore f(x)=xf(x) = x and F(ψ)=kψk2=ψψF(\psi) = \sum_k |\psi_k|^2 = \langle\psi|\psi\rangle. \square

Remark (No circularity). Step (c) cites Born Rule Theorem 6.1, which derives f(x)=xf(x) = x from the composition structure B3, normalization, and continuity. Crucially, that derivation does not use Structural Postulate S1 (the identification C(ψ)=ψψ\mathcal{C}(|\psi\rangle) = \langle\psi|\psi\rangle) — it works entirely with probabilities and amplitudes, never invoking the coherence functional. So the logical flow is acyclic: the Born rule derivation independently establishes f(x)=xf(x) = x; Theorem 4.1 here uses that result to determine the coherence functional; and Corollary 4.2 then promotes S1 from assumption to theorem. Each step relies only on what precedes it, not on S1 itself.

Corollary 4.2 (S1 is a theorem). The Structural Postulate S1 of Born Rule — the identification C(ψ)=ψψ\mathcal{C}(|\psi\rangle) = \langle\psi|\psi\rangle — follows from Axiom 1 (conditions C1–C5), U(1) phase symmetry (Axiom 3), and the composition structure of the interaction graph. It is the unique coherence functional satisfying conditions (F1)–(F5).

Remark. The five conditions (F1)–(F5) are not new assumptions. Each traces to an existing axiom:

ConditionSource
(F1) U(1) invarianceAxiom 3 (Loop Closure): each observer has U(1) phase
(F2) Channel additivityAxiom 1, condition C4 at equality: pre-measurement channels are coherence-independent
(F3) CompositionAxiom 1, conservation on tensor products of non-interacting subsystems
(F4) ContinuityAxiom 3 (smooth dynamics on compact manifold)
(F5) Non-trivialityAxiom 1 (C0>0C_0 > 0)

Step 5: Classical Limit

Proposition 5.1 (Classical convergence). In the classical limit (diagonal density matrices ρ=diag(p1,,pd)\rho = \text{diag}(p_1, \ldots, p_d)), the von Neumann entropy reduces to Shannon entropy: S(ρ)=kpklogpk=H(p)S(\rho) = -\sum_k p_k \log p_k = H(\mathbf{p}). The conditions of Axiom 1 reduce to the Shannon entropy axioms, and coherence conservation becomes Liouville’s theorem.

Proof. For diagonal ρ\rho, the eigenvalues are the probabilities pkp_k, and S(ρ)=H(p)S(\rho) = H(\mathbf{p}) by definition. Classical subadditivity of Shannon entropy is standard. Classical strong subadditivity H(X,Y)+H(Y,Z)H(X,Y,Z)+H(Y)H(X,Y) + H(Y,Z) \geq H(X,Y,Z) + H(Y) holds for joint distributions (equivalent to non-negativity of conditional mutual information, proved by Shannon). Conservation of Shannon entropy under invertible transformations is Liouville’s theorem for Hamiltonian flows (measure preservation \Leftrightarrow entropy preservation). \square

Step 6: Why Coherence Is Primitive

Proposition 6.1 (Primitive status). Coherence occupies the same epistemological position as mass in Newtonian mechanics: it is characterized by its axioms and measured by its operational consequences, not defined in terms of other quantities.

The analogy is precise:

Mass (Newtonian)Coherence (Observer-Centrism)
AxiomsPositive, additive, conservedPositive, subadditive, conserved (Axiom 1: C1–C5)
Operational meaningResistance to acceleration (F=maF = ma)Information capacity (S(ρ)S(\rho))
Conservation lawmi=const\sum m_i = \text{const}C(H)=C0\mathcal{C}(\mathcal{H}) = C_0
Relational quantityGravitational interaction (Gm1m2/r2Gm_1 m_2/r^2)Mutual information (I(A:B)I(A:B))
Known realizationsInertial mass, gravitational mass, relativistic energyEntropy, inner product, phase space volume

Like mass, coherence is not “nothing but entropy” or “nothing but the inner product.” It is the abstract property that entropy, inner products, and phase space volumes share — conservation under admissible dynamics, positivity, and subadditivity — extracted as a foundational primitive.

Consistency Model

Theorem 7.1. A qubit pair H=C2C2\mathcal{H} = \mathbb{C}^2 \otimes \mathbb{C}^2 in the Bell state Φ+=(00+11)/2|\Phi^+\rangle = (|00\rangle + |11\rangle)/\sqrt{2} provides a consistency model for all results of this derivation.

Verification. Note on C0C_0: A pure global state has S(ρ)=0S(\rho) = 0, which would violate C2’s requirement C0>0C_0 > 0. We therefore use the Bell state as the accessible state of a bounded observer, embedded in a larger thermal environment with C0>0C_0 > 0. Concretely, take the total system in a thermal state ρtotal=eβH/Z\rho_{\text{total}} = e^{-\beta H}/Z with C0=S(ρtotal)=βH+logZ>0C_0 = S(\rho_{\text{total}}) = \beta \langle H \rangle + \log Z > 0, and the Bell pair as a subsystem. The axiom structure is verified on the subsystem:

Rigor Assessment

Fully rigorous (established mathematics):

Rigorous given axioms:

Structural:

Assessment: The convergence results are mathematical facts — they show that the conditions of Axiom 1 are satisfied by quantum entropy, that the derived quantities match quantum information quantities, and that the coherence functional on states is uniquely determined. The promotion of S1 to a theorem depends on the composition constraint from the Born rule derivation, which is itself rigorous.

Open Gaps

  1. Quantitative C0C_0: The total coherence C0=S(ρtotal)C_0 = S(\rho_{\text{total}}) depends on the global state. Whether C0C_0 is a free parameter or determined by self-consistency (the bootstrap fixed point) remains open — see Bootstrap Mechanism Gap 3.
  2. Rényi generalization: Axiom 1 is also satisfied by Rényi entropies Sα(ρ)=(1α)1logTr(ρα)S_\alpha(\rho) = (1-\alpha)^{-1}\log\text{Tr}(\rho^\alpha) for α(0,1)\alpha \in (0,1) (strong subadditivity holds in this range). Whether the framework selects von Neumann (α=1\alpha = 1) over other Rényi orders, or whether this is a free choice, is open.
  3. Algebraic vs. measure-theoretic coherence: The σ-algebra formulation (Axiom 1, conditions C1–C5) and the Hilbert space formulation (C=ψψ\mathcal{C} = \langle\psi|\psi\rangle) operate at different levels. The former concerns subsystem structure; the latter concerns state-level properties. A unified formulation treating both as aspects of a single categorical structure (a functor from subsystem inclusions to coherence values) would strengthen the convergence.

Enables