This is an old revision of the document!
Albantakis, Prentner & Durham (2023) — Computing the integrated information of a quantum mechanism
Quick metadata
| Key | albantakis2023computing |
| Type | Journal article |
| Journal | Entropy |
| Year | 2023 |
| Volume / Issue | 25 / 3 |
| Pages | 449 |
| Publisher | MDPI |
| Status | Review in progress |
| Topics | IIT, QIIT, quantum foundations, causality, information measures |
Citation
Albantakis, Larissa; Prentner, Robert; Durham, Ian. (2023).
Computing the integrated information of a quantum mechanism. Entropy 25(3):449.
BibTeX
@article{albantakis2023computing, title={Computing the integrated information of a quantum mechanism}, author={Albantakis, Larissa and Prentner, Robert and Durham, Ian}, journal={Entropy}, volume={25}, number={3}, pages={449}, year={2023}, publisher={MDPI} }
One-paragraph summary (plain language)
The paper proposes a method (QIIT/QIIT-like) to compute “integrated information” for quantum systems by defining cause/effect “repertoires” using density matrices and comparing them to a baseline “chance” state. It adapts classical IIT machinery (mechanism/purview partitions, counterfactual noise injection, integration via partitioning) into a quantum formalism.
Review
What it claims to do (in operational terms)
- Defines quantum cause/effect repertoires for a chosen “mechanism” and “purview”.
- Defines an “intrinsic difference” measure (QID) between a constrained repertoire and a baseline repertoire.
- Defines “integration” by how much this measure drops under partitions (analogous to classical IIT).
What is genuinely new (vs renaming)
- A concrete proposal for handling entanglement when factorizing repertoires (nontrivial compared to classical product-factor approaches).
- A specific divergence-like measure (QID) intended to encode “intrinsic” rather than channel-designer information.
What looks like repackaging / relabeling
- “Mechanism” ≈ chosen subsystem.
- “Purview” ≈ another chosen subsystem.
- “Repertoire” ≈ a (conditional / counterfactual) reduced density matrix.
- The conceptual novelty is not in the quantum objects, but in the *interpretation* (intrinsic/self-specifying) and the *intervention rule*.
Core conceptual friction points (physics-first critique)
- Counterfactual noise injection: The method “disconnects” everything outside the mechanism by replacing it with maximally mixed noise. This is not a physical open-system approximation; it is a *chosen intervention rule*. Any “intrinsic” claims depend on this convention.
- Factorization dependence: Results depend on the choice of subsystem decomposition (“units”). In quantum theory, factorization is not always unique or physically privileged; if the computed structure changes under refactorization, it’s hard to call it intrinsic.
- Unitary bias / measurement gap: The clean formalism largely lives in unitary evolution; measurement/non-unitary updates create ambiguity for “cause” directionality and can become interpretation-dependent.
- Mixed-state ambiguity: A density matrix can represent ignorance (epistemic) or a reduced state from entanglement (ontic-but-subsystem). The framework’s language often slides between these readings.
Where the prose risks misleading the reader
- Phrases like “the system knows” or “specifies information about itself” read like ontology, but the actual operations are: choose a partition, apply an intervention/noise rule, compute a state, compute a divergence, pick a maximizing element.
- The mathematical pipeline can be valid as a *defined metric*, but the paper’s language can make it sound like a derived physical necessity.
Strongest charitable reading
The framework is a proposed *measure of “how concentrated and partition-resistant” a mechanism’s counterfactually-defined influence is* under a particular intervention scheme. It is a formal extension of IIT-style attribution to density matrices.
Strongest skeptical reading
It is a rebranding of subsystem/channel calculations with a heavy interpretive layer. “Intrinsic” properties are not shown to be invariant under factorization, interpretation of measurement, or physically constrained interventions—so the ontological talk outruns what the formalism guarantees.
Notes / excerpts
- (Add your own quotes here as you read.)
- (Add page/section pointers you want to revisit.)
Open questions to test the framework
- If you compute the quantity under two physically equivalent descriptions (different tensor factorizations / dilations), do you get the same “intrinsic” structure?
- If you replace “maximally mixed noise” with a physically motivated environment state (thermal, constrained by energy), how stable are the results?
- Does QID overemphasize top-eigenvalue behavior in ways that wash out phase-sensitive/coherence structure you’d expect to matter?
Albantakis et al. (2023) — QM re-formulation and failure modes
This note rewrites the paper’s core definitions in standard quantum information language, and isolates where the framework is (i) pure relabeling, (ii) a specific intervention convention, and (iii) where it becomes physically ambiguous.
1. Translation into standard QM language
Let the total system be a finite-dimensional composite Hilbert space $$ \mathcal{H}_Q \;=\; \mathcal{H}_M \otimes \mathcal{H}_{M^0}, $$ where $M$ is the chosen subsystem (in the paper refered to as the “mechanism”, which implies “set of actions” more than “set of states”) and $M^0 := Q\setminus M$ is its complement. Let the system update $T$ be a completely positive, trace preserving (CPTP) map (often unitary in their examples) $$ T:\mathcal{B}(\mathcal{H}_Q) \to \mathcal{B}(\mathcal{H}_Q). $$
For a chosen subsystem $Z \subseteq Q$, write $Z^0 := Q\setminus Z$. This is called a “purview” in the paper. It is just the particular subsystem of states chosen.
1.1 Their “effect repertoire” is a channel with a fixed environment state
They define (single-node case, or single qubit if we discard the node terminology) the “effect repertoire” as $$ \pi_e(Z_i \mid m) \;=\; \rho_{Z_i \mid m,\, t+1} \;=\; \operatorname{tr}_{Z_i^0}\!\Bigl(T\bigl(\rho_M \otimes \rho^{\mathrm{mm}}_{M^0}\bigr)\Bigr), $$ where $\rho^{\mathrm{mm}}_{M^0} = I_{M^0}/\dim(\mathcal{H}_{M^0})$ is maximally mixed.
In standard QIT terms: this is simply the output of the effective channel $$ \mathcal{E}_{M\to Z}(X) \;:=\; \operatorname{tr}_{Z^0}\!\Bigl(T\bigl(X \otimes \rho^{\mathrm{mm}}_{M^0}\bigr)\Bigr), \qquad \pi_e(Z\mid m) = \mathcal{E}_{M\to Z}(\rho_M). $$
So “mechanism/purview/effect repertoire” = “pick subsystems + apply a derived channel + take a reduced state”.
This is mostly renaming as far as I can tell.
The nontrivial part is the choice $\rho_{M^0}\mapsto \rho^{\mathrm{mm}}_{M^0}$, i.e. replacing the outside world with maximally mixed “noise”.
1.2 Their multi-node “discount extraneous correlations” step is an entanglement-cluster factorization
For $|Z|>1$ they first form $$ \rho_{Z\mid m,\,t+1} := \operatorname{tr}_{Z^0}\!\Bigl(T\bigl(\rho_M \otimes \rho^{\mathrm{mm}}_{M^0}\bigr)\Bigr) $$ and then define a “maximal separability partition” $P^*(\rho_{Z\mid m,t+1})$ into subsets $Z^{(1)},\dots,Z^{(r^*)}$ that are internally entangled but not mutually entangled, and set $$ \pi_e(Z\mid m) \;:=\; \bigotimes_{i=1}^{r^*}\rho_{Z^{(i)}\mid m,\,t+1}. $$ This is their Definition 4 / Eq. (23).
This is equivalent to: “factor the chosen subset of states into entanglement clusters, then take a product state across clusters”, which removes correlations they label “extraneous classical correlations”.
Finding multipartite mixed-state entanglement structure is nontrivial.
2. Their information measure is a max-eigenvalue score against a maximally mixed baseline
They define quantum intrinsic difference (QID) as $$ \mathrm{QID}(\rho\Vert\sigma) \;:=\; \max_i \; p_i \left(\log p_i - \sum_j P_{ij}\log q_j\right), $$ where $\rho=\sum_i p_i|i\rangle\langle i|$, $\sigma=\sum_j q_j|j\rangle\langle j|$, and $P_{ij}=\langle i|j\rangle\langle j|i\rangle$.
They then set the “unconstrained” baseline to maximally mixed: $$ \sigma = \pi_e(Z) = \rho_Z^{\mathrm{mm}} = I_Z/d,\qquad d=\dim(\mathcal{H}_Z). $$
With $\sigma=I_Z/d$ one gets $q_j=1/d$ and $\sum_j P_{ij}\log q_j = \log(1/d)=-\log d$, hence $$ \mathrm{QID}\bigl(\rho\Vert I_Z/d\bigr) = \max_i \; p_i(\log p_i + \log d). $$
So, for their chosen baseline, QID depends only on the eigenvalues of $\rho$ and is maximized at the largest eigenvalue $p_{\max}$.
That is why they conclude the “intrinsic effect state” is $$ z_e^0(m,Z)=\arg\max_i\;p_i(\cdots) \quad\Rightarrow\quad z_e^0 \text{ is the eigenvector of } \pi_e(Z\mid m) \text{ with largest eigenvalue.} $$
They explicitly note: if $\pi_e(Z\mid m)$ is mixed, then $z_e^0 \neq \pi_e(Z\mid m)$.
This is the novelty of the paper.
3. What is “integrated information” here in plain QM terms?
Given a partition $\theta$ that “cuts” ?? connections from parts of $M$ to parts of $Z$, they construct a partitioned repertoire $\pi_e^\theta(Z\mid m)$ (also using maximally mixed noise for missing inputs), and evaluate $$ \varphi(m,Z,\theta) \;\equiv\; \mathrm{QID}\bigl(\pi_e(Z\mid m)\Vert \pi_e^\theta(Z\mid m)\bigr) \quad\text{but only evaluated on the maximizing eigenstate } z_e^0. $$ (Their Eq. (31) is this in components.)
Then they minimize over partitions to get a “minimum information partition” (MIP) and maximize over purviews $Z$.
In standard language:
- define a family of cut channels $\mathcal{E}_{M\to Z}^{(\theta)}$ by replacing some inputs with maximally mixed noise,
- compare the output state of the original channel vs cut channel by a max-eigenvalue-weighted relative-entropy-like score,
- take min/max over partitions/purviews.
4. Where this is “just QM” vs where it stops being physics
4.1 What is genuinely standard (QM/QIT)
- partial trace and reduced states,
- CPTP maps / unitary channels,
- maximally mixed state as a reference,
- relative entropy and related divergences as distinguishability measures.
4.2 What is *not* standard QM (it is an IIT intervention convention)
Key point: the replacement $\rho_{M^0}\mapsto \rho^{\mathrm{mm}}_{M^0}$ is not “neglecting the environment” in the open-systems sense. It is defining a counterfactual: “what would happen if everything outside $M$ were randomized”.
This is a choice of *causal attribution rule*.
A physically evolved reduced state of $Z$ would be $$ \rho^{\mathrm{phys}}_{Z,t+1} = \operatorname{tr}_{Z^0}\!\bigl(T(\rho_{Q,t})\bigr), $$ with the actual joint state $\rho_{Q,t}$ (including correlations/entanglement). Their repertoire is instead $$ \rho^{\mathrm{IIT}}_{Z,t+1} = \operatorname{tr}_{Z^0}\!\Bigl(T\bigl(\rho_M \otimes \rho^{\mathrm{mm}}_{M^0}\bigr)\Bigr), $$ which equals $\rho^{\mathrm{phys}}_{Z,t+1}$ only if the actual joint state factorizes as $\rho_{Q,t}=\rho_M\otimes \rho^{\mathrm{mm}}_{M^0}$ (or if the dynamics makes $Z$ independent of $M^0$).
So: their pipeline is not deriving a physical prediction; it is defining a counterfactual dependence score.
4.3 Energy conservation / Hamiltonian structure is not enforced
Even if $T(\cdot)=U(\cdot)U^\dagger$ is unitary, nothing in the construction requires $U=e^{-iHt}$ for a specified Hamiltonian, nor does it enforce conservation constraints under the intervention $\rho_{M^0}\mapsto I/d$.
If you interpret $M^0$ as a physical environment with a physical state (e.g. thermal), replacing it by maximally mixed corresponds to an “infinite temperature” reference and can inject or remove energy relative to that environment model. The framework avoids this by treating the replacement as a formal intervention, not a physical process. Still, the connection to the evolution of physical processes is what this paper, I think, is trying to evaluate.
4.4 The “entanglement-cluster product” step is a major weak link for mixed states
They require a decomposition of mixed states to define $P^*(\rho)$ and acknowledge that identifying multipartite mixed-state entanglement structure is hard and not fully solved in general. So for nontrivial mixed states, the factorization step can be:
- non-unique,
- computationally hard,
- sensitive to the chosen criterion of separability.
4.5 The measure collapses “effect content” to a single eigenvector
With the maximally mixed baseline, QID reduces to a function of eigenvalues and the “intrinsic effect” is the top-eigenvalue eigenvector.
That means:
- the framework’s “content” is often not the full density matrix $\rho$,
- phase-sensitive/coherence structure matters only insofar as it affects the spectrum (or the later partitioning step),
- for mixed repertoires, “what the mechanism specifies” becomes “the most probable eigenstate”, not the full state.
This is a deliberate design choice (“specificity”), but it is not forced by QM.
4.6 Built-in causal asymmetry even under unitary dynamics
They explicitly note an asymmetry: the product structure is imposed on parts of $\rho_M$ for causes, not on $\rho_{Z|m}$, yielding a time-asymmetry in “causes vs effects” even when the underlying dynamics is unitary/reversible.
5. Steelman vs critique (short)
Steelman
As a *defined* counterfactual causal attribution scheme, the framework:
- avoids common-cause “spurious correlation” effects by construction (their COPY-XOR/CNOT motivation),
- tries to preserve entanglement-generated correlations by clustering entanglement before taking products,
- can differentiate internal structure of multipartite entanglement classes (e.g. GHZ vs W under $T=I$) in a compositional way.
Critique
If read as physics (rather than as a convention):
- the central “disconnect via maximally mixed noise” is not physically derived and ignores real correlations/entanglement with $M^0$,
- conservation laws and Hamiltonian constraints are not part of the axioms,
- mixed-state entanglement factorization is a weak/ill-defined step in general,
- the “intrinsic effect” reduces to a top-eigenvector selection, discarding much of the state’s structure in mixed cases.
6. Minimal “one-line” summary
Mathematically, the paper computes a max-eigenvalue-weighted distinguishability between the output of a derived channel $$ \mathcal{E}_{M\to Z}(X)=\operatorname{tr}_{Z^0}(T(X\otimes I/d)) $$ and the output of its cut/partitioned variants, after optionally factorizing $Z$ into entanglement clusters.
Everything else is terminology.