Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| bib:albantakis2023computing [2026/01/28 08:10] – [3. What is “integrated information” here in plain QM terms?] kymki | bib:albantakis2023computing [2026/05/21 06:29] (current) – [Steelman] kymki | ||
|---|---|---|---|
| Line 17: | Line 17: | ||
| ===== BibTeX ===== | ===== BibTeX ===== | ||
| - | < | + | < |
| @article{albantakis2023computing, | @article{albantakis2023computing, | ||
| title={Computing the integrated information of a quantum mechanism}, | title={Computing the integrated information of a quantum mechanism}, | ||
| Line 29: | Line 29: | ||
| } | } | ||
| </ | </ | ||
| + | ====== Albantakis et al. (2023) — QM re-formulation and failure modes ====== | ||
| - | ===== One-paragraph summary (plain language) ===== | + | Im reading this with minimal background |
| - | The paper proposes a method (QIIT/ | + | |
| - | + | ||
| - | ===== Review ===== | + | |
| - | + | ||
| - | ==== What it claims to do (in operational terms) ==== | + | |
| - | * Defines quantum cause/ | + | |
| - | * Defines an “intrinsic difference” measure (QID) between a constrained repertoire and a baseline repertoire. | + | |
| - | * Defines “integration” by how much this measure drops under partitions (analogous to classical | + | |
| - | + | ||
| - | ==== What is genuinely new (vs renaming) ==== | + | |
| - | * A concrete proposal for handling entanglement when factorizing repertoires (nontrivial compared to classical product-factor approaches). | + | |
| - | * A specific divergence-like measure (QID) intended to encode “intrinsic” rather than channel-designer information. | + | |
| - | + | ||
| - | ==== What looks like repackaging / relabeling ==== | + | |
| - | * “Mechanism” ≈ chosen subsystem. | + | |
| - | * “Purview” ≈ another chosen subsystem. | + | |
| - | * “Repertoire” ≈ a (conditional / counterfactual) reduced density matrix. | + | |
| - | * The conceptual novelty is not in the quantum objects, but in the *interpretation* (intrinsic/ | + | |
| - | + | ||
| - | ==== Core conceptual friction points (physics-first critique) ==== | + | |
| - | - **Counterfactual noise injection: | + | |
| - | - **Factorization dependence: | + | |
| - | - **Unitary bias / measurement gap: | + | |
| - | - **Mixed-state ambiguity: | + | |
| - | + | ||
| - | ==== Where the prose risks misleading the reader ==== | + | |
| - | * Phrases like “the system knows” or “specifies information about itself” read like ontology, but the actual operations are: choose a partition, apply an intervention/ | + | |
| - | * The mathematical pipeline can be valid as a *defined metric*, but the paper’s language can make it sound like a derived physical necessity. | + | |
| - | + | ||
| - | ==== Strongest charitable | + | |
| - | The framework | + | |
| - | + | ||
| - | ==== Strongest skeptical reading ==== | + | |
| - | It is a rebranding of subsystem/ | + | |
| - | + | ||
| - | ===== Notes / excerpts ===== | + | |
| - | * (Add your own quotes here as you read.) | + | |
| - | * (Add page/ | + | |
| - | + | ||
| - | ===== Open questions to test the framework ===== | + | |
| - | - If you compute the quantity under two physically equivalent descriptions (different tensor factorizations / dilations), do you get the same “intrinsic” structure? | + | |
| - | - If you replace “maximally mixed noise” with a physically motivated environment state (thermal, constrained by energy), how stable are the results? | + | |
| - | - Does QID overemphasize top-eigenvalue behavior in ways that wash out phase-sensitive/ | + | |
| - | + | ||
| - | ====== Albantakis et al. (2023) — QM re-formulation and failure modes ====== | + | |
| - | This note rewrites the paper’s core definitions in standard quantum information language, and isolates where the framework is (i) pure relabeling, (ii) a specific intervention convention, and (iii) where it becomes physically ambiguous. | + | This note rewrites the paper’s core definitions in standard quantum information language, and tries to explain conclusions from that standpoint. The case may be that I completely miss subtleties due to not being well read in IIT. |
| ===== 1. Translation into standard QM language ===== | ===== 1. Translation into standard QM language ===== | ||
| Line 187: | Line 143: | ||
| ---- | ---- | ||
| - | ==== 3.1 Full (uncut) effective channel $\\mathcal{E}_{M\\to Z}$ ==== | + | ==== 3.1 Full (uncut) effective channel $\mathcal{E}_{M\to Z}$ ==== |
| Assume a global update map (channel) on the total system $Q$, | Assume a global update map (channel) on the total system $Q$, | ||
| Line 193: | Line 149: | ||
| T: | T: | ||
| $$ | $$ | ||
| - | and a factorization $\\mathcal{H}_Q=\\mathcal{H}_M\\otimes\\mathcal{H}_{M^0}$. | + | and a factorization $\mathcal{H}_Q=\mathcal{H}_M\otimes\mathcal{H}_{M^0}$. |
| Define the maximally mixed state on the complement of $M$: | Define the maximally mixed state on the complement of $M$: | ||
| $$ | $$ | ||
| - | \\rho^{\\mathrm{mm}}_{M^0} := \\frac{I_{M^0}}{\\dim(\\mathcal{H}_{M^0})}. | + | \rho^{\mathrm{mm}}_{M^0} := \frac{I_{M^0}}{\dim(\mathcal{H}_{M^0})}. |
| $$ | $$ | ||
| Then the “effect repertoire” is exactly the output of the induced (effective) channel | Then the “effect repertoire” is exactly the output of the induced (effective) channel | ||
| $$ | $$ | ||
| - | \\mathcal{E}_{M\\to Z}(X) | + | \mathcal{E}_{M\to Z}(X) |
| := | := | ||
| - | \\operatorname{tr}_{Z^0}\\Bigl(T\\bigl(X\\otimes\\rho^{\\mathrm{mm}}_{M^0}\\bigr)\\Bigr), | + | \operatorname{tr}_{Z^0}\Bigl(T\bigl(X\otimes\rho^{\mathrm{mm}}_{M^0}\bigr)\Bigr), |
| - | \\qquad | + | \qquad |
| - | \\pi_e(Z\\mid m) = \\mathcal{E}_{M\\to Z}(\\rho_M). | + | \pi_e(Z\mid m) = \mathcal{E}_{M\to Z}(\rho_M). |
| $$ | $$ | ||
| Line 213: | Line 169: | ||
| ---- | ---- | ||
| - | ==== 3.2 What a “partition” $\\theta$ is ==== | + | ==== 3.2 What a “partition” $\theta$ is ==== |
| - | A partition $\\theta$ is a rule that splits both $M$ and $Z$ into matched parts: | + | A partition $\theta$ is a rule that splits both $M$ and $Z$ into matched parts: |
| $$ | $$ | ||
| - | \\theta = \\{(M^{(i)}\\to Z^{(i)})\\}_{i=1}^k, | + | \theta = \{(M^{(i)}\to Z^{(i)})\}_{i=1}^k, |
| $$ | $$ | ||
| where the parts are disjoint and cover the sets: | where the parts are disjoint and cover the sets: | ||
| $$ | $$ | ||
| - | M = \\bigsqcup_{i=1}^k M^{(i)},\\qquad | + | M = \bigsqcup_{i=1}^k M^{(i)}, |
| - | Z = \\bigsqcup_{i=1}^k Z^{(i)}. | + | Z = \bigsqcup_{i=1}^k Z^{(i)}. |
| $$ | $$ | ||
| - | For example, if $M=\\{A,B\\}$ and $Z=\\{A,B\\}$ then a common bipartition is | + | For example, if $M=\{A,B\}$ and $Z=\{A,B\}$ then a common bipartition is |
| $$ | $$ | ||
| - | \\theta = (A\\to A)\\cup(B\\to B). | + | \theta = (A\to A)\cup(B\to B). |
| $$ | $$ | ||
| Line 238: | Line 194: | ||
| *when computing the output on each $Z^{(i)}$, all inputs not belonging to $M^{(i)}$ are replaced by maximally mixed noise.* | *when computing the output on each $Z^{(i)}$, all inputs not belonging to $M^{(i)}$ are replaced by maximally mixed noise.* | ||
| - | So cross-part influences $M^{(j)}\\to Z^{(i)}$ for $j\\neq i$ are disabled by construction. | + | So cross-part influences $M^{(j)}\to Z^{(i)}$ for $j\neq i$ are disabled by construction. |
| In QIT terms: you construct a family of **cut channels** where some inputs are fixed to $I/d$ instead of being allowed to carry state-dependent information. | In QIT terms: you construct a family of **cut channels** where some inputs are fixed to $I/d$ instead of being allowed to carry state-dependent information. | ||
| Line 248: | Line 204: | ||
| For each part $i$, define a local induced channel | For each part $i$, define a local induced channel | ||
| $$ | $$ | ||
| - | \\mathcal{E}^{(i)}_{M^{(i)}\\to Z^{(i)}}(X) | + | \mathcal{E}^{(i)}_{M^{(i)}\to Z^{(i)}}(X) |
| := | := | ||
| - | \\operatorname{tr}_{(Z^{(i)})^0}\\Bigl( | + | \operatorname{tr}_{(Z^{(i)})^0}\Bigl( |
| - | T\\bigl(X\\otimes | + | T\bigl(X\otimes \rho^{\mathrm{mm}}_{(M^{(i)})^0}\bigr) |
| - | \\Bigr), | + | \Bigr), |
| $$ | $$ | ||
| - | where $(M^{(i)})^0: | + | where $(M^{(i)})^0: |
| This is “feed only $M^{(i)}$ as an input; everything else is noise”. | This is “feed only $M^{(i)}$ as an input; everything else is noise”. | ||
| Then the partitioned (cut) effect repertoire is assembled as a product: | Then the partitioned (cut) effect repertoire is assembled as a product: | ||
| $$ | $$ | ||
| - | \\pi_e^{\\theta}(Z\\mid m) | + | \pi_e^{\theta}(Z\mid m) |
| - | \\;:=\\; | + | \;:=\; |
| - | \\bigotimes_{i=1}^k | + | \bigotimes_{i=1}^k |
| - | \\mathcal{E}^{(i)}_{M^{(i)}\\to Z^{(i)}}\\bigl(\\rho_{M^{(i)}}\\bigr). | + | \mathcal{E}^{(i)}_{M^{(i)}\to Z^{(i)}}\bigl(\rho_{M^{(i)}}\bigr). |
| $$ | $$ | ||
| Line 271: | Line 227: | ||
| ---- | ---- | ||
| - | ==== 3.5 The score $\\varphi(m, | + | ==== 3.5 The score $\varphi(m, |
| - | They define an “integrated information for this purview under this partition” as a divergence between: | + | Let (Q) be factorized and let |
| + | $$ | ||
| + | T:\mathcal B(\mathcal H_Q)\to\mathcal B(\mathcal H_Q)) | ||
| + | $$ | ||
| + | be the global channel. For a chosen **input subsystem** (M) prepared in state $\rho_M$, define the **intervention-induced effective channel** | ||
| + | $$ | ||
| - | - the full repertoire $\\pi_e(Z\\mid m)$, and | + | \mathcal E_{M\to Z}(X): |
| - | - the cut repertoire $\\pi_e^{\\theta}(Z\\mid m)$, | + | \qquad \tau_{M^0}:=\frac{I_{M^0}}{d_{M^0}}, |
| - | using their QID: | ||
| $$ | $$ | ||
| - | \\varphi(m,Z,\\theta) | + | and the corresponding **output marginal** on the chosen **target subsystem** |
| - | \\; | + | |
| - | \\mathrm{QID}\\Bigl(\\pi_e(Z\\mid m)\\; | + | |
| $$ | $$ | ||
| - | Crucial detail: with their maximally mixed baseline choice, QID behaves like a **max-eigenvalue-weighted** distinguishability, | + | \rho_Z:=\mathcal E_{M\to Z}(\rho_M). |
| - | (Their Eq. (31) writes this out in components and evaluates it at the maximizing eigenstate.) | + | $$ |
| - | ---- | + | For a partition $\theta\in\Theta(M, |
| + | $$ | ||
| + | |||
| + | \sigma_Z^{(\theta)}: | ||
| + | |||
| + | $$ | ||
| + | |||
| + | Now diagonalize | ||
| + | $$ | ||
| + | |||
| + | \rho_Z=\sum_i p_i|i\rangle\langle i|,\qquad | ||
| + | \sigma_Z^{(\theta)}=\sum_j q_j|j\rangle\langle j|, | ||
| + | \qquad | ||
| + | P_{ij}: | ||
| + | |||
| + | $$ | ||
| + | **Intrinsic (selected) effect state.** The paper first selects a particular eigenstate $|i^*\rangle$ of $\rho_Z$ via an “intrinsic information vs baseline” optimization. With their maximally mixed baseline on $Z$, this selection reduces to choosing the **largest-eigenvalue eigenvector** of $\rho_Z$ (or the corresponding eigenspace if degenerate). Call that chosen eigenstate | ||
| + | $$ | ||
| + | |||
| + | z' | ||
| + | |||
| + | $$ | ||
| + | **Integrated effect information for a partition.** The paper then evaluates its QID expression **at that chosen eigenstate** (not by maximizing over (i) again): | ||
| + | $$ | ||
| + | |||
| + | \phi_e(m, | ||
| + | : | ||
| + | = | ||
| + | |||
| + | p_{i^*}\left(\log p_{i^*}-\sum_j P_{i^*j}\log q_j\right). | ||
| + | |||
| + | $$ | ||
| + | Equivalently, | ||
| + | $$ | ||
| + | |||
| + | \bar q_{i^*}: | ||
| + | |||
| + | $$ | ||
| + | so that | ||
| + | $$ | ||
| + | |||
| + | \phi_e(m, | ||
| + | |||
| + | $$ | ||
| ==== 3.6 Optimization: | ==== 3.6 Optimization: | ||
| - | For each purview | + | For fixed $M,\rho_M$ and candidate target |
| $$ | $$ | ||
| - | \\theta^{\\star}(m,Z) | + | \theta'(m,Z) |
| - | \\;:=\\; | + | = |
| - | \\arg\\min_{\\theta}\\;\\varphi(m, | + | \arg\min_{\theta\in\Theta(M, |
| + | \frac{\phi_e(m,Z,\theta)}{\max_{T' | ||
| $$ | $$ | ||
| + | Here $\max*{T' | ||
| - | Then they define | + | Then the (reported) |
| $$ | $$ | ||
| - | \\varphi_e(m) | + | \phi_e(m,Z):=\phi_e(m,Z,\theta'(m,Z)). |
| - | \\;:=\\; | + | |
| - | \\max_{Z\\subseteq Q}\\; | + | |
| $$ | $$ | ||
| - | (They do an analogous construction for causes and combine them in the full IIT-style definition.) | + | Finally, the best target subsystem |
| + | $$ | ||
| + | Z^**e(m)=\arg\max*{Z\subseteq Q}\phi_e(m, | ||
| + | \qquad | ||
| + | \phi_e(m)=\max_{Z\subseteq Q}\phi_e(m, | ||
| + | $$ | ||
| - | ---- | ||
| - | ==== 3.7 Plain-language translation (no IIT terms) | + | ==== 3.7 Mathematics Framework |
| - | In standard | + | We assume a finite-dimensional composite system with a chosen tensor decomposition |
| + | $$ | ||
| + | \mathcal{H}_Q \cong \bigotimes_{i=1}^n \mathcal{H}_i, | ||
| + | $$ | ||
| + | and we pick subsets of indices to define subsystems: | ||
| + | $$ | ||
| + | M \subseteq \{1, | ||
| + | $$ | ||
| + | Write the complements as $M^0 := Q\setminus M$ and $Z^0 := Q\setminus Z$, so that | ||
| + | $$ | ||
| + | \mathcal{H}_Q \cong \mathcal{H}_M \otimes \mathcal{H}_{M^0} | ||
| + | \cong \mathcal{H}_Z \otimes \mathcal{H}_{Z^0}. | ||
| + | $$ | ||
| + | |||
| + | Let the global update be a quantum channel (CPTP map) | ||
| + | $$ | ||
| + | T: | ||
| + | $$ | ||
| + | often $T(\rho)=U\rho U^\dagger$ for a unitary $U$. | ||
| + | |||
| + | ----- | ||
| + | |||
| + | === Step 1: Build an embedding of a local state into the global system === | ||
| + | |||
| + | The input for the mechanism is its reduced density matrix $\rho_M\in\mathcal{D}(\mathcal{H}_M)$. | ||
| + | |||
| + | Define the maximally mixed state on the complement | ||
| + | $$ | ||
| + | \rho^{\mathrm{mm}}_{M^0} := \frac{I_{M^0}}{d_{M^0}}, | ||
| + | \qquad d_{M^0}: | ||
| + | $$ | ||
| + | |||
| + | Define the embedding map | ||
| + | $$ | ||
| + | \iota_M(\rho_M) := \rho_M \otimes \rho^{\mathrm{mm}}_{M^0} \in \mathcal{D}(\mathcal{H}_Q). | ||
| + | $$ | ||
| + | |||
| + | This is the first non-physical convention in the pipeline: it replaces whatever the actual state of $M^0$ is by $\rho^{\mathrm{mm}}_{M^0}$. | ||
| + | |||
| + | ----- | ||
| + | |||
| + | === Step 2: Evolve globally and reduce to the purview === | ||
| + | |||
| + | Evolve the embedded state: | ||
| + | $$ | ||
| + | \rho' | ||
| + | $$ | ||
| + | |||
| + | Reduce to the purview: | ||
| + | $$ | ||
| + | \rho' | ||
| + | $$ | ||
| + | |||
| + | This defines the induced channel from $M$ to $Z$: | ||
| + | $$ | ||
| + | \mathcal{E}_{M\to Z}(X) | ||
| + | := | ||
| + | \operatorname{tr}_{Z^0}\!\left(T\left(X\otimes \rho^{\mathrm{mm}}_{M^0}\right)\right), | ||
| + | \qquad | ||
| + | \rho' | ||
| + | $$ | ||
| + | |||
| + | So far, everything is standard QIT manipulation (tensoring, channel application, | ||
| + | |||
| + | ----- | ||
| + | |||
| + | === Step 3: Define a partition $\theta$ and the corresponding cut construction === | ||
| + | |||
| + | A partition $\theta$ | ||
| + | $$ | ||
| + | \theta=\{(M^{(i)}\to Z^{(i)})\}_{i=1}^k, | ||
| + | $$ | ||
| + | with disjoint unions | ||
| + | $$ | ||
| + | M=\bigsqcup_{i=1}^k M^{(i)}, | ||
| + | Z=\bigsqcup_{i=1}^k Z^{(i)}. | ||
| + | $$ | ||
| + | |||
| + | For each part $i$, define an embedding that treats everything outside $M^{(i)}$ as maximally mixed: | ||
| + | $$ | ||
| + | \iota_{M^{(i)}}(X) := X \otimes \rho^{\mathrm{mm}}_{(M^{(i)})^0}, | ||
| + | \qquad (M^{(i)})^0 := Q\setminus M^{(i)}. | ||
| + | $$ | ||
| + | |||
| + | Define a local induced channel for that part: | ||
| + | $$ | ||
| + | \mathcal{E}^{(i)}_{M^{(i)}\to Z^{(i)}}(X) | ||
| + | := | ||
| + | \operatorname{tr}_{(Z^{(i)})^0}\!\left( | ||
| + | T\left(\iota_{M^{(i)}}(X)\right) | ||
| + | \right), | ||
| + | \qquad (Z^{(i)})^0 := Q\setminus Z^{(i)}. | ||
| + | $$ | ||
| + | |||
| + | Then define the cut (partitioned) output on the whole purview as the tensor product: | ||
| + | $$ | ||
| + | \mathcal{E}^{(\theta)}_{M\to Z}(\rho_M) | ||
| + | := | ||
| + | \bigotimes_{i=1}^k | ||
| + | \mathcal{E}^{(i)}_{M^{(i)}\to Z^{(i)}}(\rho_{M^{(i)}}), | ||
| + | $$ | ||
| + | and correspondingly | ||
| + | $$ | ||
| + | \rho^{(\theta)}_Z := \mathcal{E}^{(\theta)}_{M\to Z}(\rho_M). | ||
| + | $$ | ||
| + | |||
| + | Interpretation: | ||
| + | |||
| + | (For multi-node purviews, the paper may add an entanglement-clustering step before taking the product; the above is the clean “channel surgery” skeleton.) | ||
| + | |||
| + | ----- | ||
| + | |||
| + | === Step 4: Define the distance between full and cut outputs using QID === | ||
| + | |||
| + | Let | ||
| + | $$ | ||
| + | \rho_Z := \mathcal{E}_{M\to Z}(\rho_M), | ||
| + | \qquad | ||
| + | \rho^{(\theta)}_Z := \mathcal{E}^{(\theta)}_{M\to Z}(\rho_M). | ||
| + | $$ | ||
| + | |||
| + | Their divergence is QID. A practical way to present it is: | ||
| + | diagonalize both states | ||
| + | $$ | ||
| + | \rho_Z=\sum_i p_i |i\rangle\langle i|, | ||
| + | \qquad | ||
| + | \rho^{(\theta)}_Z=\sum_j q_j |j\rangle\langle j|, | ||
| + | $$ | ||
| + | and define overlaps | ||
| + | $$ | ||
| + | P_{ij}: | ||
| + | $$ | ||
| + | |||
| + | Then | ||
| + | $$ | ||
| + | \mathrm{QID}(\rho_Z\Vert \rho^{(\theta)}_Z) | ||
| + | = | ||
| + | \max_i\, p_i\left(\log p_i - \sum_j P_{ij}\log q_j\right). | ||
| + | $$ | ||
| + | |||
| + | They also define an associated maximizing eigenstate (or eigenspace if degenerate) by selecting the index that achieves the maximum: | ||
| + | $$ | ||
| + | i^\star \in \arg\max_i\, | ||
| + | \qquad | ||
| + | z_e^0 := |i^\star\rangle. | ||
| + | $$ | ||
| + | |||
| + | The partition score is then | ||
| + | $$ | ||
| + | \varphi(m, | ||
| + | := | ||
| + | \mathrm{QID}(\rho_Z\Vert \rho^{(\theta)}_Z). | ||
| + | $$ | ||
| + | |||
| + | ----- | ||
| + | |||
| + | === Step 5: Optimize over partitions and purviews === | ||
| + | |||
| + | For a fixed purview $Z$, define the minimizing partition (minimum-information partition): | ||
| + | $$ | ||
| + | \theta^\star(m, | ||
| + | $$ | ||
| + | and the corresponding value | ||
| + | $$ | ||
| + | \varphi(m, | ||
| + | $$ | ||
| + | |||
| + | Then select the purview that maximizes this value: | ||
| + | $$ | ||
| + | Z^\star(m) \in \arg\max_{Z\subseteq Q} \varphi(m, | ||
| + | \qquad | ||
| + | \varphi_e(m) | ||
| + | $$ | ||
| - | | + | This completes the effect-side pipeline: full induced channel |
| - | - Derive an induced channel | + | |
| - | * embedding the input $\\rho_M$ into $Q$, | + | |
| - | * fixing the complement $M^0$ to $I/d$, | + | |
| - | * evolving with $T$, | + | |
| - | * tracing down to $Z$. | + | |
| - | - For every partition $\\theta$ of inputs/ | + | |
| - | - Score how different the output of the full channel | + | |
| - | - Define “integration” as the **least** difference | + | |
| - | This is not a theorem of quantum mechanics; it is a specific *counterfactual channel surgery* definition used to quantify “how non-factorizable | + | The key point is that the construction depends |
| ===== 4. Where this is “just QM” vs where it stops being physics ===== | ===== 4. Where this is “just QM” vs where it stops being physics ===== | ||
| Line 387: | Line 556: | ||
| ==== Steelman ==== | ==== Steelman ==== | ||
| - | As a *defined* counterfactual causal attribution scheme, the framework: | + | As a defined counterfactual causal attribution scheme, the framework: |
| * avoids common-cause “spurious correlation” effects by construction (their COPY-XOR/ | * avoids common-cause “spurious correlation” effects by construction (their COPY-XOR/ | ||
| * tries to preserve entanglement-generated correlations by clustering entanglement before taking products, | * tries to preserve entanglement-generated correlations by clustering entanglement before taking products, | ||