6.3.2 POVM#
Prompts
What is a POVM, and how does it generalize projective measurement? Why can a POVM have more outcomes than the Hilbert space dimension?
Explain unambiguous state discrimination: how can a three-outcome POVM distinguish non-orthogonal states with certainty (when it gives a definite answer)?
State Naimark’s theorem. What does it mean physically that every POVM can be realized as a projective measurement on a larger system?
Give an example of a POVM that is not a projective measurement and explain what advantage it provides.
Lecture Notes#
Overview#
A key limitation of projective measurements is that they cannot reliably distinguish non-orthogonal quantum states. If we relax the requirement that measurement operators be projectors and allow general positive operators—as long as they sum to the identity—we gain flexibility to improve state discrimination. This generalization defines positive operator-valued measures (POVMs).
Why We Need POVMs: Beyond Projectors#
Projective measurements require outcomes to correspond to orthogonal eigenspaces. In quantum information and state discrimination tasks, this orthogonality constraint is often too restrictive. POVMs relax this requirement: measurement operators need only be positive semidefinite and sum to identity, enabling measurements that gain information from non-orthogonal states.
POVM Definition#
A positive operator-valued measure (POVM) is a set of operators \(\{\hat{M}_1, \hat{M}_2, \ldots, \hat{M}_n\}\) on Hilbert space \(\mathcal{H}\) satisfying two conditions:
1. Positive semidefiniteness:
2. Completeness:
These conditions ensure that measurement probabilities are non-negative and sum to one.
Measurement with a POVM#
If a system in state \(\hat{\rho}\) is measured using POVM \(\{\hat{M}_i\}\):
Outcome probability:
Post-measurement state:
where \(\sqrt{\hat{M}_i}\) is the unique positive square root (defined via spectral decomposition: \(\hat{M}_i = \sum_j \lambda_{i,j} \vert v_{i,j}\rangle\langle v_{i,j}\vert \Rightarrow \sqrt{\hat{M}_i} = \sum_j \sqrt{\lambda_{i,j}} \vert v_{i,j}\rangle\langle v_{i,j}\vert\)).
Normalization check: \(\sum_i p_i = \sum_i \operatorname{Tr}(\hat{M}_i \hat{\rho}) = \operatorname{Tr}(I \hat{\rho}) = 1\) ✓
Discussion: more outcomes than dimensions
A POVM can have more outcomes than the Hilbert space dimension (\(n > \dim(\mathcal{H})\)). Why is this impossible for projective measurements? What does this overcomplete structure enable?
Poll: POVM vs. projective measurement
A POVM is a set of operators \(\{\hat{E}_i\}\) satisfying \(\sum_i \hat{E}_i = \hat{I}\) and \(\hat{E}_i \geq 0\), but the \(\hat{E}_i\) need not be projectors. What is a key advantage of POVMs over projective measurements?
(A) POVMs are simpler to implement experimentally.
(B) POVMs can distinguish between non-orthogonal states, whereas projective measurements cannot.
(C) POVMs require fewer measurement outcomes than projective measurements.
(D) POVMs provide perfect information about the state, whereas projective measurements do not.
POVM vs. Projective Measurement#
A projective measurement is a special case of a POVM where elements are orthogonal projectors \(\hat{M}_i = \hat{P}_i\) with:
\(\hat{P}_i^2 = \hat{P}_i\) (idempotent)
\(\hat{P}_i \hat{P}_j = \delta_{ij} \hat{P}_i\) (orthogonal)
\(\sum_i \hat{P}_i = I\) (complete)
Key differences:
Property |
Projective Measurement |
General POVM |
|---|---|---|
Outcomes |
Limited to \(d\) (Hilbert space dimension) |
Can have \(n > d\) (overcomplete) |
Orthogonality |
\(\hat{M}_i \hat{M}_j = 0\) for \(i \neq j\) |
Elements can overlap: \(\hat{M}_i \hat{M}_j \neq 0\) |
Information |
Binary outcome per qubit |
More outcomes = more information per measurement |
Rank |
Rank-1 projectors |
Can be rank-1 or higher |
Physical meaning of outcome |
Eigenvalue of measured observable |
Index number (no direct physical meaning) |
Information trade-off: General POVMs extract more information about the system by allowing non-orthogonal measurement elements, at the cost of less definite outcome meaning.
Unambiguous State Discrimination#
One of the most striking applications of POVMs: distinguishing non-orthogonal quantum states with certainty when a definite answer is given.
Problem: Given a qubit in one of two non-orthogonal states \(\vert \psi_1\rangle\) or \(\vert \psi_2\rangle\) with overlap \(\langle\psi_1\vert \psi_2\rangle = c\) (where \(0 < \vert c\vert < 1\)), can you determine which state with certainty (never make an error)?
Projective measurement: No. Non-orthogonal states have overlapping eigenspaces, so any projective measurement will sometimes give wrong answers.
POVM solution: Yes, with a trade-off. Use a three-outcome POVM:
Outcome 1: “State is \(\vert \psi_1\rangle\)” (definite, no error)
Outcome 2: “State is \(\vert \psi_2\rangle\)” (definite, no error)
Outcome 3: “Inconclusive” (try again or use different strategy)
Construction: Unambiguous State Discrimination POVM
For two states \(\vert \psi_1\rangle, \vert \psi_2\rangle\) with overlap \(c = \langle\psi_1\vert \psi_2\rangle\) where \(0 < \vert c\vert < 1\):
Define POVM elements:
Key property: If the state is \(\vert \psi_1\rangle\):
\(\langle\psi_1\vert \hat{M}_2\vert \psi_1\rangle = 0\) (outcome 2 never occurs)
Outcomes 1 and 3 occur with probabilities \(1-\vert c|^2\) and \(\vert c|^2\) respectively
Similarly, if state is \(\vert \psi_2\rangle\): outcome 1 never occurs. This guarantees no false identifications.
Naimark’s Theorem: POVM as Extended Projective Measurement#
Every POVM on a Hilbert space can be realized as a projective measurement on an extended Hilbert space (original space plus auxiliary system).
Naimark’s Theorem
Every POVM \(\{\hat{M}_i\}\) on \(\mathcal{H}\) can be realized as a projective measurement on an extended space \(\mathcal{H} \otimes \mathcal{H}_\text{aux}\). Specifically, there exist:
An auxiliary (ancilla) Hilbert space \(\mathcal{H}_\text{aux}\)
Orthogonal projectors \(\{\hat{P}_i\}\) on \(\mathcal{H} \otimes \mathcal{H}_\text{aux}\)
such that:
Physical picture: Entangle the system with an ancilla, perform a projective measurement on the combined system, and trace out (discard) the ancilla measurement outcome.
This theorem reveals a profound truth: POVMs are not fundamentally more general than projective measurements—they are equivalent to projective measurements on a larger system. The apparent “softness” of a POVM comes from not observing (or not having access to) the ancilla measurement outcome.
Examples#
Example: Six-State Octahedral IC-POVM on a Qubit
One convenient informationally complete POVM for a qubit uses the six eigenstates of \(\hat{\sigma}^{x}\), \(\hat{\sigma}^{y}\), and \(\hat{\sigma}^{z}\). These states sit at the vertices of a regular octahedron on the Bloch sphere. Define
This is a valid POVM because each Pauli axis contributes a pair of opposite projectors summing to \(\hat{I}\):
The six probabilities are informationally complete: they contain enough data to reconstruct the Bloch vector of any qubit state. This octahedral POVM is therefore useful for tomography from three mutually unbiased bases.
A different standard construction is the qubit SIC-POVM, which uses four tetrahedral states (the minimal number \(d^{2}=4\) for informational completeness in dimension \(d=2\)). Both are IC-POVMs; the octahedral version is overcomplete, while the SIC-POVM is minimal and symmetric.
Example: Heterodyne Detection (Optical POVM)
In quantum optics, measuring quadrature amplitudes (amplitude and phase of a light field) is described by a continuous POVM:
where \(\vert \alpha\rangle\) are coherent states indexed by complex amplitude \(\alpha \in \mathbb{C}\).
Properties:
Continuous outcome space (not discrete)
Coherent states are highly non-orthogonal: \(\langle\alpha\vert \beta\rangle = \mathrm{e}^{-|\alpha-\beta|^2/2} \neq 0\) for \(\alpha \neq \beta\)
Enables full quantum tomography of optical field states
Summary#
POVM definition: Positive operator-valued measures \(\{E_m\}\) satisfy \(E_m \geq 0\) and \(\sum_m E_m = I\), but are not required to be orthogonal or satisfy \(E_m^2 = E_m\) (unlike projectors).
Measurement probabilities: \(p_m = \operatorname{Tr}(E_m \hat{\rho})\) (same formula as projective), but post-measurement state is not determined by POVM alone—different physical realizations can give same outcome statistics.
Advantages over projectors: More general; one outcome can have multiple post-measurement states (depends on probe details); allows finer-grained outcome statistics with fewer outcomes than dimension of Hilbert space.
Unambiguous state discrimination: POVM enables non-projective measurements that sometimes identify unknown quantum states with zero ambiguity (no error-free projective equivalent exists).
See Also
6.3.1 Projective Measurement: Projection-valued measures as the special case of POVMs.
6.3.3 Quantum Channels: CPTP maps and instruments that realize POVM outcomes with auxiliary systems.
1.2.3 Measurement Operators: Effects, Kraus rank-one updates, and generalized outcomes.
Homework#
1. Projective measurement and POVM. Show that any projective measurement \(\{P_i\}\) (with \(P_i^2 = P_i\), \(P_i P_j = \delta_{ij}P_i\), \(\sum_i P_i = \hat{I}\)) is a special case of a POVM by setting \(E_i = P_i\). Verify the POVM conditions.
2. Trine POVM on qubits. Consider a trine POVM on a qubit: \(E_k = \frac{2}{3}\vert \phi_k\rangle\langle\phi_k\vert\) for \(k=0,1,2\), where \(\vert \phi_0\rangle = \vert 0\rangle\), \(\vert \phi_1\rangle = -\frac{1}{2}\vert 0\rangle + \frac{\sqrt{3}}{2}\vert 1\rangle\), \(\vert \phi_2\rangle = -\frac{1}{2}\vert 0\rangle - \frac{\sqrt{3}}{2}\vert 1\rangle\). Verify that \(\sum_{k=0}^2 E_k = \hat{I}\).
3. Unambiguous state discrimination. (Unambiguous state discrimination) Consider two non-orthogonal states \(\vert \psi_1\rangle\) and \(\vert \psi_2\rangle\) with \(\langle\psi_1\vert \psi_2\rangle = s \neq 0\). A POVM for unambiguous discrimination has three outcomes: “\(\psi_1\)”, “\(\psi_2\)”, and “?” (inconclusive). Explain why this task is impossible with a projective measurement if \(s \neq 0\), but possible with a POVM. What condition must the POVM elements satisfy to make discrimination unambiguous when a definite answer is given?
4. POVM state discrimination. Construct an explicit POVM for unambiguous discrimination of \(\vert 0\rangle\) and \(\vert +\rangle = \frac{1}{\sqrt{2}}(\vert 0\rangle + \vert 1\rangle)\). Find POVM elements \(E_1\) (identifies \(\vert 0\rangle\)), \(E_2\) (identifies \(\vert +\rangle\)), and \(E_?\) (inconclusive) that satisfy all POVM conditions. What is the probability of an inconclusive result for each state?
5. Naimark dilation. (Naimark dilation) Any POVM \(\{E_i\}\) on \(\mathcal{H}\) can be realized as a projective measurement on a larger Hilbert space \(\mathcal{H} \otimes \mathcal{K}\): introduce an ancilla in state \(\vert 0\rangle_\mathcal{K}\), perform a unitary \(U\) on \(\mathcal{H} \otimes \mathcal{K}\), then measure the ancilla. Show that if \(U(\vert \psi\rangle \otimes \vert 0\rangle) = \sum_i (\sqrt{E_i}\vert \psi\rangle) \otimes \vert i\rangle_\mathcal{K}\), then the probability of outcome \(i\) is \(\mathrm{Tr}[E_i \vert \psi\rangle\langle\psi\vert]\).
6. Informationally complete POVM. An informationally complete POVM (IC-POVM) has enough elements to uniquely determine any density matrix \(\hat{\rho}\) from the measurement statistics \(\{p_i = \mathrm{Tr}[E_i \hat{\rho}]\}\). For a \(d\)-dimensional system, what is the minimum number of POVM elements needed for an IC-POVM? Justify your answer by counting the degrees of freedom in \(\hat{\rho}\).
7. POVM outcome probabilities. For a POVM with elements \(\{E_i\}\), the probability of outcome \(i\) on mixed state \(\hat{\rho}\) is \(p_i = \mathrm{Tr}[E_i \hat{\rho}]\). Show that if \(\hat{\rho} = \sum_k q_k \hat{\rho}_k\) is a convex mixture, then \(p_i = \sum_k q_k \mathrm{Tr}[E_i \hat{\rho}_k]\). This shows POVM statistics are linear in the state.
8. POVM vs projective measurement. Compare the information gain of a POVM vs. a projective measurement. For a qubit initially in an unknown state on the Bloch sphere: (a) A projective measurement gives binary outcomes. How much classical information (in bits) does it convey? (b) A 4-element symmetric IC-POVM (SIC-POVM) with \(E_k = \frac{1}{2}\vert \phi_k\rangle\langle\phi_k\vert\) for 4 tetrahedral states. Explain qualitatively why this conveys more information about \(\hat{\rho}\) per measurement than a projective measurement.