6.3.2 POVM#

Prompts

  • What is a POVM, and how does it generalize projective measurement? Why can a POVM have more outcomes than the Hilbert space dimension?

  • Explain unambiguous state discrimination: how can a three-outcome POVM distinguish non-orthogonal states with certainty (when it gives a definite answer)?

  • State Naimark’s theorem. What does it mean physically that every POVM can be realized as a projective measurement on a larger system?

  • Give an example of a POVM that is not a projective measurement and explain what advantage it provides.

Lecture Notes#

Overview#

A key limitation of projective measurements is that they cannot reliably distinguish non-orthogonal quantum states. If we relax the requirement that measurement operators be projectors and allow general positive operators—as long as they sum to the identity—we gain flexibility to improve state discrimination. This generalization defines positive operator-valued measures (POVMs).

Why We Need POVMs: Beyond Projectors#

Projective measurements require outcomes to correspond to orthogonal eigenspaces. In quantum information and state discrimination tasks, this orthogonality constraint is often too restrictive. POVMs relax this requirement: measurement operators need only be positive semidefinite and sum to identity, enabling measurements that gain information from non-orthogonal states.

POVM Definition#

A positive operator-valued measure (POVM) is a set of operators \(\{\hat{M}_1, \hat{M}_2, \ldots, \hat{M}_n\}\) on Hilbert space \(\mathcal{H}\) satisfying two conditions:

1. Positive semidefiniteness:

(238)#\[ \hat{M}_i \geq 0 \quad \text{for all } i \quad (\text{all eigenvalues} \geq 0) \]

2. Completeness:

(239)#\[ \sum_i \hat{M}_i = I \]

These conditions ensure that measurement probabilities are non-negative and sum to one.

Measurement with a POVM#

If a system in state \(\hat{\rho}\) is measured using POVM \(\{\hat{M}_i\}\):

Outcome probability:

(240)#\[ p_i = \operatorname{Tr}(\hat{M}_i \hat{\rho}) \]

Post-measurement state:

(241)#\[ \hat{\rho}'_i = \frac{\sqrt{\hat{M}_i} \hat{\rho} \sqrt{\hat{M}_i}}{\operatorname{Tr}(\hat{M}_i \hat{\rho})} \]

where \(\sqrt{\hat{M}_i}\) is the unique positive square root (defined via spectral decomposition: \(\hat{M}_i = \sum_j \lambda_{i,j} \vert v_{i,j}\rangle\langle v_{i,j}\vert \Rightarrow \sqrt{\hat{M}_i} = \sum_j \sqrt{\lambda_{i,j}} \vert v_{i,j}\rangle\langle v_{i,j}\vert\)).

Normalization check: \(\sum_i p_i = \sum_i \operatorname{Tr}(\hat{M}_i \hat{\rho}) = \operatorname{Tr}(I \hat{\rho}) = 1\)

POVM vs. Projective Measurement#

A projective measurement is a special case of a POVM where elements are orthogonal projectors \(\hat{M}_i = \hat{P}_i\) with:

  • \(\hat{P}_i^2 = \hat{P}_i\) (idempotent)

  • \(\hat{P}_i \hat{P}_j = \delta_{ij} \hat{P}_i\) (orthogonal)

  • \(\sum_i \hat{P}_i = I\) (complete)

Key differences:

Property

Projective Measurement

General POVM

Outcomes

Limited to \(d\) (Hilbert space dimension)

Can have \(n > d\) (overcomplete)

Orthogonality

\(\hat{M}_i \hat{M}_j = 0\) for \(i \neq j\)

Elements can overlap: \(\hat{M}_i \hat{M}_j \neq 0\)

Information

Binary outcome per qubit

More outcomes = more information per measurement

Rank

Rank-1 projectors

Can be rank-1 or higher

Physical meaning of outcome

Eigenvalue of measured observable

Index number (no direct physical meaning)

Information trade-off: General POVMs extract more information about the system by allowing non-orthogonal measurement elements, at the cost of less definite outcome meaning.

Unambiguous State Discrimination#

One of the most striking applications of POVMs: distinguishing non-orthogonal quantum states with certainty when a definite answer is given.

Problem: Given a qubit in one of two non-orthogonal states \(\vert \psi_1\rangle\) or \(\vert \psi_2\rangle\) with overlap \(\langle\psi_1\vert \psi_2\rangle = c\) (where \(0 < \vert c\vert < 1\)), can you determine which state with certainty (never make an error)?

Projective measurement: No. Non-orthogonal states have overlapping eigenspaces, so any projective measurement will sometimes give wrong answers.

POVM solution: Yes, with a trade-off. Use a three-outcome POVM:

  • Outcome 1: “State is \(\vert \psi_1\rangle\)” (definite, no error)

  • Outcome 2: “State is \(\vert \psi_2\rangle\)” (definite, no error)

  • Outcome 3: “Inconclusive” (try again or use different strategy)

Naimark’s Theorem: POVM as Extended Projective Measurement#

Every POVM on a Hilbert space can be realized as a projective measurement on an extended Hilbert space (original space plus auxiliary system).

Naimark’s Theorem

Every POVM \(\{\hat{M}_i\}\) on \(\mathcal{H}\) can be realized as a projective measurement on an extended space \(\mathcal{H} \otimes \mathcal{H}_\text{aux}\). Specifically, there exist:

  • An auxiliary (ancilla) Hilbert space \(\mathcal{H}_\text{aux}\)

  • Orthogonal projectors \(\{\hat{P}_i\}\) on \(\mathcal{H} \otimes \mathcal{H}_\text{aux}\)

such that:

(242)#\[ \hat{M}_i = \operatorname{Tr}_\text{aux}(\hat{P}_i (\hat{\rho} \otimes \vert 0\rangle\langle 0\vert_\text{aux})) \]

Physical picture: Entangle the system with an ancilla, perform a projective measurement on the combined system, and trace out (discard) the ancilla measurement outcome.

This theorem reveals a profound truth: POVMs are not fundamentally more general than projective measurements—they are equivalent to projective measurements on a larger system. The apparent “softness” of a POVM comes from not observing (or not having access to) the ancilla measurement outcome.

Examples#

Summary#

  • POVM definition: Positive operator-valued measures \(\{E_m\}\) satisfy \(E_m \geq 0\) and \(\sum_m E_m = I\), but are not required to be orthogonal or satisfy \(E_m^2 = E_m\) (unlike projectors).

  • Measurement probabilities: \(p_m = \operatorname{Tr}(E_m \hat{\rho})\) (same formula as projective), but post-measurement state is not determined by POVM alone—different physical realizations can give same outcome statistics.

  • Advantages over projectors: More general; one outcome can have multiple post-measurement states (depends on probe details); allows finer-grained outcome statistics with fewer outcomes than dimension of Hilbert space.

  • Unambiguous state discrimination: POVM enables non-projective measurements that sometimes identify unknown quantum states with zero ambiguity (no error-free projective equivalent exists).

See Also

Homework#

1. Projective measurement and POVM. Show that any projective measurement \(\{P_i\}\) (with \(P_i^2 = P_i\), \(P_i P_j = \delta_{ij}P_i\), \(\sum_i P_i = \hat{I}\)) is a special case of a POVM by setting \(E_i = P_i\). Verify the POVM conditions.

2. Trine POVM on qubits. Consider a trine POVM on a qubit: \(E_k = \frac{2}{3}\vert \phi_k\rangle\langle\phi_k\vert\) for \(k=0,1,2\), where \(\vert \phi_0\rangle = \vert 0\rangle\), \(\vert \phi_1\rangle = -\frac{1}{2}\vert 0\rangle + \frac{\sqrt{3}}{2}\vert 1\rangle\), \(\vert \phi_2\rangle = -\frac{1}{2}\vert 0\rangle - \frac{\sqrt{3}}{2}\vert 1\rangle\). Verify that \(\sum_{k=0}^2 E_k = \hat{I}\).

3. Unambiguous state discrimination. (Unambiguous state discrimination) Consider two non-orthogonal states \(\vert \psi_1\rangle\) and \(\vert \psi_2\rangle\) with \(\langle\psi_1\vert \psi_2\rangle = s \neq 0\). A POVM for unambiguous discrimination has three outcomes: “\(\psi_1\)”, “\(\psi_2\)”, and “?” (inconclusive). Explain why this task is impossible with a projective measurement if \(s \neq 0\), but possible with a POVM. What condition must the POVM elements satisfy to make discrimination unambiguous when a definite answer is given?

4. POVM state discrimination. Construct an explicit POVM for unambiguous discrimination of \(\vert 0\rangle\) and \(\vert +\rangle = \frac{1}{\sqrt{2}}(\vert 0\rangle + \vert 1\rangle)\). Find POVM elements \(E_1\) (identifies \(\vert 0\rangle\)), \(E_2\) (identifies \(\vert +\rangle\)), and \(E_?\) (inconclusive) that satisfy all POVM conditions. What is the probability of an inconclusive result for each state?

5. Naimark dilation. (Naimark dilation) Any POVM \(\{E_i\}\) on \(\mathcal{H}\) can be realized as a projective measurement on a larger Hilbert space \(\mathcal{H} \otimes \mathcal{K}\): introduce an ancilla in state \(\vert 0\rangle_\mathcal{K}\), perform a unitary \(U\) on \(\mathcal{H} \otimes \mathcal{K}\), then measure the ancilla. Show that if \(U(\vert \psi\rangle \otimes \vert 0\rangle) = \sum_i (\sqrt{E_i}\vert \psi\rangle) \otimes \vert i\rangle_\mathcal{K}\), then the probability of outcome \(i\) is \(\mathrm{Tr}[E_i \vert \psi\rangle\langle\psi\vert]\).

6. Informationally complete POVM. An informationally complete POVM (IC-POVM) has enough elements to uniquely determine any density matrix \(\hat{\rho}\) from the measurement statistics \(\{p_i = \mathrm{Tr}[E_i \hat{\rho}]\}\). For a \(d\)-dimensional system, what is the minimum number of POVM elements needed for an IC-POVM? Justify your answer by counting the degrees of freedom in \(\hat{\rho}\).

7. POVM outcome probabilities. For a POVM with elements \(\{E_i\}\), the probability of outcome \(i\) on mixed state \(\hat{\rho}\) is \(p_i = \mathrm{Tr}[E_i \hat{\rho}]\). Show that if \(\hat{\rho} = \sum_k q_k \hat{\rho}_k\) is a convex mixture, then \(p_i = \sum_k q_k \mathrm{Tr}[E_i \hat{\rho}_k]\). This shows POVM statistics are linear in the state.

8. POVM vs projective measurement. Compare the information gain of a POVM vs. a projective measurement. For a qubit initially in an unknown state on the Bloch sphere: (a) A projective measurement gives binary outcomes. How much classical information (in bits) does it convey? (b) A 4-element symmetric IC-POVM (SIC-POVM) with \(E_k = \frac{1}{2}\vert \phi_k\rangle\langle\phi_k\vert\) for 4 tetrahedral states. Explain qualitatively why this conveys more information about \(\hat{\rho}\) per measurement than a projective measurement.