5.1.3 Degenerate Perturbation Theory#
Prompts
Why does non-degenerate perturbation theory fail when multiple unperturbed states share the same energy, and what is the minimal fix?
In a degenerate subspace, why must we diagonalize the perturbation first, and how does this define the correct zeroth-order basis?
What is the physical difference between first-order splitting inside the degenerate manifold and second-order shifts from virtual coupling to outside states?
For a concrete model, how can you diagnose whether degeneracy is fully lifted, partially lifted, or unchanged at first order, and what should be done next in each case?
Lecture Notes#
Overview#
§5.1.2 assumed isolated levels and produced scalar corrections. This subsection handles the complementary case: when unperturbed levels are degenerate, perturbation theory must be organized as a block problem inside each degenerate manifold.
The practical workflow is: identify the degenerate manifold, diagonalize in-manifold matrix elements at first order, then add virtual-coupling corrections from outside manifolds at second order.
Why non-degenerate formulas fail#
For \(\hat{H}(\lambda)=\hat{H}_0+\lambda\hat{V}\), suppose \(\hat{H}_0\) has a \(d_n\)-fold degenerate eigenspace at energy \(E_n\), spanned by \(\vert n,\alpha\rangle\) (\(\alpha=1,\dots,d_n\)):
A quick wrong attempt is to reuse the non-degenerate structure directly:
Then terms with \(m=n\) but \(\beta\neq\alpha\) give
so the denominator diverges.
Core Idea
The proper strategy is divide and conquer:
reorganize the Hilbert space into degenerate-manifold blocks,
construct the effective Hamiltonian for each block,
diagonalize each block,
if residual degeneracy remains, continue to the next order/block level.
The central goal of degenerate perturbation theory is to construct the effective Hamiltonian within each degenerate subspace.
Problem Setup#
Degenerate Perturbation Problem
Consider \(\hat H(\lambda)=\hat H_0+\lambda\hat V\), and work in a basis \(\vert n,\alpha\rangle\) where \(n\) labels the manifold and \(\alpha=1,\dots,d_n\) labels orthonormal states inside that manifold:
Here \(V_{m\alpha,n\beta}=\langle m,\alpha\vert\hat V\vert n,\beta\rangle\), and states inside one degenerate manifold can mix.
The corresponding eigenvalue equation is \(\hat H(\lambda)\vert n,\beta(\lambda)\rangle=\sum_\alpha E_{n,\alpha\beta}(\lambda)\vert n,\alpha(\lambda)\rangle\), and our objective is to construct \(E_{n,\alpha\beta}(\lambda)\) and \(\vert n,\alpha(\lambda)\rangle\) order by order in \(\lambda\).
Hellmann-Feynman Identities (Degenerate Form)#
Differentiate the matrix eigenproblem and project onto \(\langle m,\gamma\vert\) at \(\lambda=0\):
Hellmann-Feynman identities (degenerate)
1st Hellmann-Feynman Identity (energy matrix derivative, within manifold):
2nd Hellmann-Feynman Identity (state-mixing derivative, across manifolds):
Derivation: degenerate Hellmann-Feynman identities
Step 1: Differentiate the matrix eigenvalue equation
Start from
Differentiate with respect to \(\lambda\):
Step 2: Project with \(\langle m,\gamma\vert\) and evaluate at \(\lambda=0\)
Left-multiply by \(\langle m,\gamma\vert\):
Evaluate at \(\lambda=0\):
So
Step 3: Split into \(m=n\) and \(m\neq n\)
If \(m=n\),
If \(m\neq n\),
Conjugating both sides — using \(\langle\psi\vert\phi\rangle^{*}=\langle\phi\vert\psi\rangle\) on the left and \(V_{m\alpha,n\beta}^{*}=V_{n\beta,m\alpha}\) (Hermiticity of \(\hat V\)) on the right — and relabeling \(m\leftrightarrow n\), \(\alpha\leftrightarrow\beta\) gives the conjugate identity
These are the two degenerate Hellmann-Feynman identities (with the conjugate form) used below.
Energy Corrections#
Use the Taylor expansion of the in-manifold energy matrix:
Energy matrix expansion up to second order
Using degenerate Hellmann-Feynman identities, the energy matrix correction is given by:
Derivation: first-order energy matrix derivative
Set \(m=n\) in the projected derivative equation:
With \(\partial_\lambda\hat H=\hat V\), this gives
Derivation: second-order energy matrix derivative
Differentiate \(\partial_\lambda E_{n,\alpha\beta}=\langle n,\alpha\vert\hat V\vert n,\beta\rangle\) once more:
Insert a complete basis explicitly:
Now split into \(m\neq n\) and \(m=n\) pieces. For \(m\neq n\), use the second Hellmann-Feynman identity:
and its conjugate relation, giving
For \(m=n\), terms involve in-manifold overlaps \(A_{\gamma\beta}=\langle n,\gamma\vert\partial_\lambda n,\beta\rangle\). These are gauge/basis-choice dependent within the degenerate manifold and can be set to zero by parallel-transport gauge after fixing the first-order in-manifold basis.
Therefore
State Corrections#
The corrected basis vectors are also expanded in \(\lambda\).
State expansion and first-order mixing
Using the second Hellmann-Feynman identity, the state correction is given by:
Derivation: first-order state derivative
Expand in the unperturbed basis and separate the two parts:
For \(m\neq n\), apply
For \(m=n\), define in-manifold overlaps \(A_{\beta\alpha}=\langle n,\beta\vert\partial_\lambda n,\alpha\rangle\). From orthonormality, \(A\) is anti-Hermitian, and by basis/gauge choice inside the degenerate manifold (after first-order diagonalization) one may set \(A=0\).
Under that gauge,
Note: Operator Form
The same result can be written with projectors \(\hat{P}_n=\sum_\alpha \vert n,\alpha\rangle\langle n,\alpha\vert\) and \(\hat{Q}_n=\hat I-\hat P_n\):
This is exactly the same content as the component formulas above:
\(\hat P_n\hat V\hat P_n \leftrightarrow V_{n\alpha,n\beta}\) (first-order block),
second-order projector term \(\leftrightarrow \sum_{m\neq n,\gamma}V_{n\alpha,m\gamma}V_{m\gamma,n\beta}/(E_n-E_m)\).
Use component form for calculations; use operator form as a compact summary.
Example: Three-level bright/dark mechanism
Problem. Consider
with \(\vert\lambda_{1,2}\vert\ll\Delta\). The degenerate manifold is \(\{\vert 1\rangle,\vert 2\rangle\}\) at energy \(0\).
Step 1. First-order block check. Inside the manifold,
so first-order splitting is absent.
Step 2. Build second-order block element-by-element. Because the only outside state is \(\vert 3\rangle\) with energy \(\Delta\),
Now evaluate four entries explicitly:
Hence
with \(\vert\lambda\rangle=\lambda_1\vert 1\rangle+\lambda_2\vert 2\rangle\).
Step 3. Diagonalize and identify bright/dark states. This is a rank-1 projector form, so eigenvalues are immediate:
A normalized bright state is
and a normalized dark state orthogonal to \(\vert B\rangle\) is
Step 4. Exact zero-mode check (not only perturbative).
For \(\hat H_0\): since \(\hat H_0=\Delta\vert 3\rangle\langle 3\vert\) and \(\vert D\rangle\in\mathrm{span}\{\vert 1\rangle,\vert 2\rangle\}\) has no \(\vert 3\rangle\) component (so \(\langle 3\vert D\rangle=0\)),
For \(\hat V\), rewrite the perturbation in factored form,
and act on \(\vert D\rangle=(\lambda_2^*\vert 1\rangle-\lambda_1^*\vert 2\rangle)/\sqrt{\vert\lambda_1\vert^2+\vert\lambda_2\vert^2}\):
The first term vanishes because \(\langle 3\vert D\rangle=0\). Expanding the second term with \(\langle 1\vert 1\rangle=\langle 2\vert 2\rangle=1\) and \(\langle 1\vert 2\rangle=\langle 2\vert 1\rangle=0\),
Therefore \((\hat H_0+\hat V)\vert D\rangle=0\) exactly, regardless of \(\lambda_{1,2}\).
Interpretation. First-order splitting is absent, but second-order virtual coupling generates an effective interaction with one bright mode and one protected dark mode.
Example: Spin-1 splitting from a degenerate manifold
Problem. Consider a spin-1 system in basis \(\{\vert+1\rangle,\vert 0\rangle,\vert-1\rangle\}\) with
In units where \(\hbar=1\), the operator matrices are
so
Unperturbed manifolds: \(E_1=1\) with \(\{\vert+1\rangle,\vert-1\rangle\}\) (degenerate), and \(E_0=0\) with \(\vert 0\rangle\).
Step 1 (degenerate PT): construct effective blocks.
For the \(E_1\) manifold (\(\alpha,\beta\in\{+1,-1\}\)), the only state outside the manifold is \(\vert 0\rangle\), with \(E_1-E_0=1\). The relevant matrix elements of \(\hat V\), read off from the matrix above, are \(V_{+1,+1}=1\), \(V_{-1,-1}=-1\), \(V_{+1,-1}=V_{-1,+1}=0\), and \(V_{\pm 1,0}=V_{0,\pm 1}=1/\sqrt 2\). Substituting into the Energy matrix expansion up to second order formula entry by entry:
Collecting these four entries gives, to \(O(\lambda^2)\),
(on the first-round corrected basis). For the \(E_0\) manifold,
At this stage, the original degeneracy is already lifted at \(O(\lambda)\).
Step 2 (second round: non-degenerate PT inside the lifted block).
In \(\hat H^{\text{eff}}_{1}\), the off-diagonal element is \(\lambda^2/2\), while the diagonal splitting is
So the induced mixing scale is
Hence the induced second-round energy correction is
which is negligible through \(O(\lambda^2)\). The induced second-round state correction, however, is \(O(\lambda)\) — the same order as the first-round cross-manifold mixing — so it must be carried out.
Therefore, up to \(O(\lambda^2)\),
A representative first-order state structure follows by substituting the \(\hat V\) matrix elements above into the State expansion and first-order mixing formula. For \(\vert\pm1\rangle\) (in the \(E_1\) manifold), the only state outside the manifold is \(\vert 0\rangle\) with \(E_1-E_0=1\):
For \(\vert 0\rangle\) (the \(E_0\) singlet), the sum over \(m\neq 0\) runs over \(\beta\in\{+1,-1\}\) with \(E_0-E_1=-1\):
The second round mixes the two corrected basis vectors \(\vert+1\rangle'\) and \(\vert-1\rangle'\) via standard non-degenerate PT inside the lifted block. Reading the matrix elements of \(\hat H_1^{\text{eff}}\) off Step 1, \(V'_{\mp1,\pm1}=\lambda^2/2\) and \(E'_{\pm1}-E'_{\mp1}=\pm 2\lambda\), so
Substituting the first-round results \(\vert\pm1\rangle'=\vert\pm1\rangle+(\lambda/\sqrt2)\vert 0\rangle+O(\lambda^2)\) and dropping terms beyond \(O(\lambda)\),
The \(\vert 0\rangle\) singlet has no second-round mixing partner, so \(\vert 0\rangle'\) above is the final state through \(O(\lambda)\).
Interpretation. This example shows why degenerate perturbation theory is naturally hierarchical: first build and diagonalize effective blocks, then iterate with non-degenerate perturbation inside the lifted blocks. The two-round structure is required even when the second-round energy shift is negligible, because the second-round state mixing is the same order as the first-round mixing.
Discussion: residual degeneracy at second order
If diagonalizing \(V_{n\alpha,n\beta}\) still leaves repeated eigenvalues, first order is not enough. Then inspect second-order block terms (and symmetry constraints) to decide whether residual degeneracy is lifted.
Summary#
Why non-degenerate formulas fail: Inside a degenerate manifold the denominator \(E_n-E_m=0\) blows up, so perturbation theory must be reorganized as a block algorithm — partition the Hilbert space by manifold, build the effective Hamiltonian inside each block, then diagonalize.
Degenerate Hellmann-Feynman identities drive every formula: the in-manifold identity \(\partial_\lambda E_{n,\alpha\beta}=V_{n\alpha,n\beta}\) controls splitting, and the cross-manifold identity \(\langle m,\alpha\vert\partial_\lambda n,\beta\rangle=V_{m\alpha,n\beta}/(E_n-E_m)\) for \(m\neq n\) controls mixing.
Good zeroth-order basis = eigenvectors of \(V_{n\alpha,n\beta}\): diagonalizing the perturbation inside each manifold first yields the first-order energy splittings and selects the unique basis to which perturbation theory applies.
Second-order shifts come from virtual transitions to other manifolds and back, weighted by \(1/(E_n-E_m)\); the same physics is written in component form or in projector form.
First-order state correction is purely cross-manifold: the second Hellmann-Feynman identity gives the inter-manifold mixing, while intra-manifold mixing is fixed by the good-basis choice, not by a formula.
Hierarchical iteration: if the first-order matrix \(V_{n\alpha,n\beta}\) still has repeated eigenvalues, residual degeneracy must be lifted at the next order via second-order block terms (or by symmetry constraints).
See Also
5.1.2 Non-Degenerate Perturbation Theory: coupling-over-gap formulas before degeneracy
5.2.1 Interaction Picture: co-moving frame that carries the small-coupling expansion into the time-dependent setting where \(\hat{V}(t)\) drives evolution off an \(\hat{H}_0\) baseline
5.1.1 Toy Model: exact level repulsion and perturbative expansion benchmark
Homework#
1. Why the old formula fails. Start from the non-degenerate first-order state correction formula and explain precisely where divergence appears for a \(d\)-fold degenerate level. Which hidden assumption about labeling eigenstates fails?
2. Block first, levels later. For a degenerate manifold with basis \(\{\vert n,\alpha\rangle\}_{\alpha=1}^d\), define \(W^{(n)}_{\alpha\beta}=\langle n,\alpha\vert\hat V\vert n,\beta\rangle\).
(a) Show that first-order shifts are eigenvalues of \(W^{(n)}\).
(b) Show that eigenvectors of \(W^{(n)}\) define the good zeroth-order basis.
(c) Explain why this removes the divergence problem before applying higher-order corrections.
3. Effective Hamiltonian and dark state. Consider a three-level system with \(\hat H_0=\Delta\,\vert 3\rangle\langle 3\vert\) (\(\Delta>0\)), so the ground manifold \(\{\vert 1\rangle,\vert 2\rangle\}\) is doubly degenerate at \(E=0\). Add
with real \(\mu,\lambda\) and \(\vert\mu\vert,\vert\lambda\vert\ll\Delta\).
(a) Compute \(\hat P_d\hat V\hat P_d\) in \(\{\vert 1\rangle,\vert 2\rangle\}\), where \(\hat P_d\) projects onto the degenerate subspace. Read off the first-order splitting.
(b) Set \(\mu=0\). Build the second-order effective Hamiltonian \(\hat H^{(2)}_{\mathrm{eff}}\) in \(\{\vert 1\rangle,\vert 2\rangle\}\) from virtual transitions through \(\vert 3\rangle\). Diagonalize and identify the bright state with shift \(-2\lambda^2/\Delta\) and the dark state with zero shift.
(c) Still with \(\mu=0\), show that the dark state is an exact zero-energy eigenstate of \(\hat H_0+\hat V\) (not only at second order), and explain in one sentence why.
(d) Restore \(\mu\ne 0\). Show that for the symmetric coupling here (equal \(\vert 3\rangle\langle 1\vert\) and \(\vert 3\rangle\langle 2\vert\) matrix elements), \(\hat P_d\hat V\hat P_d=\mu\hat X\) and the second-order rank-1 matrix \(\propto\begin{pmatrix}1&1\\1&1\end{pmatrix}\) commute, so the two splittings share an eigenbasis and combine independently. Write the two ground-manifold energies to order \(\mu+\lambda^2\).
4. Hydrogen Stark splitting. In hydrogen (ignoring spin), use basis \(\{\vert 2,0,0\rangle,\vert 2,1,0\rangle,\vert 2,1,1\rangle,\vert 2,1,-1\rangle\}\) and \(\hat V=e\mathcal E_0\hat{z}\).
(a) Use selection rules \(\Delta\ell=\pm1\), \(\Delta m=0\) to write the effective matrix structure.
(b) Explain why two states split linearly while two remain unsplit at first order.
(c) State the symmetry reason in one sentence.
5. Residual degeneracy. For \(\hat H_{\text{eff}}=\begin{pmatrix}a&b\\b^*&c\end{pmatrix}\):
(a) find eigenvalues,
(b) give the condition for no first-order splitting,
(c) explain what physical information must then be checked at second order (or via symmetry).
6. When to switch methods. For each Hamiltonian below, decide whether non-degenerate or degenerate perturbation theory is appropriate at first order, and justify in one sentence:
(a) Hydrogen \(n=2\) manifold in a small uniform electric field \(\mathcal E\hat{z}\).
(b) The \(n=1\) ground state of hydrogen in the same field.
(c) Two bands with gap \(\gg V\) and \(V\) mixing across the gap.
(d) Two nearly-degenerate bands with gap \(\ll V\).