5.1.3 Degenerate Perturbation Theory#

Prompts

  • Why does non-degenerate perturbation theory fail when multiple unperturbed states share the same energy, and what is the minimal fix?

  • In a degenerate subspace, why must we diagonalize the perturbation first, and how does this define the correct zeroth-order basis?

  • What is the physical difference between first-order splitting inside the degenerate manifold and second-order shifts from virtual coupling to outside states?

  • For a concrete model, how can you diagnose whether degeneracy is fully lifted, partially lifted, or unchanged at first order, and what should be done next in each case?

Lecture Notes#

Overview#

§5.1.2 assumed isolated levels and produced scalar corrections. This subsection handles the complementary case: when unperturbed levels are degenerate, perturbation theory must be organized as a block problem inside each degenerate manifold.

The practical workflow is: identify the degenerate manifold, diagonalize in-manifold matrix elements at first order, then add virtual-coupling corrections from outside manifolds at second order.

Why non-degenerate formulas fail#

For \(\hat{H}(\lambda)=\hat{H}_0+\lambda\hat{V}\), suppose \(\hat{H}_0\) has a \(d_n\)-fold degenerate eigenspace at energy \(E_n\), spanned by \(\vert n,\alpha\rangle\) (\(\alpha=1,\dots,d_n\)):

\[ \hat{H}_0\vert n,\alpha\rangle=E_n\vert n,\alpha\rangle. \]

A quick wrong attempt is to reuse the non-degenerate structure directly:

\[ \vert n,\alpha\rangle^{(1)}\sim\sum_{(m,\beta)\neq(n,\alpha)}\vert m,\beta\rangle\frac{(\cdots)}{E_n-E_m}. \]

Then terms with \(m=n\) but \(\beta\neq\alpha\) give

\[ E_n-E_m=E_n-E_n=0, \]

so the denominator diverges.

Core Idea

The proper strategy is divide and conquer:

  • reorganize the Hilbert space into degenerate-manifold blocks,

  • construct the effective Hamiltonian for each block,

  • diagonalize each block,

  • if residual degeneracy remains, continue to the next order/block level.

The central goal of degenerate perturbation theory is to construct the effective Hamiltonian within each degenerate subspace.

Problem Setup#

Degenerate Perturbation Problem

Consider \(\hat H(\lambda)=\hat H_0+\lambda\hat V\), and work in a basis \(\vert n,\alpha\rangle\) where \(n\) labels the manifold and \(\alpha=1,\dots,d_n\) labels orthonormal states inside that manifold:

\[ \hat H_0=\sum_n\sum_\alpha E_n\,\vert n,\alpha\rangle\langle n,\alpha\vert. \]
\[ \hat V=\sum_{m,\alpha}\sum_{n,\beta}\vert m,\alpha\rangle V_{m\alpha,n\beta}\langle n,\beta\vert. \]

Here \(V_{m\alpha,n\beta}=\langle m,\alpha\vert\hat V\vert n,\beta\rangle\), and states inside one degenerate manifold can mix.

The corresponding eigenvalue equation is \(\hat H(\lambda)\vert n,\beta(\lambda)\rangle=\sum_\alpha E_{n,\alpha\beta}(\lambda)\vert n,\alpha(\lambda)\rangle\), and our objective is to construct \(E_{n,\alpha\beta}(\lambda)\) and \(\vert n,\alpha(\lambda)\rangle\) order by order in \(\lambda\).

Hellmann-Feynman Identities (Degenerate Form)#

Differentiate the matrix eigenproblem and project onto \(\langle m,\gamma\vert\) at \(\lambda=0\):

\[ \langle m,\gamma\vert\partial_\lambda\hat H\vert n,\beta\rangle =\partial_\lambda E_{n,\gamma\beta}\,\delta_{mn} +(E_n-E_m)\langle m,\gamma\vert\partial_\lambda n,\beta\rangle. \]

Hellmann-Feynman identities (degenerate)

  • 1st Hellmann-Feynman Identity (energy matrix derivative, within manifold):

\[ \partial_\lambda E_{n,\alpha\beta}=V_{n\alpha,n\beta}. \]
  • 2nd Hellmann-Feynman Identity (state-mixing derivative, across manifolds):

\[ \langle m,\alpha\vert\partial_\lambda n,\beta\rangle =\frac{V_{m\alpha,n\beta}}{E_n-E_m} \text{ for }m\neq n. \]

Energy Corrections#

Use the Taylor expansion of the in-manifold energy matrix:

Energy matrix expansion up to second order

Using degenerate Hellmann-Feynman identities, the energy matrix correction is given by:

\[\begin{split} \begin{split} E_{n,\alpha\beta}(\lambda) &=E_n\delta_{\alpha\beta} +\lambda\,\partial_\lambda E_{n,\alpha\beta} +\frac{\lambda^2}{2}\,\partial_\lambda^2 E_{n,\alpha\beta} +O(\lambda^3)\\ &=E_n\delta_{\alpha\beta} +\lambda V_{n\alpha,n\beta} +\lambda^2\sum_{m\neq n}\sum_\gamma \frac{V_{n\alpha,m\gamma}V_{m\gamma,n\beta}}{E_n-E_m} +O(\lambda^3). \end{split} \end{split}\]

State Corrections#

The corrected basis vectors are also expanded in \(\lambda\).

State expansion and first-order mixing

Using the second Hellmann-Feynman identity, the state correction is given by:

\[\begin{split} \begin{split} \vert n,\alpha(\lambda)\rangle &=\vert n,\alpha\rangle +\lambda\vert\partial_\lambda n,\alpha\rangle +O(\lambda^2)\\ &=\vert n,\alpha\rangle +\lambda\sum_{m\neq n}\sum_\beta \vert m,\beta\rangle\frac{V_{m\beta,n\alpha}}{E_n-E_m} +O(\lambda^2). \end{split} \end{split}\]

Summary#

  • Why non-degenerate formulas fail: Inside a degenerate manifold the denominator \(E_n-E_m=0\) blows up, so perturbation theory must be reorganized as a block algorithm — partition the Hilbert space by manifold, build the effective Hamiltonian inside each block, then diagonalize.

  • Degenerate Hellmann-Feynman identities drive every formula: the in-manifold identity \(\partial_\lambda E_{n,\alpha\beta}=V_{n\alpha,n\beta}\) controls splitting, and the cross-manifold identity \(\langle m,\alpha\vert\partial_\lambda n,\beta\rangle=V_{m\alpha,n\beta}/(E_n-E_m)\) for \(m\neq n\) controls mixing.

  • Good zeroth-order basis = eigenvectors of \(V_{n\alpha,n\beta}\): diagonalizing the perturbation inside each manifold first yields the first-order energy splittings and selects the unique basis to which perturbation theory applies.

  • Second-order shifts come from virtual transitions to other manifolds and back, weighted by \(1/(E_n-E_m)\); the same physics is written in component form or in projector form.

  • First-order state correction is purely cross-manifold: the second Hellmann-Feynman identity gives the inter-manifold mixing, while intra-manifold mixing is fixed by the good-basis choice, not by a formula.

  • Hierarchical iteration: if the first-order matrix \(V_{n\alpha,n\beta}\) still has repeated eigenvalues, residual degeneracy must be lifted at the next order via second-order block terms (or by symmetry constraints).

See Also

Homework#

1. Why the old formula fails. Start from the non-degenerate first-order state correction formula and explain precisely where divergence appears for a \(d\)-fold degenerate level. Which hidden assumption about labeling eigenstates fails?

2. Block first, levels later. For a degenerate manifold with basis \(\{\vert n,\alpha\rangle\}_{\alpha=1}^d\), define \(W^{(n)}_{\alpha\beta}=\langle n,\alpha\vert\hat V\vert n,\beta\rangle\).

(a) Show that first-order shifts are eigenvalues of \(W^{(n)}\).

(b) Show that eigenvectors of \(W^{(n)}\) define the good zeroth-order basis.

(c) Explain why this removes the divergence problem before applying higher-order corrections.

3. Effective Hamiltonian and dark state. Consider a three-level system with \(\hat H_0=\Delta\,\vert 3\rangle\langle 3\vert\) (\(\Delta>0\)), so the ground manifold \(\{\vert 1\rangle,\vert 2\rangle\}\) is doubly degenerate at \(E=0\). Add

\[ \hat V=\mu\,(\vert 1\rangle\langle 2\vert+\vert 2\rangle\langle 1\vert)+\lambda\,(\vert 3\rangle\langle 1\vert+\vert 3\rangle\langle 2\vert+\mathrm{h.c.}), \]

with real \(\mu,\lambda\) and \(\vert\mu\vert,\vert\lambda\vert\ll\Delta\).

(a) Compute \(\hat P_d\hat V\hat P_d\) in \(\{\vert 1\rangle,\vert 2\rangle\}\), where \(\hat P_d\) projects onto the degenerate subspace. Read off the first-order splitting.

(b) Set \(\mu=0\). Build the second-order effective Hamiltonian \(\hat H^{(2)}_{\mathrm{eff}}\) in \(\{\vert 1\rangle,\vert 2\rangle\}\) from virtual transitions through \(\vert 3\rangle\). Diagonalize and identify the bright state with shift \(-2\lambda^2/\Delta\) and the dark state with zero shift.

(c) Still with \(\mu=0\), show that the dark state is an exact zero-energy eigenstate of \(\hat H_0+\hat V\) (not only at second order), and explain in one sentence why.

(d) Restore \(\mu\ne 0\). Show that for the symmetric coupling here (equal \(\vert 3\rangle\langle 1\vert\) and \(\vert 3\rangle\langle 2\vert\) matrix elements), \(\hat P_d\hat V\hat P_d=\mu\hat X\) and the second-order rank-1 matrix \(\propto\begin{pmatrix}1&1\\1&1\end{pmatrix}\) commute, so the two splittings share an eigenbasis and combine independently. Write the two ground-manifold energies to order \(\mu+\lambda^2\).

4. Hydrogen Stark splitting. In hydrogen (ignoring spin), use basis \(\{\vert 2,0,0\rangle,\vert 2,1,0\rangle,\vert 2,1,1\rangle,\vert 2,1,-1\rangle\}\) and \(\hat V=e\mathcal E_0\hat{z}\).

(a) Use selection rules \(\Delta\ell=\pm1\), \(\Delta m=0\) to write the effective matrix structure.

(b) Explain why two states split linearly while two remain unsplit at first order.

(c) State the symmetry reason in one sentence.

5. Residual degeneracy. For \(\hat H_{\text{eff}}=\begin{pmatrix}a&b\\b^*&c\end{pmatrix}\):

(a) find eigenvalues,

(b) give the condition for no first-order splitting,

(c) explain what physical information must then be checked at second order (or via symmetry).

6. When to switch methods. For each Hamiltonian below, decide whether non-degenerate or degenerate perturbation theory is appropriate at first order, and justify in one sentence:

(a) Hydrogen \(n=2\) manifold in a small uniform electric field \(\mathcal E\hat{z}\).

(b) The \(n=1\) ground state of hydrogen in the same field.

(c) Two bands with gap \(\gg V\) and \(V\) mixing across the gap.

(d) Two nearly-degenerate bands with gap \(\ll V\).