5.1.2 Non-Degenerate Perturbation Theory#
Prompts
If you know \(\hat{H}_0\) and matrix elements \(V_{mn}=\langle m\vert\hat V\vert n\rangle\), how can you systematically build first- and second-order energy corrections and first-order state corrections?
Starting from the parameter-dependent eigenvalue equation, why does projection with \(m=n\) isolate energy shifts while \(m\neq n\) gives state-mixing amplitudes?
How do coupling strength and level spacing together control the size of corrections, convergence quality, and hybridization?
What signals that non-degenerate perturbation theory is no longer controlled, and why does that signal point to degenerate perturbation theory?
Lecture Notes#
Overview#
Sec. 5.1.1 used an exactly solvable toy model as a benchmark. This subsection takes the next step: derive the perturbative coefficients directly from unperturbed data, without exact diagonalization.
We focus on the non-degenerate case and build formulas for first- and second-order energy shifts, first-order state mixing, and the validity condition that tells us when this method breaks down and degenerate perturbation theory is needed.
Problem Setup#
Non-Degenerate Perturbation Problem
Consider \(\hat H(\lambda)=\hat H_0+\lambda\hat V\), and work in the eigenbasis of \(\hat H_0\):
Here \(V_{mn}=\langle m\vert\hat V\vert n\rangle\), and we assume a non-degenerate unperturbed spectrum at \(\lambda=0\) (that is, \(E_n\neq E_m\) for \(n\neq m\)).
The corresponding eigenvalue equation is \(\hat H(\lambda)\vert n(\lambda)\rangle=E_n(\lambda)\vert n(\lambda)\rangle\), and our objective is to construct \(E_n(\lambda)\) and \(\vert n(\lambda)\rangle\) order by order in \(\lambda\).
Notation Clarification
To simplify notation, when \((\lambda)\) is omitted we evaluate at \(\lambda=0\):
Also, \(\vert\partial_\lambda n\rangle\) means derivative of the state vector \(\vert n(\lambda)\rangle\) (not derivative of the integer label \(n\)):
If \(\vert n(\lambda)\rangle=\sum_k c_{nk}(\lambda)\vert k\rangle\), then \(\vert\partial_\lambda n\rangle=\sum_k\left(\partial_\lambda c_{nk}(\lambda)\right)_{\lambda=0}\vert k\rangle\).
The Rayleigh-Schrödinger notation used in the Summary and Homework writes Taylor coefficients with explicit order superscripts:
In particular, \(\vert n^{(0)}\rangle=\vert n\rangle\), \(E_n^{(0)}=E_n\), \(\vert n^{(1)}\rangle=\vert\partial_\lambda n\rangle\), \(E_n^{(1)}=\partial_\lambda E_n\), and \(E_n^{(2)}=\frac{1}{2}\partial_\lambda^2 E_n\).
Hellmann-Feynman Identities#
From the Taylor-expansion viewpoint, perturbation theory is a derivative problem: we need derivatives of energies and states with respect to \(\lambda\).
The Hellmann-Feynman identities are exactly the tool we need: they convert those derivatives into matrix elements of \(\hat V\), giving a recursive route to higher-order corrections.
Hellmann-Feynman identities (non-degenerate)
Differentiating the eigenvalue equation with respect to \(\lambda\) gives the two Hellmann-Feynman identities below.
1st Hellmann-Feynman Identity (energy derivative):
2nd Hellmann-Feynman Identity (state derivative):
Derivation: Hellmann-Feynman identities
Start from the eigenvalue equation:
Differentiate with respect to \(\lambda\):
Left-multiply by \(\langle m\vert\):
Use \(\langle m\vert\hat H=E_m\langle m\vert\), orthonormality \(\langle m\vert n\rangle=\delta_{mn}\), and \(\partial_\lambda\hat H=\hat V\):
Now separate two cases:
If \(m=n\):
If \(m\neq n\):
so
This is the coupling-over-gap structure that controls state mixing.
With these identities in hand, we can now compute energy and state corrections order by order.
Energy Corrections#
Now use the Taylor expansion of the eigenenergy around \(\lambda=0\). Once \(\partial_\lambda E_n\) and \(\partial_\lambda^2E_n\) are known, the perturbative coefficients follow immediately.
Energy Expansion
Using Hellmann-Feynman identities, the energy correction is given by:
Derivation: first-order energy derivative
From the first Hellmann-Feynman identity (the \(m=n\) case):
This is exactly the first derivative needed in the Taylor series.
Derivation: second-order energy derivative
Start from
Differentiate once more; since \(\hat V\) has no \(\lambda\) dependence in the linear ansatz \(\hat H(\lambda)=\hat H_0+\lambda\hat V\) (so \(\partial_\lambda\hat V=0\)), only the bra and ket are differentiated:
Insert the resolution of identity carefully:
Now separate the \(m\neq n\) and \(m=n\) contributions:
Here the second Hellmann-Feynman identity is used only for \(m\neq n\). For the \(m=n\) piece,
because normalization gives \(\langle n\vert n\rangle=1\).
Therefore
Poll: sign of ground-state shift
For a non-degenerate ground state \(n=0\), what is the sign of \(E_0^{(2)}=\sum_{m\neq 0}\frac{\vert V_{m0}\vert^2}{E_0-E_m}\)?
(A) always positive
(B) always negative
(C) either sign
(D) always zero
State Corrections#
Apply the same Taylor logic to states:
State expansion and first-order correction
Using the second Hellmann-Feynman identity, the state correction is given by:
Derivation: first-order state derivative
Expand the derivative vector in the unperturbed basis and separate diagonal/off-diagonal parts:
For \(m\neq n\), apply the second Hellmann-Feynman identity:
For \(m=n\), normalization gives
so \(\langle n\vert\partial_\lambda n\rangle\) is pure imaginary. One may choose the phase convention (parallel-transport gauge) so that
Under this gauge choice,
This makes the physical meaning transparent: mixing is stronger for larger coupling and smaller energy gap.
Physical Intuition and Validity#
Diagonal matrix elements \(V_{nn}\) shift energies at first order.
Off-diagonal matrix elements \(V_{mn}\) mix states and generate second-order shifts.
Level repulsion: virtual transitions push levels apart.
Breakdown criterion: if \(\vert E_n-E_m\vert\) becomes comparable to \(\vert V_{mn}\vert\), denominators become large and non-degenerate perturbation theory loses validity.
Then we must switch to degenerate perturbation theory (Sec. 5.1.3).
Example: check against the qubit toy model
Take exactly the toy model from Sec. 5.1.1:
Use the \(\{\vert 0\rangle,\vert 1\rangle\}\) basis of \(\hat Z\):
and matrix elements
For the branch connected to \(\vert 0\rangle\) at \(\lambda=0\):
so
For the branch connected to \(\vert 1\rangle\):
so
These match the exact energies
expanded at small \(\lambda\).
For state derivatives:
which is also consistent with the exact eigenvector expansion.
Discussion: what does divergence mean?
When \(E_n-E_m\to 0\), formulas like \(V_{mn}/(E_n-E_m)\) diverge. Is this a physical singularity, or a sign that our basis choice is no longer appropriate? Explain how block-diagonalization inside the nearly degenerate subspace resolves this issue.
Summary#
Non-degenerate perturbation theory is an iterative derivative algorithm built from Hellmann-Feynman identities.
First-order energy comes from diagonal matrix elements: \(E_n^{(1)}=V_{nn}\).
First-order state mixing is coupling over gap: \(\vert n^{(1)}\rangle=\sum_{m\neq n}\vert m\rangle V_{mn}/(E_n-E_m)\).
Second-order energy is a sum over virtual processes: \(E_n^{(2)}=\sum_{m\neq n}\vert V_{mn}\vert^2/(E_n-E_m)\).
Near degeneracy is not a failure of QM; it signals a change of method (degenerate perturbation theory).
See Also
5.1.1 Toy Model: exact qubit solution and small-\(\lambda\) expansion
5.1.3 Degenerate Perturbation Theory: block treatment near resonance
Homework#
The problems are ordered to follow the lecture algorithm: setup and identities \(\to\) corrections \(\to\) physical interpretation and breakdown.
1. Gauge choice and normalization convention. Starting from \(\langle n(\lambda)\vert n(\lambda)\rangle=1\), show that one can choose the phase convention so that
Explain why this choice simplifies perturbative state corrections.
2. Hellmann-Feynman in practice. Consider
so \(\hat H(\lambda)=\hat H_0+\lambda\hat V\).
(a) Use the first Hellmann-Feynman identity \(E_n^{(1)}=V_{nn}\) to read off the three first-order energies.
(b) Use the second Hellmann-Feynman identity to compute \(\langle 2\vert\partial_\lambda 1\rangle\) and \(\langle 3\vert\partial_\lambda 1\rangle\) at \(\lambda=0\). Assemble \(\vert 1^{(1)}\rangle\).
(c) Verify your result for \(E_1^{(1)}\) by diagonalizing \(\hat H(\lambda)\) at \(\lambda=0.01\) and checking \([E_1(0.01)-E_1(0)]/0.01\approx E_1^{(1)}\).
3. Coupling over gap. Two two-level systems share the same coupling magnitude \(\vert V_{12}\vert=\hbar\omega_c\) but different gaps. System A has \(E_2-E_1=10\hbar\omega_c\); system B has \(E_2-E_1=2\hbar\omega_c\). Set \(\lambda=1\) in both.
(a) Compute the first-order state correction \(\vert 1^{(1)}\rangle=\sum_{m\neq 1}\vert m\rangle V_{m1}/(E_1-E_m)\) for each system.
(b) Compute the second-order energy correction \(E_1^{(2)}\) for each.
(c) Estimate the largest coupling magnitude \(\vert V_{12}\vert\) for which the perturbative expansion is well-behaved (next correction small compared to the gap). Comment on which system is “more perturbative” and why.
4. Second-order energy correction and sign. Starting from
(a) show \(E_0^{(2)}\le 0\) for the non-degenerate ground state,
(b) interpret the result as level repulsion,
(c) connect it to the variational principle.
5. Toy-model consistency check. For \(\hat H=\hat Z+\lambda \hat X\):
(a) compute \(E_\pm^{(1)}\) and \(E_\pm^{(2)}\) by perturbation theory,
(b) compute \(\vert\pm^{(1)}\rangle\),
(c) compare with the exact expansion of \(E_\pm(\lambda)=\pm\sqrt{1+\lambda^2}\) and comment on agreement order-by-order.
6. Harmonic oscillator with linear perturbation. Let
(a) Compute \(V_{nn}\), \(V_{n+1,n}\), \(V_{n-1,n}\).
(b) Use non-degenerate perturbation theory to obtain \(E_n\) up to second order.
(c) Solve the full Hamiltonian by completing the square and compare with perturbation theory.
7. Selection rules and parity. For a 1D parity-symmetric potential with odd perturbation \(\hat V=\lambda \hat x\):
(a) show \(E_n^{(1)}=0\) for all \(n\),
(b) identify which matrix elements contribute to \(E_n^{(2)}\),
(c) explain how symmetry controls which virtual transitions are allowed.
8. Near-degeneracy and breakdown. A 3-level system has unperturbed energies \(E_1=0\), \(E_2=\Delta\), \(E_3=10\Delta\), with nonzero couplings \(V_{12}\) and \(V_{23}\).
(a) Write \(E_1^{(2)}\) explicitly.
(b) Analyze \(\Delta\to 0\) and identify which term causes the breakdown.
(c) Give the correct next-step method (basis choice and effective subspace treatment) instead of applying non-degenerate formulas blindly.