25  The helium atom

Now we’re ready to consider another example, a very important one: the helium atom. We will study helium first with perturbation theory, and then the variational method to compare. (We won’t do the example of Rayleigh-Ritz, but it yields the most precise results for helium energies and states.)

The helium atom has two orbital electrons, which we’ll label 1 and 2. Writing only the kinetic and Coulomb terms, the helium Hamiltonian is \hat{H} = \frac{\hat{p}_1^2}{2m} - \frac{Ze^2}{r_1} + \frac{\hat{p}_2^2}{2m} - \frac{Ze^2}{r_2} + \frac{e^2}{|\vec{r_2} - \vec{r_1}|}. This is no longer a central potential, due to the repulsive force between the two electrons given by the last term. We can’t solve exactly, so we’ll treat the electron repulsion as a perturbation, writing the above as \hat{H} = \hat{H}_1 + \hat{H}_2 + \hat{W}.

With this choice, the unperturbed wavefunction can be decomposed into a product of two hydrogenic wavefunctions. However, we have to keep in mind that with two identical electrons, spin-statistics is active, which means that we must have either a symmetric or antisymmetric combination of wavefunctions, \psi_{\textrm{space}}^{(0)}(\vec{r}_1, \vec{r}_2) = \frac{1}{\sqrt{2}} \left[ \psi_{n_1,l_1,m_1}(\vec{r}_1) \psi_{n_2,l_2,m_2} (\vec{r}_2) \pm \psi_{n_1,l_1,m_1}(\vec{r}_2) \psi_{n_2,l_2,m_2}(\vec{r_1}) \right]. Which choice we make for a given state depends on the spin state of the electrons, which itself can also be symmetric or anti-symmetric; we’ll see how this works in some specific examples. Before we consider spin, we can observe that with this setup the unperturbed energies for helium are just the sum of two hydrogenic energies: E_{n_1,n_2}^{(0)} = -Z^2 \left( \frac{1}{n_1^2} + \frac{1}{n_2^2} \right)\ \textrm{Ry} = (-54.4\ {\rm eV}) \left( \frac{1}{n_1^2} + \frac{1}{n_2^2} \right) plugging in Z=2 for helium. For the ground state n_1 = n_2 = 1, we find an energy which is 8 times the ground-state energy of hydrogen, or -108.8 eV.

Excited states of helium and instability

It’s interesting to note that the ionization energy required to liberate one of the electrons is only half of the ground-state energy. The threshold for a free electron is at n=\infty, so we have for example E_{\infty,1}^{(0)} = -54.4\ {\rm eV}. This immediately implies that even the lowly n_1=2, n_2=2 energy level is unstable, since E_{2,2}^{(0)} = -54.4\ {\rm eV} \left( \frac{1}{4} + \frac{1}{4} \right) = -27.2\ {\rm eV}. This is well above the energy scale E_{\infty,1}^{(0)}. Because of this energy difference, the state \ket{2,2} will auto-ionize; there is a transition in which one electron goes to the ground state and the other escapes, leaving behind a {\rm He}^+ ion.

We will return to say more about auto-ionization later in these notes when we have better tools to describe the transition. For now, the key point to remember is that the stable states of an ordinary helium atom have at least one electron in the ground state. (This is all zeroth-order in perturbation theory, which isn’t quite right, but the qualitative conclusions and the existence of auto-ionizing states are robust.)

25.0.1 Helium ground-state energy: perturbative estimate

Let’s focus on just the ground state, where both electrons have n = 1. This forces the individual wavefunctions for both electrons to be in the \ket{nlm} = \ket{100} state, which in turn tells us that the spatial wavefunction must be symmetric, \psi_{g,\rm space}^{(0)}(\vec{r}_1, \vec{r}_2) = \psi_{100}(\vec{r}_1) \psi_{100} (\vec{r}_2) Fermi statistics then requires the overall wavefunction to be antisymmetric under exchange, which means that the spin wavefunction is definitely in the s=0 singlet state, \ket{\chi_g} = \frac{1}{\sqrt{2}} (\ket{\uparrow}_1 \ket{\downarrow}_2 - \ket{\downarrow}_1 \ket{\uparrow}_2).

Now let’s include the electron repulsion as a perturbation. We can evaluate the first-order energy correction for the ground state as an integral: E_0^{(1)} = \left\langle \frac{e^2}{|\vec{r}_1 - \vec{r}_2|} \right\rangle \\ = \int d^3 r_1 \int d^3r_2 \frac{|\psi_{100}(r_1)|^2 |\psi_{100}(r_2)|^2}{|\vec{r_1} - \vec{r_2}|}. The hydrogenic ground-state wavefunction is, being careful to include the spherical harmonic to get the normalization right, \psi_{100}(r) = \frac{1}{\sqrt{\pi}} \sqrt{\frac{Z^3}{a_0^3}} e^{-Zr/a_0}. Dealing with the relative position of the electrons is trickier; we can rewrite |\vec{r}_1 - \vec{r}_2| = \sqrt{(\vec{r}_1 - \vec{r}_2)^2} \\ = \sqrt{r_1^2 + r_2^2 - 2r_1 r_2 \cos \theta}, where \theta is the relative angle between the two position vectors. Substituting in above, we have E_0^{(1)} = \left(\frac{Z^3}{\pi a_0^3} \right)^2 \int dr_1 r_1^2 e^{-2Zr_1 / a_0} \int dr_2 r_2^2 e^{-2Zr_2 / a_0} \\ \int d\Omega_1 \int d\Omega_2 \frac{e^2}{\sqrt{r_1^2 + r_2^2 - 2r_1 r_2 \cos \theta}}. We haven’t specified any coordinate axes yet; if we take the z axis to point along \vec{r}_1, then the angle \theta is just \theta_2, and we have E_0^{(1)} = \frac{8 Z^6 e^2}{a_0^6} \int dr_1 \int dr_2 (...) \int d(\cos \theta_2) \frac{1}{\sqrt{r_1^2 + r_2^2 - 2r_1 r_2 \cos \theta_2}}. The last remaining angular integral isn’t too bad: \int_{-1}^1 d(\cos \theta_2) \frac{1}{\sqrt{r_1^2 + r_2^2 - 2r_1 r_2 \cos \theta_2}} \\ = -\frac{1}{r_1 r_2} \left. \sqrt{r_1^2 + r_2^2 - 2r_1 r_2 \cos \theta_2}\right|_{-1}^1 \\ = -\frac{1}{r_1 r_2} (\sqrt{(r_1 - r_2)^2} - \sqrt{(r_1 + r_2)^2}) \\ = \frac{1}{r_1 r_2} (r_1 + r_2 - |r_1 - r_2|). Recognizing that the integral is totally symmetric under exchange of r_1 and r_2, we can assume r_1 > r_2 and just double the result; with this assumption, the above expression becomes simply 4 / r_1 (including the doubling due to symmetry.) Because we made this assumption, we also have to enforce it for the radial integrals by integrating r_1 over [r_2, \infty] instead of [0, \infty]. Doing a u-substitution first with u = Zr/a_0, we have

E_0^{(1)} = \frac{32 Ze^2}{a_0} \int_0^\infty du_2 \int_{u_2}^\infty du_1 u_1 u_2^2 e^{-2(u_1+u_2)}

The radial integrals are straightforward from here: they give a numerical factor of 5/256, so that we find for the energy correction E_0^{(1)} = \frac{5}{8} \frac{Ze^2}{a_0} = \frac{5Z}{4}\ \textrm{Ry}. This is a positive shift, changing the estimated ground-state energy for helium from -108.8 eV to -74.8 eV. This is a reasonably good estimate; the experimental value is around -78.98 eV.

25.0.2 Helium ground-state energy: variational estimate

Let’s see how the variational method does for helium, and whether we can gain any additional physical insight from that approach. There are many possible choices of variational wave functions, but a relatively simple approach turns out to work quite well. We’ll continue to assume that we can write the total wavefunction as a product of two hydrogenic wavefunctions, \tilde{\psi}(r) = \frac{1}{\sqrt{\pi}} \left( \frac{Z^\star}{a_0}\right)^{3/2} e^{-Z^\star r/a_0} and then \tilde{\psi}(r_1, r_2) = \tilde{\psi}(r_1) \tilde{\psi}(r_2). We take Z^\star to be our variational parameter; we could have tried a_0 as well, but since the wavefunction only depends on the ratio of the two the outcome would be the same. The trial energy is thus E(Z^\star) = \int d^3 r_1 \int d^3 r_2 \tilde{\psi}^\star(r_1) \tilde{\psi}^\star(r_2) \left( \frac{\hat{p}_1^2}{2m} - \frac{Ze^2}{r_1} \right. \\ \left. + \frac{\hat{p}_2^2}{2m} - \frac{Ze^2}{r_2} + \frac{e^2}{|\vec{r}_1 - \vec{r}_2|} \right) \tilde{\psi}(r_1) \tilde{\psi}(r_2). (note that this is the Hamiltonian so it is correct that the factors here contain Z and not Z^\star.) Rather than evaluating the integral directly this time, we can use some tricks. Notice that since the \tilde{\psi} function is just a ground-state hydrogenic wavefunction but with Z^\star as the atomic number, which means that \left( \frac{\hat{p}_1^2}{2m} - \frac{Z^\star e^2}{r_1} \right) \tilde{\psi}(r_1) = E_0(Z^\star) \tilde{\psi}(r_1) where E_0(Z^\star) = -(Z^\star)^2\ \textrm{Ry}, and similarly for r_2. In other words, \tilde{\psi}(r) is a ground-state eigenfunction of the “unphysical” hydrogen Hamiltonian with Z^\star. By rewriting -\frac{Ze^2}{r} = -\frac{Z^\star e^2}{r} + \frac{(Z^\star - Z) e^2}{r}, we reduce the big integral above to E(Z^\star) = 2 E_0(Z^\star) + 2(Z^\star - Z) e^2 \left\langle \frac{1}{r} \right\rangle_{Z^\star} + e^2 \left\langle \frac{1}{|\vec{r}_1 - \vec{r}_2|} \right\rangle_{Z^\star}. where the subscript Z^\star reminds us that we’re taking the expectation value with respect to the trial wavefunction. The first expectation value is just given by the virial theorem, e^2 \left\langle \frac{1}{r} \right\rangle_{Z^\star} = \frac{Z^\star e^2}{a_0} = 2Z^\star\ \textrm{Ry}. There’s no neat trick to evaluate the second expectation value, but we just found what it was in our perturbative estimate: e^2 \left\langle \frac{1}{|\vec{r}_1 - \vec{r}_2|} \right\rangle_{Z^\star} = \frac{5}{4} Z^\star\ \textrm{Ry}. Putting everything together, we see that E(Z^\star) = \left( 2(Z^\star)^2 - 4ZZ^\star + \frac{5}{4} Z^\star \right)\ \textrm{Ry}. This function is minimized for Z^\star_{\textrm{min}} = Z - \frac{5}{16}, giving the variational ground-state energy E(Z^\star_{\textrm{min}}) = - \left[ 2 \left(Z - \frac{5}{16} \right) \right]^2\ \textrm{Ry} \\ \approx -77.38\ \textrm{eV}. Comparing to the experimental value of -78.975 eV and our perturbative estimate of -74.8 eV, we see that the variational approach has given a significant improvement; and if we were more clever in our choice of variational function and parameters, we would be able to lower the variational bound and do even better. (As I mentioned, a Rayleigh-Ritz approach gives the best variational bounds on the helium ground-state and excited-state energies in practice.)

One other interesting point to make is that the result coming from our variational calculation has a sensible physical interpretation. We can think of Z^\star as an effective nuclear charge seen by each electron, which has been reduced by some amount due to “screening” by the presence of the other electron. The fact that this gives a very close answer to the true ground-state energy implies that this charge screening is the dominant correction to the non-interacting picture.

25.1 Exchange interaction and excited states of helium

Now let’s extend our discussion beyond the ground state. As we noted before, for a helium atom all stable bound states require one of the electrons to be in the n=1 orbital, or else the atom will have enough energy to auto-ionize. This means that the most interesting case to study is that where one electron is in state \ket{100}, and the other in a general state \ket{nlm}. The spatial part of the two-electron wave function can be written as \psi_{\textrm{space}}(\vec{r}_1, \vec{r}_2) = \frac{1}{\sqrt{2}} \left[ \psi_{100}(\vec{r}_1) \psi_{nlm} (\vec{r}_2) \pm \psi_{100}(\vec{r}_2) \psi_{nlm}(\vec{r_1}) \right]. The sign here is dictated by what the spin state of the electrons is. There are two possibilities:

  • Total spin s=0 (singlet): the spin wavefunction is antisymmetric, which means that the spatial wavefunction is symmetric. This was the case for the n_1 = 1, n_2 = 1 ground state. The s=0 states are known collectively as para-helium states.
  • Total spin s=1 (triplet): the spin wavefunction is symmetric, so the spatial wavefunction is antisymmetric. The s=1 states of helium are known as ortho-helium states.

Out of all of these states, the ground state is special since it can only have a symmetric spatial wavefunction, which means ground-state helium only exists as a para-helium state. However, for any of the other excited states of this form, both the ortho- and para-helium states will be present.

Let’s have a look at the excited-state energies, once again treating the Coulomb repulsion between the electrons as a perturbation. For any choice of \ket{nlm}, we have for the unperturbed energy E^{(0)} = - Z^2 \left( 1 + \frac{1}{n^2} \right)\ \textrm{Ry}, and the energy correction is \Delta E = \left\langle \frac{e^2}{|\vec{r}_1 - \vec{r}_2|} \right\rangle. The spin part of the wavefunction has no effect on this perturbation, so the expectation value is taken purely with respect to the spatial part: \Delta E = \int d^3r_1 \int d^3r_2 |\psi_{\textrm{space}}(\vec{r}_1, \vec{r}_2)|^2 \frac{e^2}{|\vec{r}_1 - \vec{r}_2|}. If we expand out the spatial wavefunction, we find that we can write \Delta E = I \pm J, where I = \int d^3 r_1 \int d^3 r_2 |\psi_{100}(\vec{r}_1)|^2 |\psi_{nlm}(\vec{r}_2)|^2 \frac{e^2}{|\vec{r}_1 - \vec{r}_2|} is an integral of the form that we saw above for the ground state, and J = \textrm{Re} \left[ \int d^3 r_1 \int d^3 r_2 \psi_{100}^\star(\vec{r}_1) \psi_{nlm}^\star(\vec{r}_2) \frac{e^2}{|\vec{r}_1 - \vec{r}_2|} \psi_{100}(\vec{r}_2) \psi_{nlm}(\vec{r}_1) \right] is a new term, known as the exchange interaction (or “exchange energy.”)

In general, the value of J versus I will depend on what system we’re studying; it is even possible in some cases for J to be negative. For helium, it is the case that the para-helium states (singlet), which are spatially symmetric, have increased energy, while ortho-helium (triplet) states have lower energy. This makes physical sense, since we can think of the symmetrized spatial wavefunctions on average putting the electrons “closer” to each other, giving a larger energy contribution from the Coulomb repulsion between them, and vice-versa.

Since the difference between the two energies depends entirely on the spin state of the two electrons, it’s interesting to notice that we can actually rewrite the total energy shift in the form \Delta E = I - \frac{1}{2} \left(1 + \vec{\sigma}_1 \cdot \vec{\sigma}_2\right) J where as a reminder, the \sigma_i matrices are the dimensionless versions of the \vec{S}_i operators; we have \vec{\sigma}_1 \cdot \vec{\sigma}_2 = +1 for the S=1 triplet state, and -3 for the S=0 singlet. Thus, we find that even without any explicit spin dependence included in the Hamiltonian, the presence of Fermi-Dirac statistics here has induced a spin-dependent energy splitting - and a very large one, compared to the fine-structure spin effects we saw before! (This is purely electromagnetic and will be of order eV, not \alpha^2 times eV.)