The Dirac Equation

Our goal is to find the analog of the Schrödinger equation for relativistic spin one-half particles. We should note, however, that even in the Schrödinger equation the interaction of the field with spin was rather ad hoc: there was no explanation of the gyromagnetic ratio of 2. One can incorporate spin into the non-relativistic equation by using the Schrödinger-Pauli Hamiltonian, which contains the dot product of the Pauli matrices with the momentum operator.

$\begin{displaymath} H={1\over 2m}\left(\vec{\sigma}\cdot[\vec{p}+{e\over c}\vec{A}(\vec{r},t)]\right)^2-e\phi(\vec{r},t) \end{displaymath}$

A little computation shows that this gives the correct interaction with spin.

$\begin{displaymath} H={1\over 2m}[\vec{p}+{e\over c}\vec{A}(\vec{r},t)]^2-e\phi(\vec{r},t)+{e\hbar\over 2mc}\vec{\sigma}\cdot\vec{B}(\vec{r},t) \end{displaymath}$

This Hamiltonian acts on a two component spinor.
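The computation relating the two forms of the Hamiltonian above can be sketched as follows. This is a sketch that assumes the standard Pauli matrix identity $(\vec{\sigma}\cdot\vec{a})(\vec{\sigma}\cdot\vec{b})=\vec{a}\cdot\vec{b}+i\vec{\sigma}\cdot(\vec{a}\times\vec{b})$ and uses the shorthand $\vec{\pi}=\vec{p}+{e\over c}\vec{A}$, a symbol introduced here only for this illustration. The cross product does not vanish because $\vec{p}=-i\hbar\vec{\nabla}$ does not commute with $\vec{A}$.

$\begin{eqnarray*} \left(\vec{\sigma}\cdot\vec{\pi}\right)^2&=&\vec{\pi}\cdot\vec{\pi}+i\vec{\sigma}\cdot\left(\vec{\pi}\times\vec{\pi}\right) \\ \vec{\pi}\times\vec{\pi}&=&{e\over c}\left(\vec{p}\times\vec{A}+\vec{A}\times\vec{p}\right)=-i\hbar{e\over c}\left(\vec{\nabla}\times\vec{A}\right)=-i{e\hbar\over c}\vec{B} \\ {1\over 2m}\left(\vec{\sigma}\cdot\vec{\pi}\right)^2&=&{1\over 2m}\left[\vec{p}+{e\over c}\vec{A}\right]^2+{e\hbar\over 2mc}\vec{\sigma}\cdot\vec{B} \\ \end{eqnarray*}$

The last line reproduces the spin term with its coefficient ${e\hbar\over 2mc}$, which corresponds to a gyromagnetic ratio of 2.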

We can extend this concept to use the relativistic energy equation. The idea is to replace $\vec{p}$ with $\vec{\sigma}\cdot\vec{p}$ in the relativistic energy equation.

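$\begin{eqnarray*} \left({E\over c}\right)^2-p^2=(mc)^2 \\ \left({E\over c}-\vec{\sigma}\cdot\vec{p}\right)\left({E\over c}+\vec{\sigma}\cdot\vec{p}\right)=(mc)^2 \\ \left(i\hbar{\partial\over\partial x_0}+i\hbar\vec{\sigma}\cdot\vec{\nabla}\right)\left(i\hbar{\partial\over\partial x_0}-i\hbar\vec{\sigma}\cdot\vec{\nabla}\right)\phi=(mc)^2\phi \\ \end{eqnarray*}$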

Instead of an equation which is second order in the time derivative, we can make a first order equation, like the Schrödinger equation, by extending this equation to four components.

$\begin{eqnarray*} \phi^{(L)}&=&\phi \\ \phi^{(R)}&=&{1\over mc}\left(i\hbar{\partial\over\partial x_0}-i\hbar\vec{\sigma}\cdot\vec{\nabla}\right)\phi^{(L)} \\ \end{eqnarray*}$

Now rewriting in terms of $\psi_A=\phi^{(R)}+\phi^{(L)}$ and $\psi_B=\phi^{(R)}-\phi^{(L)}$ and ordering it as a matrix equation, we get an equation that can be written as a dot product between 4-vectors.

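$\begin{eqnarray*} \pmatrix{-i\hbar{\partial\over\partial x_0} & -i\hbar\vec{\sigma}\cdot\vec{\nabla} \cr i\hbar\vec{\sigma}\cdot\vec{\nabla} & i\hbar{\partial\over\partial x_0} \cr}\pmatrix{\psi_A\cr \psi_B\cr}&=&-mc\pmatrix{\psi_A\cr \psi_B\cr} \\ \pmatrix{-i\hbar{\partial\over\partial x_0} & -i\hbar\vec{\sigma}\cdot\vec{\nabla} \cr i\hbar\vec{\sigma}\cdot\vec{\nabla} & i\hbar{\partial\over\partial x_0} \cr}&=&\hbar\left[\pmatrix{0 & -i\sigma_i \cr i\sigma_i & 0 \cr}{\partial\over\partial x_i}+\pmatrix{1 & 0 \cr 0 & -1 \cr}{\partial\over\partial x_4}\right] =\hbar\left[\gamma_\mu{\partial\over\partial x_\mu}\right] \\ \end{eqnarray*}$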

Define the 4 by 4 matrices $\gamma_\mu$ by

$\begin{eqnarray*} \gamma_i&=&\pmatrix{0 & -i\sigma_i \cr i\sigma_i & 0 \cr} \\ \gamma_4&=&\pmatrix{ 1 & 0 \cr 0 & -1 \cr} \\ \end{eqnarray*}$

With this definition, the relativistic equation can be simplified a great deal

$\begin{eqnarray*} \left(\gamma_\mu{\partial\over\partial x_\mu}+{mc\over\hbar}\right)\psi=0 \\ \end{eqnarray*}$

where the gamma matrices are given by

$\gamma_1=\pmatrix{0 & 0 & 0 & -i \cr 0 & 0 & -i & 0 \cr 0 & i & 0 & 0 \cr i & 0 & 0 & 0 \cr}$ $\gamma_2=\pmatrix{0 & 0 & 0 & -1 \cr 0 & 0 & 1 & 0 \cr 0 & 1 & 0 & 0 \cr -1 & 0 & 0 & 0 \cr}$ $\gamma_3=\pmatrix{0 & 0 & -i & 0 \cr 0 & 0 & 0 & i \cr i & 0 & 0 & 0 \cr 0 & -i & 0 & 0 \cr}$

and they satisfy anti-commutation relations.

$\begin{displaymath} \{\gamma_\mu,\gamma_\nu\}=2\delta_{\mu\nu} \end{displaymath}$

In fact, any set of matrices that satisfies the anti-commutation relations would yield equivalent physics results; however, we will work in the above explicit representation of the gamma matrices.
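The anti-commutation relations can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper names `build_gammas` and `anticommutator` are made up for this illustration. It builds the four matrices from the Pauli matrices using the block definitions above and tests every pair.

```python
import numpy as np

# Pauli matrices sigma_1, sigma_2, sigma_3
sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
zero2 = np.zeros((2, 2), dtype=complex)
one2 = np.eye(2, dtype=complex)


def build_gammas():
    """Return [gamma_1, gamma_2, gamma_3, gamma_4] in the representation above."""
    gammas = [np.block([[zero2, -1j * s], [1j * s, zero2]]) for s in sigma]
    gammas.append(np.block([[one2, zero2], [zero2, -one2]]))
    return gammas


def anticommutator(a, b):
    return a @ b + b @ a


gammas = build_gammas()
identity4 = np.eye(4)

# {gamma_mu, gamma_nu} = 2 delta_{mu nu} for all 16 pairs
anticommutators_ok = all(
    np.allclose(anticommutator(gammas[mu], gammas[nu]), 2 * (mu == nu) * identity4)
    for mu in range(4)
    for nu in range(4)
)
# In this representation every gamma matrix is also Hermitian
hermitian_ok = all(np.allclose(g, g.conj().T) for g in gammas)

print(anticommutators_ok, hermitian_ok)
```

Both checks should come out true for this representation. A different but equivalent representation would pass the first check with different explicit matrices.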

Defining $\bar{\psi}=\psi^\dagger\gamma_4$,

$\begin{displaymath} j_\mu=ic\bar{\psi}\gamma_\mu\psi \end{displaymath}$

satisfies the equation of a conserved 4-vector current

$\begin{displaymath} {\partial\over\partial x_\mu}j_\mu=0 \end{displaymath}$

and also transforms like a 4-vector. The fourth component of the vector shows that the probability density is $\psi^\dagger\psi$. This indicates that the normalization of the state includes all four components of the Dirac spinors.

For non-relativistic electrons, the first two components of the Dirac spinor are large while the last two are small.

$\begin{eqnarray*} \psi=\pmatrix{\psi_A\cr \psi_B} \\ \psi_B \approx {c\over 2mc^2}\vec{\sigma}\cdot\left(\vec{p}+{e\over c}\vec{A}\right)\psi_A\approx{pc\over 2mc^2}\psi_A\\ \end{eqnarray*}$

We use this fact to write an approximate two-component equation derived from the Dirac equation in the non-relativistic limit.

$\begin{eqnarray*} \left({p^2\over 2m}-{Ze^2\over 4\pi r}-{p^4\over 8m^3c^2}+{Ze^2\vec{L}\cdot\vec{S}\over 8\pi m^2c^2r^3}+{Ze^2\hbar^2\over 8m^2c^2}\delta^3(\vec{r})\right)\psi &=&E^{(NR)}\psi \\ \end{eqnarray*}$

This ``Schrödinger equation'', derived from the Dirac equation, agrees well with the one we used to understand the fine structure of Hydrogen. The first two terms are the kinetic and potential energy terms for the unperturbed Hydrogen Hamiltonian. The third term is the relativistic correction to the kinetic energy. The fourth term is the correct spin-orbit interaction, including the Thomas Precession effect that we did not take the time to understand when we did the NR fine structure. The fifth term is the so-called Darwin term, which we said would come from the Dirac equation; and now it has.

For a free particle, each component of the Dirac spinor satisfies the Klein-Gordon equation.

$\begin{displaymath} \psi_{\vec{p}}=u_{\vec{p}}e^{i(\vec{p}\cdot\vec{x}-Et)/\hbar} \end{displaymath}$

This is consistent with the relativistic energy relation.

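The four normalized solutions for a Dirac particle at rest are

$\begin{eqnarray*} \psi^{(1)}=\psi_{E=+mc^2,+\hbar/2}&=&{1\over\sqrt{V}}\pmatrix{1\cr 0\cr 0\cr 0\cr}e^{-imc^2t/\hbar} \\ \psi^{(2)}=\psi_{E=+mc^2,-\hbar/2}&=&{1\over\sqrt{V}}\pmatrix{0\cr 1\cr 0\cr 0\cr}e^{-imc^2t/\hbar} \\ \psi^{(3)}=\psi_{E=-mc^2,+\hbar/2}&=&{1\over\sqrt{V}}\pmatrix{0\cr 0\cr 1\cr 0\cr}e^{+imc^2t/\hbar} \\ \psi^{(4)}=\psi_{E=-mc^2,-\hbar/2}&=&{1\over\sqrt{V}}\pmatrix{0\cr 0\cr 0\cr 1\cr}e^{+imc^2t/\hbar} \\ \end{eqnarray*}$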

The first and third have spin up while the second and fourth have spin down. The first and second are positive energy solutions while the third and fourth are ``negative energy solutions'', which we still need to understand.

The next step is to find the solutions with definite momentum. The four plane wave solutions to the Dirac equation are

$\begin{displaymath} \psi^{(r)}_{\vec{p}}\equiv \sqrt{mc^2\over\vert E\vert V}u^{(r)}_{\vec{p}}e^{i(\vec{p}\cdot\vec{x}-Et)/\hbar} \end{displaymath}$

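where the four spinors are given by

$\begin{eqnarray*} u^{(1)}_{\vec{p}}=\sqrt{E+mc^2\over 2mc^2}\pmatrix{1\cr 0\cr {p_zc\over E+mc^2}\cr {(p_x+ip_y)c\over E+mc^2}\cr} \\ u^{(2)}_{\vec{p}}=\sqrt{E+mc^2\over 2mc^2}\pmatrix{0\cr 1\cr {(p_x-ip_y)c\over E+mc^2}\cr {-p_zc\over E+mc^2}\cr} \\ u^{(3)}_{\vec{p}}=\sqrt{-E+mc^2\over 2mc^2}\pmatrix{{-p_zc\over -E+mc^2}\cr {-(p_x+ip_y)c\over -E+mc^2}\cr 1\cr 0\cr} \\ u^{(4)}_{\vec{p}}=\sqrt{-E+mc^2\over 2mc^2}\pmatrix{{-(p_x-ip_y)c\over -E+mc^2}\cr {p_zc\over -E+mc^2}\cr 0\cr 1\cr} \\ \end{eqnarray*}$

$E$ is positive for solutions 1 and 2 and negative for solutions 3 and 4. The spinors are orthogonal

$\begin{displaymath} u^{(r)\dagger}_{\vec{p}} u^{(r')}_{\vec{p}}={\vert E\vert\over mc^2}\delta_{rr'} \end{displaymath}$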

and the normalization constants have been set so that the states are properly normalized and the spinors follow the convention given above, with the normalization proportional to energy.
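The orthogonality and normalization of the four spinors can be checked numerically. The sketch below assumes NumPy, units with $m=c=1$, and an arbitrary test momentum, all chosen only for illustration; the function name `spinor` is hypothetical. It uses the block form of the free-particle Hamiltonian, $H=\pmatrix{mc^2 & c\vec{\sigma}\cdot\vec{p}\cr c\vec{\sigma}\cdot\vec{p} & -mc^2\cr}$, which is what $ic\gamma_4\gamma_jp_j+mc^2\gamma_4$ reduces to in this representation.

```python
import numpy as np

# Assumed units for illustration: m = c = 1; the momentum is an arbitrary test value.
m, c = 1.0, 1.0
p = np.array([0.3, -0.4, 1.2])
px, py, pz = p
mc2 = m * c**2
E_abs = np.sqrt(np.dot(p, p) * c**2 + mc2**2)  # |E| from the energy relation

sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
sigma_dot_p = sum(pk * s for pk, s in zip(p, sigma))
one2 = np.eye(2)

# Free-particle Dirac Hamiltonian in block form for this representation
H = np.block([[mc2 * one2, c * sigma_dot_p], [c * sigma_dot_p, -mc2 * one2]])


def spinor(r):
    """Return (E, u) for solution r = 1..4, following the four spinors above."""
    E = E_abs if r in (1, 2) else -E_abs
    d = abs(E) + mc2  # E + mc^2 for r = 1, 2 and -E + mc^2 for r = 3, 4
    norm = np.sqrt(d / (2 * mc2))
    columns = {
        1: [1, 0, pz * c / d, (px + 1j * py) * c / d],
        2: [0, 1, (px - 1j * py) * c / d, -pz * c / d],
        3: [-pz * c / d, -(px + 1j * py) * c / d, 1, 0],
        4: [-(px - 1j * py) * c / d, pz * c / d, 0, 1],
    }
    return E, norm * np.array(columns[r], dtype=complex)


solutions = [spinor(r) for r in range(1, 5)]
U = np.column_stack([u for _, u in solutions])

# Each spinor should be an eigenvector of H with its own energy
eigen_ok = all(np.allclose(H @ u, E * u) for E, u in solutions)
# Gram matrix u^(r)dagger u^(r') should equal (|E| / mc^2) delta_{rr'}
gram = U.conj().T @ U
gram_ok = np.allclose(gram, (E_abs / mc2) * np.eye(4))

print(eigen_ok, gram_ok)
```

Changing `p` to another momentum should leave both checks true, since they only rely on the relativistic energy relation.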

The solutions are not in general eigenstates of any component of spin. Helicity, the component of spin along the direction of the momentum, is conserved, and for momentum along the $z$ axis the spinors above are helicity eigenstates.

Note that with $E$ negative, the exponential $e^{i(\vec{p}\cdot\vec{x}-Et)/\hbar}$ has the phase velocity, the group velocity and the probability flux all in the opposite direction of the momentum as we have defined it. This clearly doesn't make sense. Solutions 3 and 4 need to be understood in a way for which the non-relativistic operators have not prepared us. Let us simply relabel solutions 3 and 4 such that

$\begin{eqnarray*} \vec{p}\rightarrow -\vec{p} \\ E\rightarrow -E \\ \end{eqnarray*}$

so that all the energies are positive and the momenta point in the direction of the velocities. This means we change the signs in solutions 3 and 4 as follows.

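$\begin{eqnarray*} \psi^{(1)}_{\vec{p}}&=&\sqrt{E+mc^2\over 2EV} \pmatrix{1\cr 0\cr {p_zc\over E+mc^2}\cr {(p_x+ip_y)c\over E+mc^2}\cr}e^{i(\vec{p}\cdot\vec{x}-Et)/\hbar} \\ \psi^{(2)}_{\vec{p}}&=&\sqrt{E+mc^2\over 2EV} \pmatrix{0\cr 1\cr {(p_x-ip_y)c\over E+mc^2}\cr {-p_zc\over E+mc^2}\cr}e^{i(\vec{p}\cdot\vec{x}-Et)/\hbar} \\ \psi^{(3)}_{\vec{p}}&=&\sqrt{E+mc^2\over 2EV} \pmatrix{{p_zc\over E+mc^2}\cr {(p_x+ip_y)c\over E+mc^2}\cr 1\cr 0\cr}e^{-i(\vec{p}\cdot\vec{x}-Et)/\hbar} \\ \psi^{(4)}_{\vec{p}}&=&\sqrt{E+mc^2\over 2EV} \pmatrix{{(p_x-ip_y)c\over E+mc^2}\cr {-p_zc\over E+mc^2}\cr 0\cr 1\cr}e^{-i(\vec{p}\cdot\vec{x}-Et)/\hbar} \\ \end{eqnarray*}$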

We have plane waves of the form

$\begin{displaymath} e^{\pm ip_\mu x_\mu/\hbar} \end{displaymath}$

with the plus sign for solutions 1 and 2 and the minus sign for solutions 3 and 4. This $\pm$ sign in the exponential is not very surprising from the point of view of possible solutions to a differential equation. The problem now is that for solutions 3 and 4 the momentum and energy operators must have a minus sign added to them, and the phase of the wave function at a fixed position behaves in the opposite way as a function of time from what we expect and from what we see in solutions 1 and 2. It is as if solutions 3 and 4 are moving backward in time.

If we change the charge on the electron from $-e$ to $+e$ and change the sign of the exponent, the Dirac equation remains invariant. Thus, we can turn the negative exponent solution (going backward in time) into the conventional positive exponent solution if we change the charge to $+e$. We can interpret solutions 3 and 4 as positrons. We will make this switch more carefully when we study the charge conjugation operator.

The Dirac equation should be invariant under Lorentz boosts and under rotations, both of which are just changes in the definition of an inertial coordinate system. Under Lorentz boosts, ${\partial\over\partial x_\mu}$ transforms like a 4-vector but the $\gamma_\mu$ matrices are constant. The Dirac equation is shown to be invariant under boosts along the $x_i$ direction if we transform the Dirac spinor according to

$\begin{eqnarray*} \psi'&=&S_{boost}\psi \\ S_{boost}&=&\cosh{\chi\over 2}+i\gamma_i\gamma_4\sinh{\chi\over 2} \\ \end{eqnarray*}$

with $\tanh\chi=\beta$.

The Dirac equation is invariant under rotations about the $k$ axis if we transform the Dirac spinor according to

$\begin{eqnarray*} \psi'&=&S_{rot}\psi \\ S_{rot}&=&\cos{\theta\over 2}+\gamma_i\gamma_j\sin{\theta\over 2} \end{eqnarray*}$

where $ijk$ is a cyclic permutation.
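Some properties of the two spinor transformations above can be explored numerically. The sketch below assumes NumPy; the function names `s_rot` and `s_boost` and the test angles are choices made for this illustration. It checks that $S_{rot}$ is unitary and changes the sign of the spinor after a $2\pi$ rotation, that $S_{boost}$ is Hermitian rather than unitary with rapidities that add, and that $S^{-1}\gamma_1S$ mixes $\gamma_1$ with $\gamma_2$ (rotation about axis 3) or with $\gamma_4$ (boost along axis 1).

```python
import numpy as np

sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
zero2 = np.zeros((2, 2), dtype=complex)
one2 = np.eye(2, dtype=complex)

# gamma_1..gamma_3 and gamma_4 in the representation used in the text
g1, g2, g3 = [np.block([[zero2, -1j * s], [1j * s, zero2]]) for s in sigma]
g4 = np.block([[one2, zero2], [zero2, -one2]])
I4 = np.eye(4, dtype=complex)


def s_rot(theta, gi, gj):
    """S_rot = cos(theta/2) + gamma_i gamma_j sin(theta/2), with ijk cyclic."""
    return np.cos(theta / 2) * I4 + np.sin(theta / 2) * (gi @ gj)


def s_boost(chi, gi):
    """S_boost = cosh(chi/2) + i gamma_i gamma_4 sinh(chi/2)."""
    return np.cosh(chi / 2) * I4 + 1j * np.sinh(chi / 2) * (gi @ g4)


theta, chi = 0.7, 0.4  # arbitrary test values
R = s_rot(theta, g1, g2)  # rotation about axis 3
B = s_boost(chi, g1)      # boost along axis 1

rot_unitary = np.allclose(R.conj().T @ R, I4)
rot_2pi_flips_sign = np.allclose(s_rot(2 * np.pi, g1, g2), -I4)
rot_mixes = np.allclose(
    np.linalg.inv(R) @ g1 @ R, np.cos(theta) * g1 + np.sin(theta) * g2
)

boost_hermitian = np.allclose(B, B.conj().T)
boost_not_unitary = not np.allclose(B.conj().T @ B, I4)
boost_additive = np.allclose(s_boost(0.4, g1) @ s_boost(0.9, g1), s_boost(1.3, g1))
boost_mixes = np.allclose(
    np.linalg.inv(B) @ g1 @ B, np.cosh(chi) * g1 + 1j * np.sinh(chi) * g4
)

print(rot_unitary, rot_2pi_flips_sign, rot_mixes)
print(boost_hermitian, boost_not_unitary, boost_additive, boost_mixes)
```

The factor of $i$ multiplying $\gamma_4$ in the boost check reflects the imaginary fourth coordinate implied by the $\delta_{\mu\nu}$ metric used here.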

Another symmetry related to the choice of coordinate system is parity. Under a parity inversion operation the Dirac equation remains invariant if

$\begin{displaymath} \psi'=S_P\psi=\gamma_4\psi \end{displaymath}$

Since $\gamma_4=\pmatrix{1 & 0 & 0 & 0 \cr 0 & 1 & 0 & 0 \cr 0 & 0 & -1 & 0 \cr 0 & 0 & 0 & -1 \cr}$, the third and fourth components of the spinor change sign while the first two don't. Since we could have chosen $-\gamma_4$, all we know is that components 3 and 4 have the opposite parity of components 1 and 2.

From 4 by 4 matrices, we may derive 16 independent components of covariant objects. We define the product of all gamma matrices,

$\begin{displaymath} \gamma_5=\gamma_1\gamma_2\gamma_3\gamma_4 \end{displaymath}$

which anticommutes with all the gamma matrices, since moving any $\gamma_\mu$ through the product passes three other gamma matrices.

$\begin{displaymath} \{\gamma_\mu,\gamma_5\}=0 \end{displaymath}$

For rotations and boosts, $\gamma_5$ commutes with $S$ since it commutes with the pair of gamma matrices. For a parity inversion, it anticommutes with $S_P=\gamma_4$.

The simplest set of covariants we can make from Dirac spinors and $\gamma$ matrices are tabulated below.

Classification	Covariant Form	No. of Components

Scalar	$\bar{\psi}\psi$	1
Pseudoscalar	$\bar{\psi}\gamma_5\psi$	1
Vector	$\bar{\psi}\gamma_\mu\psi$	4
Axial Vector	$\bar{\psi}\gamma_5\gamma_\mu\psi$	4
Rank 2 antisymmetric tensor	$\bar{\psi}\sigma_{\mu\nu}\psi$	6
Total		16

Products of more $\gamma$ matrices turn out to repeat the same quantities because the square of any $\gamma$ matrix is 1.
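The count of 16 independent matrices can be checked numerically. The sketch below assumes NumPy and takes $\sigma_{\mu\nu}={1\over 2i}[\gamma_\mu,\gamma_\nu]$ as the definition of the tensor matrices; that definition is an assumption made for this example, since it is not spelled out in the text above. It flattens the 16 matrices into vectors and computes the rank of the resulting $16\times 16$ array, and it also confirms the anticommutation of $\gamma_5$ with every $\gamma_\mu$.

```python
from itertools import combinations

import numpy as np

sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
zero2 = np.zeros((2, 2), dtype=complex)
one2 = np.eye(2, dtype=complex)

gammas = [np.block([[zero2, -1j * s], [1j * s, zero2]]) for s in sigma]
gammas.append(np.block([[one2, zero2], [zero2, -one2]]))  # gamma_4
I4 = np.eye(4, dtype=complex)

# gamma_5 = gamma_1 gamma_2 gamma_3 gamma_4
g5 = gammas[0] @ gammas[1] @ gammas[2] @ gammas[3]
g5_anticommutes = all(np.allclose(g @ g5 + g5 @ g, 0) for g in gammas)

basis = [I4, g5]                     # scalar, pseudoscalar
basis += gammas                      # vector (4)
basis += [g5 @ g for g in gammas]    # axial vector (4)
# Tensor (6): sigma_{mu nu} = (1/2i)[gamma_mu, gamma_nu] is an assumed definition
basis += [
    (gammas[a] @ gammas[b] - gammas[b] @ gammas[a]) / 2j
    for a, b in combinations(range(4), 2)
]

# 16 flattened 4x4 matrices are linearly independent iff this rank is 16
rank = np.linalg.matrix_rank(np.array([b.flatten() for b in basis]))

print(len(basis), rank, g5_anticommutes)
```

A rank equal to the number of matrices means no product in the table can be written as a combination of the others, so the set spans all 4 by 4 matrices.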

For many purposes, it is useful to write the Dirac equation in the traditional form $H\psi=E\psi$. To do this, we must separate the space and time derivatives, making the equation less covariant looking.

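$\begin{eqnarray*} \left(\gamma_\mu{\partial\over\partial x_\mu}+{mc\over\hbar}\right)\psi=0 \\ \left(\gamma_j{\partial\over\partial x_j}+\gamma_4{\partial\over\partial (ict)}+{mc\over\hbar}\right)\psi=0 \\ \left(-c\gamma_4\gamma_jp_j+i{mc^2}\gamma_4\right)\psi=-\hbar{\partial\over\partial t}\psi \\ \end{eqnarray*}$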

Thus we can identify the operator below as the Hamiltonian.

$\begin{displaymath} H=ic\gamma_4\gamma_jp_j+mc^2\gamma_4 \end{displaymath}$

The Hamiltonian helps us identify constants of the motion. If an operator commutes with $H$, it represents a conserved quantity.

It's easy to see that $p_k$ commutes with the Hamiltonian for a free particle so that momentum will be conserved. The components of orbital angular momentum do not commute with $H$.

$\begin{displaymath} [H,L_z]=ic\gamma_4[\gamma_jp_j,xp_y-yp_x]=\hbar c\gamma_4(\gamma_1p_y-\gamma_2 p_x) \end{displaymath}$

The components of spin also do not commute with $H$.

$\begin{displaymath} {[H,S_z]}=\hbar c\gamma_4[\gamma_2p_x-\gamma_1p_y] \end{displaymath}$

But, from the above, the components of total angular momentum do commute with $H$.

$\begin{eqnarray*}[H,J_z]=[H,L_z]+[H,S_z]=\hbar c\gamma_4(\gamma_1p_y-\gamma_2 p_x)+\hbar c\gamma_4[\gamma_2p_x-\gamma_1p_y]=0 \\ \end{eqnarray*}$

The Dirac equation naturally conserves total angular momentum but not the orbital or spin parts of it.

We can also see that the helicity, or spin along the direction of motion, does commute with $H$.

$\begin{displaymath} [H,\vec{S}\cdot\vec{p}]=[H,\vec{S}]\cdot\vec{p}=0 \end{displaymath}$
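The spin and helicity commutators can be checked in momentum space, where the components of $\vec{p}$ are ordinary numbers. The sketch below assumes NumPy, units with $\hbar=c=m=1$, an arbitrary test momentum, and the spin operators $S_k={\hbar\over 2}\Sigma_k$ with $\Sigma_k=-i\gamma_i\gamma_j$ ($ijk$ cyclic). That form of the spin matrices is an assumption made for this example, chosen to be consistent with the rotation operator $S_{rot}$ given earlier. The orbital commutator is not tested because it needs the position operator.

```python
import numpy as np

# Assumed units for illustration: hbar = c = m = 1; p is an arbitrary test momentum.
hbar, c, m = 1.0, 1.0, 1.0
p = np.array([0.3, -0.4, 1.2])

sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
zero2 = np.zeros((2, 2), dtype=complex)
one2 = np.eye(2, dtype=complex)
g1, g2, g3 = [np.block([[zero2, -1j * s], [1j * s, zero2]]) for s in sigma]
g4 = np.block([[one2, zero2], [zero2, -one2]])


def commutator(a, b):
    return a @ b - b @ a


# H = i c gamma_4 gamma_j p_j + m c^2 gamma_4
H = 1j * c * g4 @ (p[0] * g1 + p[1] * g2 + p[2] * g3) + m * c**2 * g4

# Assumed spin operators: S_k = (hbar/2) * (-i gamma_i gamma_j), ijk cyclic
S = [
    -0.5j * hbar * (g2 @ g3),
    -0.5j * hbar * (g3 @ g1),
    -0.5j * hbar * (g1 @ g2),
]

# [H, S_z] should equal hbar c gamma_4 (gamma_2 p_x - gamma_1 p_y)
spin_commutator_ok = np.allclose(
    commutator(H, S[2]), hbar * c * g4 @ (g2 * p[0] - g1 * p[1])
)
spin_z_not_conserved = not np.allclose(commutator(H, S[2]), 0)

# Helicity operator S.p commutes with H
S_dot_p = p[0] * S[0] + p[1] * S[1] + p[2] * S[2]
helicity_conserved = np.allclose(commutator(H, S_dot_p), 0)

print(spin_commutator_ok, spin_z_not_conserved, helicity_conserved)
```

If the momentum is chosen along the $z$ axis, the commutator of $H$ with $S_z$ vanishes as well, since $S_z$ is then proportional to the helicity.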

For any calculation, we need to know the interaction term with the electromagnetic field. Based on the interaction of the field with a current

$\begin{displaymath} H_{int}=-{1\over c}j_\mu A_\mu \end{displaymath}$

and the current we have found for the Dirac equation, the interaction Hamiltonian is

$\begin{displaymath} H_{int}=ie\gamma_4\gamma_k A_k \end{displaymath}$

This is simpler than the non-relativistic case, with no $A^2$ term and only one power of $e$.

The Dirac equation has some unexpected phenomena which we can derive. Velocity eigenvalues for electrons are always $\pm c$ along any direction. Thus the only values of velocity that we could measure are $\pm c$.
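The velocity eigenvalues can be seen directly from the matrices. The sketch below assumes NumPy, units with $c=1$, and the velocity operator $v_k=ic\gamma_4\gamma_k$, which follows from the Heisenberg equation of motion $v_k={i\over\hbar}[H,x_k]$ applied to the free Hamiltonian; that operator is not written out in the text above, so treat it as part of the assumptions of this example. The direction `n` is an arbitrary unit vector.

```python
import numpy as np

c = 1.0  # assumed units for illustration

sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
zero2 = np.zeros((2, 2), dtype=complex)
one2 = np.eye(2, dtype=complex)
gammas = [np.block([[zero2, -1j * s], [1j * s, zero2]]) for s in sigma]
g4 = np.block([[one2, zero2], [zero2, -one2]])

# Velocity operators v_k = i c gamma_4 gamma_k (Hermitian in this representation)
v = [1j * c * g4 @ g for g in gammas]
eigenvalues = [np.linalg.eigvalsh(vk) for vk in v]

# The same holds along an arbitrary unit direction n
n = np.array([1.0, 2.0, -2.0]) / 3.0
v_n = n[0] * v[0] + n[1] * v[1] + n[2] * v[2]
eigenvalues_n = np.linalg.eigvalsh(v_n)

for k, ev in enumerate(eigenvalues, start=1):
    print(f"v_{k} eigenvalues:", np.round(ev, 12))
print("v along n eigenvalues:", np.round(eigenvalues_n, 12))
```

Each operator has only the eigenvalues $-c$ and $+c$, each twice, even though the expectation value of the velocity in a positive energy plane wave is $pc^2/E$.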

Localized states, expanded in plane waves, contain all four components of the plane wave solutions. Mixing components 1 and 2 with components 3 and 4 gives rise to Zitterbewegung, the very rapid oscillation of an electron's velocity and position.

$\begin{eqnarray*} \langle v_k\rangle&=&\sum\limits_{\vec{p}}\sum\limits_{r=1}^4\... ...mma_k u^{(r')}_{\vec{p}} e^{2i\vert E\vert t/\hbar}\right] \\ \end{eqnarray*}$

The last sum which contains the cross terms between negative and positive energy represents extremely high frequency oscillations in the expected value of the velocity, known as Zitterbewegung. The expected value of the position has similar rapid oscillations.

It is possible to solve the Dirac equation exactly for Hydrogen in a way very similar to the non-relativistic solution. One difference is that it is clear from the beginning that the total angular momentum is a constant of the motion and is used as a basic quantum number. There is another conserved quantum number related to the component of spin along the direction of $\vec{J}$. With these quantum numbers, the radial equation can be solved in a similar way as for the non-relativistic case, yielding the energy relation.

$\begin{displaymath} E={mc^2\over\sqrt{1+{Z^2\alpha^2\over\left(n_r+\sqrt{\left(j+{1\over 2}\right)^2-Z^2\alpha^2}\right)^2}}} \end{displaymath}$

We can identify the standard principal quantum number in this case as $n=n_r+j+{1\over 2}$. This result gives the same answer as our non-relativistic calculation to order $\alpha^4$ but is also correct to higher order. It is an exact solution to the quantum mechanics problem posed but does not include the effects of field theory, such as the Lamb shift and the anomalous magnetic moment of the electron.
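The energy relation is easy to evaluate. The sketch below uses only the Python standard library; the function names, the approximate values of $\alpha$ and $mc^2$, and the comparison formula are assumptions of this example. The comparison uses the usual expansion through order $\alpha^4$, $E\approx mc^2\left[1-{Z^2\alpha^2\over 2n^2}-{Z^4\alpha^4\over 2n^4}\left({n\over j+{1\over 2}}-{3\over 4}\right)\right]$.

```python
import math

ALPHA = 1 / 137.035999   # fine structure constant (approximate value)
MC2_EV = 510998.95       # electron rest energy in eV (approximate value)


def dirac_energy(n, j, Z=1, mc2=MC2_EV, alpha=ALPHA):
    """Exact Dirac energy for principal quantum number n and total angular momentum j."""
    n_r = n - (j + 0.5)  # from n = n_r + j + 1/2
    za2 = (Z * alpha) ** 2
    denom = n_r + math.sqrt((j + 0.5) ** 2 - za2)
    return mc2 / math.sqrt(1 + za2 / denom**2)


def fine_structure_energy(n, j, Z=1, mc2=MC2_EV, alpha=ALPHA):
    """Rest energy plus binding energy expanded through order alpha^4."""
    za2 = (Z * alpha) ** 2
    return mc2 * (1 - za2 / (2 * n**2) - za2**2 / (2 * n**4) * (n / (j + 0.5) - 0.75))


# Ground state binding energy and the n = 2 fine structure splitting, in eV
ground_binding = dirac_energy(1, 0.5) - MC2_EV
splitting_n2 = dirac_energy(2, 1.5) - dirac_energy(2, 0.5)
expansion_error = dirac_energy(2, 0.5) - fine_structure_energy(2, 0.5)

print(f"ground state binding energy: {ground_binding:.6f} eV")
print(f"2p3/2 - 2p1/2 splitting:     {splitting_n2:.3e} eV")
print(f"exact minus alpha^4 formula: {expansion_error:.3e} eV")
```

Because the formula depends only on $n$ and $j$, states such as $2s_{1\over 2}$ and $2p_{1\over 2}$ come out degenerate here; splitting them requires the field theory effects mentioned above.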

A calculation of Thomson scattering shows that even simple low energy photon scattering relies on the ``negative energy'' or positron states to get a non-zero answer. If the calculation is done with the two diagrams in which a photon is absorbed then emitted by an electron (and vice-versa), the result is zero at low energy because the interaction Hamiltonian connects the first and second plane wave states with the third and fourth at zero momentum. This is in contradiction to the classical and non-relativistic calculations as well as measurement. There are additional diagrams if we consider the possibility that the photon can create an electron-positron pair which annihilates with the initial electron, emitting a photon (or with the initial and final photons swapped). These two terms give the right answer. The calculation of Thomson scattering makes it clear that we cannot ignore the new ``negative energy'' or positron states.

The Dirac equation is invariant under charge conjugation, defined as changing electron states into the opposite charged positron states with the same momentum and spin (and changing the sign of external fields). To do this, the Dirac spinor is transformed according to

$\begin{displaymath} \psi'= \gamma_2\psi^* \end{displaymath}$

Of course a second charge conjugation operation takes the state back to the original $\psi$. Applying this to the plane wave solutions gives

$\begin{eqnarray*} \psi^{(1)}_{\vec{p}}=\sqrt{mc^2\over\vert E\vert V} u^{(1)}_{... ...{-\vec{p}} e^{i(-\vec{p}\cdot\vec{x}-\vert E\vert t)/\hbar} \\ \end{eqnarray*}$

which defines new positron spinors $v^{(1)}_{\vec{p}}$ and $v^{(2)}_{\vec{p}}$ that are charge conjugates of $u^{(1)}_{\vec{p}}$ and $u^{(2)}_{\vec{p}}$.
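The action of the charge conjugation transformation $\psi'=\gamma_2\psi^*$ can be illustrated on the rest solutions. The sketch below assumes NumPy, units with $m=c=\hbar=1$, and a normalization volume $V=1$; the function names are hypothetical. It checks that applying the transformation twice returns the original spinor and shows that the ``negative energy'' rest solutions are mapped onto spinors with the conventional $e^{-imc^2t/\hbar}$ time dependence.

```python
import numpy as np

# Assumed units for illustration: m = c = hbar = 1 and normalization volume V = 1.
m, c, hbar = 1.0, 1.0, 1.0

# gamma_2 in the explicit representation used in the text (real and symmetric)
g2 = np.array(
    [[0, 0, 0, -1], [0, 0, 1, 0], [0, 1, 0, 0], [-1, 0, 0, 0]], dtype=complex
)


def charge_conjugate(psi):
    """psi' = gamma_2 psi^*"""
    return g2 @ np.conj(psi)


def rest_solution(r, t):
    """Rest solution r = 1..4 at time t: unit spinor times exp(-/+ i m c^2 t / hbar)."""
    spinor = np.zeros(4, dtype=complex)
    spinor[r - 1] = 1.0
    sign = -1 if r in (1, 2) else +1
    return spinor * np.exp(sign * 1j * m * c**2 * t / hbar)


t = 0.37  # arbitrary time
psi = np.array([0.2 + 0.1j, -0.5j, 0.7, 0.3 - 0.4j])  # arbitrary test spinor

# Two charge conjugations return the original spinor
double_conjugation_ok = np.allclose(charge_conjugate(charge_conjugate(psi)), psi)

# Solutions 3 and 4 map onto positive-frequency spinors of type 2 and 1
maps_3_to_2 = np.allclose(charge_conjugate(rest_solution(3, t)), rest_solution(2, t))
maps_4_to_1 = np.allclose(charge_conjugate(rest_solution(4, t)), -rest_solution(1, t))

print(double_conjugation_ok, maps_3_to_2, maps_4_to_1)
```

The overall sign in the last check is a phase convention and has no physical consequence.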

Jim Branson 2013-04-22