Photon polarization is the quantum mechanical description of the classical polarized sinusoidal plane electromagnetic wave. Individual photons are completely polarized. Their polarization state can be elliptical, circular, or linear. The description contains many of the physical concepts and much of the mathematical machinery of more involved quantum descriptions, such as the quantum mechanics of an electron in a potential well, and forms a fundamental basis for an understanding of more complicated quantum phenomena. Much of the mathematical machinery of quantum mechanics, such as state vectors, probability amplitudes, unitary operators, and Hermitian operators, emerge naturally from the classical Maxwell's equations in the description. The quantum polarization state vector for the photon, for instance, is identical with the Jones vector, usually used to describe the polarization of a classical wave. Unitary operators emerge from the classical requirement of the conservation of energy of a classical wave propagating through media that alter the polarization state of the wave. Hermitian operators then follow for infinitesimal transformations of a classical polarization state. Many of the implications of the mathematical machinery are easily verified experimentally. In fact, many of the experiments can be performed with two pairs (or one broken pair) of polaroid sunglasses.
The connection with quantum mechanics is made through the identification of a minimum packet size, called a photon, for energy in the electromagnetic field. The identification is based on the experiments of Planck and the interpretation of those experiments by Einstein. The correspondence principle then allows the identification of momentum and angular momentum (called spin), as well as energy, with the photon.
The polarization of a classical sinusoidal plane wave traveling in the z direction can be characterized by the Jones vector
where the angle θ describes the relation between the amplitudes of the electric fields in the x and y directions.
and
This implies
The angles αx and αy characterize the phase relationship between the wave polarized in x and the wave polarized in y.
are known, then the state of the wave is completely characterized.
Here ω is the angular frequency of the wave, and c is the speed of light.
The notation for the Jones vector is the bra-ket notation of Dirac, which is normally used in a quantum context. The quantum notation is used here in anticipation of the interpretation of the Jones vector as a quantum state vector.
The Jones vector has a dual given by
The Jones vector is normalized. The inner product of the vector with itself is
This represents a wave with phase α polarized at an angle θ with respect to the x axis. In that case the Jones vector can be written
The state vectors for linear polarization in x or y are special cases of this state vector.
If unit vectors are defined such that
and
then the polarization state can written in the "x-y basis" as
If αy is rotated by π / 2 radians with respect to αx and the x amplitude equals the y amplitude the wave is circularly polarized. The Jones vector is
where the plus sign indicates right circular polarization and the minus sign indicates left circular polarization. In the case of circular polarization, the electric field vector of constant magnitude rotates in the x-y plane.
If unit vectors are defined such that
and
then a circular polarization state can written in the "R-L basis" as
where
and
Any arbitrary state can be written in the R-L basis
where
The general case in which the electric field rotates in the x-y plane and has variable magnitude is called elliptical polarization. The state vector is given by
The energy per unit volume in classical electromagnetic fields is (cgs units)
For a plane wave, this becomes
where the energy has been averaged over a wavelength of the wave.
The fraction of energy in the x component of the plane wave is
with a similar expression for the y component resulting in fy = sin2θ.
The fraction in both components is
The momentum density is given by the Poynting vector
For a sinusoidal plane wave traveling in the z direction, the momentum is in the z direction and is related to the energy density:
The momentum density has been averaged over a wavelength.
The angular momentum density is
For a sinusoidal plane wave propagating along z axis the angular momentum is in the z direction and is given by
where again the density is averaged over a wavelength.
A polaroid filter transmits one component of a plane wave and absorbs the perpendicular component. In that case, if the filter is polarized in the x direction, the fraction of energy passing through the filter is
An ideal birefringent crystal transforms the polarization state of an electromagnetic wave without loss of wave energy. Birefringent crystals therefore provide an ideal test bed for examining the conservative transformation of polarization states. Even though this treatment is still purely classical, standard quantum tools such as unitary and Hermitian operators that evolve the state in time naturally emerge.
A birefringent crystal is a material that has an optic axis with the property that the light has a different index of refraction for light polarized parallel to the axis than it has for light polarized perpendicular to the axis. Light polarized parallel to the axis are called "extraordinary rays." Light polarized perpendicular to the axis are called "ordinary rays." If a linearly polarized wave impinges on the crystal, the extraordinary component of the wave will emerge from the crystal with a different phase than the ordinary component. In mathematical language, if the incident wave is linearly polarized at an angle θ with respect to the optic axis, the incident state vector can be written
and the state vector for the emerging wave can be written
While the initial state was linearly polarized, the final state is elliptically polarized. The birefringent crystal alters the character of the polarization.
The initial polarization state is transformed into the final state with the operator U. The dual of the final state is given by
The fraction of energy that emerges from the crystal is
In this ideal case, all the energy impinging on the crystal emerges from the crystal. An operator U with the property that
where I is the identity operator and U is called a unitary operator. The unitary property is necessary to ensure energy conservation in state transformations.
If the crystal is very thin, the final state will be only slightly different from the initial state. The unitary operator will be close to the identity operator. We can define the operator H by
and the adjoint by
Energy conservation then requires
This requires that
Operators like this that are equal to their adjoints are called Hermitian or self-adjoint.
The infinitesimal transition of the polarization state is
Thus, energy conservation requires that infinitesimal transformations of a polarization state occur through the action of a Hermitian operator.
The treatment to this point has been classical. It is a testament, however, to the generality of Maxwell's equations for electrodynamics that the treatment can be made quantum mechanical with only a reinterpretation of classical quantities. The reinterpretation is based on the experiments of Max Planck and the interpretation of those experiments by Albert Einstein.
The important conclusion from these early experiments is that electromagnetic radiation is composed of irreducible packets of energy, known as photons. The energy of each packet is related to the angular frequency of the wave by the relation
and the energy density is
The energy of a photon can be related to classical fields through the correspondence principle which states that for a large number of photons, the quantum and classical treatments must agree. Thus, for very large N, the quantum energy density must be the same as the classical energy density
The number of photons in the box is then
The correspondence principle also determines the momentum and angular momentum of the photon. For momentum
which implies that the momentum of a photon is
Similarly for the angular momentum
which implies that the angular momentum of the photon is
The expected value of a spin measurement on a photon is then
An operator S has been associated with an observable quantity, the angular momentum. The eigenvalues of the operator are the allowed observable values. This has been demonstrated for angular momentum, but it is in general true for any observable quantity.
We can write the circularly polarized states as
where s=1 for
and s= -1 for
An arbitrary state can be written
where
When the state is written in spin notation, the spin operator can be written
The eigenvectors of the differential spin operator are
To see this note
The angular momentum operator is
There are two ways in which probability can be applied to the behavior of photons; probability can be used to calculate the probable number of photons in a particular state, or probability can be used to calculate the likelihood of a single photon to be in a particular state. The former interpretation violates energy conservation. The latter interpretation is the viable, if nonintuitive, option. Dirac explains this in the context of the double-slit experiment:
Some time before the discovery of quantum mechanics people realized that the connection between light waves and photons must be of a statistical character. What they did not clearly realize, however, was that the wave function gives information about the probability of one photon being in a particular place and not the probable number of photons in that place. The importance of the distinction can be made clear in the following way. Suppose we have a beam of light consisting of a large number of photons split up into two components of equal intensity. On the assumption that the beam is connected with the probable number of photons in it, we should have half the total number going into each component. If the two components are now made to interfere, we should require a photon in one component to be able to interfere with one in the other. Sometimes these two photons would have to annihilate one another and other times they would have to produce four photons. This would contradict the conservation of energy. The new theory, which connects the wave function with probabilities for one photon gets over the difficulty by making each photon go partly into each of the two components. Each photon then interferes only with itself. Interference between two different photons never occurs.
—Paul Dirac, The Principles of Quantum Mechanics, Fourth Edition, Chapter 1
The probability for a photon to be in a particular polarization state depends on the fields as calculated by the classical Maxwell's equations. The polarization state of the photon is proportional to the field. The probability itself is quadratic in the fields and consequently is also quadratic in the quantum state of polarization. In quantum mechanics, therefore, the state or probability amplitude contains the basic probability information. In general, the rules for combining probability amplitudes look very much like the classical rules for composition of probabilities: [The following quote is from Baym, Chapter 1]
For any legal operators the following inequality, a consequence of the Cauchy-Schwarz inequality, is true.
If B A ψ and A B ψ are defined then
where
is the operator mean of observable X in the system state ψ and
Here
is called the commutator of A and B.
This is a purely mathematical result. No reference has been made to any physical quantity or principle. It simply states that the uncertainty of an operator acting on a state times the uncertainty of another operator acting on the state is not necessarily zero.
The connection to physics can be made if we identify the operators with physical operators such as the angular momentum and the polarization angle. We have then
which simply states that angular momentum and the polarization angle cannot be measured simultaneously with infinite accuracy.
Much of the mathematical apparatus of quantum mechanics appears in the classical description of a polarized sinusoidal electromagnetic wave. The Jones vector for a classical wave, for instance, is identical with the quantum polarization state vector for a photon. The right and left circular components of the Jones vector can be interpreted as probability amplitudes of spin states of the photon. Energy conservation requires that the states be transformed with a unitary operation. This implies that infinitesimal transformations are transformed with a Hermitian operator. These conclusions are a natural consequence of the structure of Maxwell's equations for classical waves.
Quantum mechanics enters the picture when observed quantities are measured and found to be discrete rather than continuous. The allowed observable values are determined by the eigenvalues of the operators associated with the observable. In the case angular momentum, for instance, the allowed observable values are the eigenvalues of the spin operator.
These concepts have emerged naturally from Maxwell's equations and Planck's experiments. They have been found to be true for many other physical systems. In fact, the typical program is to assume the concepts of this section and then to infer the unknown dynamics of a physical system. This was done, for instance, with the dynamics of electrons. In that case, working back from the principles in this section, the quantum dynamics of particles were inferred, leading to Schrödinger's equation, a departure from Newtonian mechanics. The solution of this equation for atoms led to the explanation of the Balmer series for atomic spectra and consequently formed a basis for all of atomic physics and chemistry.
This is not the only occasion in which Maxwell's equations have forced a restructuring of Newtonian mechanics. Maxwell's equations are relativistically consistent. Special relativity resulted from attempts to make classical mechanics consistent with Maxwell's equations (see, for example, Moving magnet and conductor problem).