What is a wave function and what are the requirements for it to be well-behaved, i.e. for it to properly represent physical reality?

2 Answers
Aug 3, 2015

The wavefunction is a complex valued function of which the amplitude (absolute value) gives the probability distribution. However it does not behave in the same way as an ordinary wave.

Explanation:

In quantum mechanics, we talk about the state of a system. One of the simplest examples is a particle that can be in an up or down spin, for instance an electron. When we measure the spin of a system, we either measure it to be up or down. A state by which we are certain of the outcome of the measurement, we call an eigenstate (one up state uarr and one down state darr).

There are also states where we are uncertain of the outcome of the measurement before we measure it. These states we call a superposition and we can write them down as a*uarr+b*darr. Here we have |a|^2 the probability of measuring uarr, and |b|^2 the probability of measuring darr. This means of course that |a|^2+|b|^2=1. We allow a,b to be complex numbers, the reason for this is not immediately clear from this example, but in the context of the wavefunction it will be more clear. The bottom line is that there are more states than one giving the same probabilities for measuring the spins.

Now we could try to assign a function to this spin state. Since there are only two outcomes of the measurement of the spin, we have a function that has only two possible inputs. If we call the function psi (this is a very conventional symbol used for a wavefuntion), we set psi(uarr)=a and psi(darr)=b.

Now we turn to the wavefunction. One aspect of a particle is of course its location. Just like in the case of spin, we can measure distinct values for the location, and we can have states in which the outcome of the measurement is not fixed beforehand. Since we have an uncountable infinity amount of locations where a particle can be, writing down this state as a*"here"+b*"there" won't do. However, the idea of the function that we have used above does. So for any location x, we have a complex value psi(x). The probability density function of the particle is now given by |psi(x)|^2.

In all fairness, historically the idea of the wavefunction is older than that of the spin, but I think understanding the idea of spin to a certain degree helps in the understanding of the wavefunction.

Now first of all, why is the wavefunction complex valued? The first reason can be found in the idea of interference. The wavefunction of a particle can interfere with itself. This interference has to do with adding up wavefunctions, if the wavefunctions give the same absolute value at a certain point, then the probability of measuring a particle around that point is similar. However the function values can be different, if they are the same, adding them up will make the amplitude, or probability density 4 (|2|^2) times bigger (constructive interference), and if they differ by a sign they negate each other (destructive interference). However the can also differ by for instance a factor i, meaning that the probability density becomes 2 times bigger at that point. We know that all of these interferences can occur. So this points towards a complex valued wavefunction as described earlier.

The second reason can be found in the Schrödinger equation. Initially it was thought that these wavefunctions behaved just like classical waves. However, when Schrödinger tried to describe the behaviour of these waves, or at least their evolution through time, he found that the equation governing classical waves was not adequate. In order for it to work, he had to introduce a complex number into the equation, leading to the conclusion that the function itself has to be complex as well, and the order of the derivatives appearing in the equation differs from the classical wave equation.

This difference in the equations also answers your second question. Since the evolution of the wavefunction differs so much from that of classical waves, we cannot use the same methods we use in classical wave physics. There are of course geometrical arguments you can use, but it will not be enough to describe all of the phenomena in quantum physics. Besides, even though the wavefunction gives a lot of information about the state of a particle, it tells you nothing about its spin, since the observables spin and location have little to do with eachother.

Perhaps I'm interpreting what you mean by a geometric nature wrongly. Could you perhaps give an example of what you mean. Perhaps then I could help you further.

Feb 4, 2016

The wave function represents the state of a quantum mechanical system such as an atom or a molecule.

It can be represented as either psi, the time-independent wave function, or Psi, the time-dependent wave function.

Because the wave function evidently represents a system that behaves like a wave (it's no coincidence that it's called the wave function!), we would normally expect an unrestricted wave function to have no boundaries. Consider the fact that sinx and cosx, two functions that are clearly waves, have domains of (-oo,oo).

EXAMPLE: THE WAVE FUNCTION FOR ORBITALS

However, let's take orbitals for example. There must be a set of boundary conditions for an orbital, because obviously orbitals aren't infinitely large.

A wave function can depict the linear combination of atomic orbitals to form molecular orbitals:

color(blue)(psi_("MO")) = sum_(i) c_iphi_i^"AO"

= color(blue)(c_1phi_(1s) + c_2phi_(2s) + c_3phi_(2px) + c_4phi_(2py) + c_5phi_(2pz) + . . . )

where c_i is the expansion coefficient indicating the contribution of each atomic orbital to the particular molecular orbital in question, and phi_i^"AO" is the experimental/trial wave function for each atomic orbital.

Since a wave function must be able to represent an orbital, it must have a positive radius (r > 0) and the wave function must be single-valued, closed, continuous, orthogonal to all related wave functions, and normalizable.

In other words, it must pass the vertical line test, have a finite area under the curve, have no jumps/discontinuities/asymptotes/breaks, and satisfy the following two equations:

int_"allspace" psi_A^"*"psi_Bd tau = 0
(the integral of a wave function and its complex conjugate is 0 if the wave functions are different)

int_"allspace" psi_A^"*"psi_Ad tau = 1
(the integral of a wave function and its complex conjugate is normalized such that it equals 1 if the wave functions are the same besides the sign of pmi)

One example equation for the wave function in spherical coordinates for the hydrogen atom is:

color(blue)(psi_(2pz)(r,theta,phi)) = R_(21)(r)Y_(1)^(0)(theta,phi)

= color(blue)(1/(sqrt(32pi))(Z/(a_0))^("3/2")((Zr)/(a_0))e^(-Zr//2a_0)costheta)

To think, I actually spent time to normalize this. I even took the time to check for orthogonality with the other two 2p wave functions. :P


Just in case, here is an appendix of what I have linked above in Scratchpads.

" "" "" "APPENDIX

Normalization of the 2p_z wave function

The 2p_z atomic orbital wavefunction is:

psi_(2pz)

= R_(nl)(r)Y_(l)^(m)(theta,phi) = R_(21)(r)Y_(1)^(0)(theta,phi)

= 1/sqrt(32pi) (Z/(a_0))^(3/2) (Zr)/(a_0)e^(-(Zr)/(2a_0))costheta
(McQuarrie)

Is the 2p_z wavefunction really normalized? LET'S FIND OUT!

\mathbf(int_(0)^(oo) R_(nl)^"*"(r)R_(nl)(r)r^2dr int_(0)^(pi) Y_(l)^(m)(theta,phi) sintheta int_(0)^(2pi) dphi stackrel(?)(=) 1)

[1/sqrt(32pi) (Z/(a_0))^(5/2)]^2 int_(0)^(oo) e^(-(Zr)/(a_0)) r^4dr int_(0)^(pi) sinthetacos^2thetad theta int_(0)^(2pi) dphi stackrel(?)(=) 1

color(green)(1/(32pi) (Z/a_0)^5 int_(0)^(oo) e^(-(Zr)/(a_0)) r^4dr stackrel(= "2/3")(overbrace(int_(0)^(pi) sinthetacos^2thetad theta)) stackrel(= 2pi)(overbrace(int_(0)^(2pi) dphi)) stackrel(?)(=) 1)

Now, examining only the radial part, which is the crazy part... let the quadruple Integration by Parts commence!

EVALUATION OF THE RADIAL COMPONENT OF THE WAVE FUNCTION

Part 1

int_(0)^(oo) e^(-(Zr)/(a_0)) r^4dr

Let:
u = r^4
dv = e^(-(Zr)/(a_0))dr
v = -(a_0)/Ze^(-(Zr)/(a_0))
du = 4r^3dr

= -(a_0)/Ze^(-(Zr)/(a_0))r^4 - int -(a_0)/Ze^(-(Zr)/(a_0))4r^3dr

= -(a_0)/Z {e^(-(Zr)/(a_0))r^4 - 4int e^(-(Zr)/(a_0))r^3dr}

Part 2

Let:
u = r^3
dv = e^(-(Zr)/(a_0))dr
v = -(a_0)/Ze^(-(Zr)/(a_0))
du = 3r^2dr

= -(a_0)/Z {e^(-(Zr)/(a_0))r^4 - 4[-(a_0)/Ze^(-(Zr)/(a_0))r^3 - 3int -(a_0)/Ze^(-(Zr)/(a_0))r^2dr ]}

= -(a_0)/Z {e^(-(Zr)/(a_0))r^4 + (4a_0)/Z[e^(-(Zr)/(a_0))r^3 - 3int e^(-(Zr)/(a_0))r^2dr ]}

Part 3

Let:
u = r^2
dv = e^(-(Zr)/(a_0))dr
v = -(a_0)/Ze^(-(Zr)/(a_0))
du = 2rdr

= -(a_0)/Z {e^(-(Zr)/(a_0))r^4 + (4a_0)/Z[e^(-(Zr)/(a_0))r^3 - 3[-(a_0)/Ze^(-(Zr)/(a_0))r^2 - 2int -(a_0)/Ze^(-(Zr)/(a_0))rdr]]}

= -(a_0)/Z {e^(-(Zr)/(a_0))r^4 + (4a_0)/Z[e^(-(Zr)/(a_0))r^3 + (3a_0)/Z[e^(-(Zr)/(a_0))r^2 - 2int e^(-(Zr)/(a_0))rdr]]}

Part 4

Let:
u = r
dv = e^(-(Zr)/(a_0))dr
v = -(a_0)/Ze^(-(Zr)/(a_0))
du = dr

= -(a_0)/Z {e^(-(Zr)/(a_0))r^4 + (4a_0)/Z[e^(-(Zr)/(a_0))r^3 + (3a_0)/Z[e^(-(Zr)/(a_0))r^2 - 2{-(a_0)/Ze^(-(Zr)/(a_0))r - int -(a_0)/Ze^(-(Zr)/(a_0))dr}]]}

= -(a_0)/Z {e^(-(Zr)/(a_0))r^4 + (4a_0)/Z[e^(-(Zr)/(a_0))r^3 + (3a_0)/Z[e^(-(Zr)/(a_0))r^2 + (2a_0)/Z{e^(-(Zr)/(a_0))r - int e^(-(Zr)/(a_0))dr}]]}

EXPANSION/SIMPLIFICATION

= -(a_0)/Ze^(-(Zr)/(a_0))r^4 - 4((a_0)/Z)^2[e^(-(Zr)/(a_0))r^3 + (3a_0)/Z[e^(-(Zr)/(a_0))r^2 + (2a_0)/Z{e^(-(Zr)/(a_0))r + (a_0)/Ze^(-(Zr)/(a_0))}]]

= -(a_0)/Ze^(-(Zr)/(a_0))r^4 - ((a_0)/Z)^2 e^(-(Zr)/(a_0))4r^3 - 12((a_0)/Z)^3[e^(-(Zr)/(a_0))r^2 - (2a_0)/Z{e^(-(Zr)/(a_0))r + (a_0)/Ze^(-(Zr)/(a_0))}]

= -(a_0)/Ze^(-(Zr)/(a_0))r^4 - ((a_0)/Z)^2 e^(-(Zr)/(a_0))4r^3 - 12((a_0)/Z)^3e^(-(Zr)/(a_0))r^2 - 24((a_0)/Z)^4{e^(-(Zr)/(a_0))r + (a_0)/Ze^(-(Zr)/(a_0))}

= -(a_0)/Ze^(-(Zr)/(a_0))r^4 - ((a_0)/Z)^2 e^(-(Zr)/(a_0))4r^3 - ((a_0)/Z)^3e^(-(Zr)/(a_0))12r^2 - ((a_0)/Z)^4e^(-(Zr)/(a_0))24r - 24((a_0)/Z)^5 e^(-(Zr)/(a_0))

EVALUATION-READY FORM

= |[-e^(-(Zr)/(a_0))[(a_0)/Z r^4 + 4((a_0)/Z)^2 r^3 + 12((a_0)/Z)^3 r^2 + 24((a_0)/Z)^4 r + 24((a_0)/Z)^5]]|_(0)^(oo)

First half cancels out to be 0:

= cancel({-e^(-(Zoo)/(a_0))[(a_0)/Z oo^4 + 4((a_0)/Z)^2 oo^3 + 12((a_0)/Z)^3 oo^2 + 24((a_0)/Z)^4 oo + 24((a_0)/Z)^5]})^(0) - {-e^(-(Z(0))/(a_0))[(a_0)/Z (0)^4 + 4((a_0)/Z)^2 (0)^3 + 12((a_0)/Z)^3 (0)^2 + 24((a_0)/Z)^4 (0) + 24((a_0)/Z)^5]}

Second half simplifies down to be 1*(0+0+0+0+24((a_0)/(Z))^5):

= cancel(e^(-(Z(0))/(a_0)))^(1)[cancel((a_0)/Z (0)^4)^(0) + cancel(4((a_0)/Z)^2 (0)^3)^(0) + cancel(12((a_0)/Z)^3 (0)^2)^(0) + cancel(24((a_0)/Z)^4 (0))^(0) + 24((a_0)/Z)^5]

= 24(a_0/Z)^5

Now, let us re-examine the wave function as a whole...

psi_(2pz)

= 1/(32pi) (Z/a_0)^5 (24(a_0/Z)^5) (2/3) (2pi) stackrel(?)(=) 1

= 1/(cancel(32)cancel(pi)) cancel((Z/a_0)^5) (cancel(16)cancel((a_0/Z)^5))(cancel(2)cancel(pi)) stackrel(?)(=) 1

color(blue)(1 = 1)

YES! ONE DOES EQUAL ONE! I mean...

The wave function is indeed normalized! :D

Proving mutual orthogonality for the 2p wave functions

Let us choose the following wavefunctions:

psi_(2px) = 1/(sqrt(32pi))(Z/(a_0))^"3/2" (Zr)/(a_0)e^(-"Zr/"2a_0)sinthetacosphi

psi_(2py) = 1/(sqrt(32pi))(Z/(a_0))^"3/2" (Zr)/(a_0)e^(-"Zr/"2a_0)sinthetasinphi

psi_(2pz) = 1/(sqrt(32pi))(Z/(a_0))^"3/2" (Zr)/(a_0)e^(-"Zr/"2a_0)costheta

To show they are orthogonal, we need to show at least one of them:

int_("all space") psi_(2px)^"*" psi_(2pz)d tau = 0

And from induction we can imply the rest since the radial components are identical. In other words:

\mathbf(int_(0)^(oo) R_(nl,2px)^"*"(r)R_(nl,2pz)(r)r^2dr int_(0)^(pi) Y_(l)^(m)(theta)sintheta int_(0)^(2pi) Y_(l)^(m)(phi)dphi stackrel(?)(=) 0)

color(green)(1/(32pi)(Z/(a_0))^5 int_(0)^(oo) e^(-"Zr/"a_0) r^4dr int_(0)^(pi) sin^2thetacosthetad theta int_(0)^(2pi) cosphidphi stackrel(?)(=) 0)

The radial portion turns out to be 24 ((a_0)/Z)^5. So, let us evaluate the angular portions.

The theta portion:

color(green)(int_(0)^(pi) sin^2thetacosthetad theta)

Let:
u = sintheta
du = costhetad theta

= int_(0)^(pi) u^2du

= 1/3*|[sin^3theta]|_(0)^(pi)

= 1/3*[sin^3(pi) - sin^3(0)]

= 1/3*[0 - 0] = color(green)(0)

And now the phi portion:

color(green)(int_(0)^(2pi) cosphidphi)

= |[sinphi]|_(0)^(2pi)

= sin(2pi) - sin(0)

Let:
u = sintheta
du = costhetad theta

= int_(0)^(pi) u^2du

= 0 - 0 = color(green)(0)

Therefore, we have overall:

color(blue)(1/(32pi)(Z/(a_0))^5 int_(0)^(oo) e^(-"Zr/"a_0) r^4dr int_(0)^(pi) sin^2thetacosthetad theta int_(0)^(2pi) cosphidphi)

= cancel(1/(32pi)(Z/(a_0))^5 (24)((a_0)/Z)^5 (0)(0))^(0)

= color(blue)(0)

Since

int_("all space") psi_(2px)^"*" psi_(2pz)d tau = 0

the 2p_z and 2p_x atomic orbitals are orthogonal.

Really, the main difference with using the 2p_y equation is that you instead get:

color(green)("Constants" int_(0)^(oo) "Same stuff"dr int_(0)^(pi) sin^3thetad theta int_(0)^(2pi) sinphicosphidphi stackrel(?)(=) 0)

And so:

color(blue)(int_(0)^(2pi) sinphicosphidphi)

= 1/2|[sin^2phi]|_(0)^(2pi)

= 1/2[sin^2(2pi) - sin^2(0)] = color(blue)(0)

From multiplying 0 by the other integrals, thus the whole integral disappears and:

int_("all space") psi_(2px)^"*" psi_(2py)d tau = 0

thus, the 2p_x and 2p_y atomic orbitals are orthogonal.

Finally, for the 2p_y vs. the 2p_z:

color(green)("Constants" int_(0)^(oo) "Same stuff"dr int_(0)^(pi) sin^2thetacosthetad theta int_(0)^(2pi) sinphidphi stackrel(?)(=) 0)

We know the theta integral from before:

color(blue)(int_(0)^(pi) sin^2thetacosthetad theta)

= 1/3*|[sin^3theta]|_(0)^(pi)

= 1/3*[sin^3(pi) - sin^3(0)]

= 1/3*[0 - 0] = color(blue)(0)

And so the whole integral disappears again, and indeed the 2p_y and 2p_z orbitals are orthogonal as well!