Thouless Theorem26 Nov 2013
Kind of a dry post today (hey, I put in a cat picture at least)…but I had to share a theorem – the Thouless theorem – that is so important in computational and quantum chemistry, and is really quite unknown. It ties together Hartree-Fock, Brillouin’s theorem, the successes of coupled cluster, and even some MCSCF ideas. It basically explains why single excitation operators are so important and useful in quantum chemistry, and what they are actually doing when we run our calculations and devise new methods. For someone working closely in electronic structure, it is hard to understate the importance of this theorem. Also the proof is kind of hard to a) find and, b) understand. So I tried to make it simpler and more accessible!
The Thouless theorem ( Thouless, 1960) says that the effect of the operator is to transform any single determinant into any other (non-orthogonal!) single determinant. In other words, we can write
In other words, we can generate any single determinant in terms of another determinant with virtual orbitals (or single excitations) mixed in with it. This is sometimes called orbital rotation’’. If we minimize the energy of the Slater determinant with respect to the rotation parameters (or amplitudes), we get the Hartree-Fock wavefunction out. This is the Brillouins condition, and says that the Hartree Fock reference will not interact (directly) with singly excited determinants. And of course it shouldn’t! We just variationally optimized with respect to those parameters! The Thouless theorem also explains why there is no such thing as a coupled cluster singles equation (it gives Hartree-Fock back). Now, we do include single excitations in CCSD, but they work themselves in because of correlation generated by higher excitations (Ds or Ts, etc.) The Thouless theorem explains why CCSD and higher methods are so insensitive to reference choice; you generate the optimal reference through the operator. I’m sure there are plenty more uses (you see them a lot in MCSCF actually) that exist in the quantum chemistry literature.
Let’s say we have a single N-electron Slater determinant, written in second quantization as
where is the number of electrons in our aptly named N-electron wavefunction, and is a vacuum. Let’s say we know what this wavefunction is, which is to say it will be our reference determinant. This might be the result of a Hartree-Fock calculation for example. Now say we want to make a new single determinant in terms of our reference determinant (I’ll use determinant and wavefunction interchangably). First we write our wavefunction:
The difference between and is simply that they create electrons in different (though not orthonormal) orbitals in our Slater determinant. We will assume that the sets of creation (and corresponding annihilation) operators form complete sets and are non-orthogonal with respect to each other. If this is the case we can write one set in terms of the other like so:
This is just a change of basis operation. See, for example, Szabo and Ostlund p13. This suggests that we can write
Following the presentation of Thouless, we then split the sum over occupied and virtual spaces (with respect to the reference):
Alright. Since we assumed and were not orthogonal, we can choose an intermediate normalization, setting
Which further implies (again, check out Szabo and Ostlund p13) that the transformation matrix composed of is unitary when and run from 1 to (remember is actually a rectangular matrix according to our change of basis definition in 5. This means the square subsection of is invertible, and we represent its inverse as . Here’s another way to look at it, as well as derive some of the properties of and :
and that odd looking matrix is the result of the parts of that extend past the number of electrons :
Where and for the above equation. Now we put it all together. Because is unitary, we can write linearly independent combinations of the creation operators, each of which we’ll call . (Yes, we have now introduced three sets of creation operators, and yes this is essentially what Thouless does, but one of the sets is intermediate to our results and then we will go back to relating two sets. Trust me.)
Now the fun happens and using equations 5,10, 11, and 12 we can rewrite the above in terms of just our reference creation and annihilation operators
In other words, any new orbital may be generated by mixing in contributions from virtual orbitals. We apply this to generating any new single determinant below.
We got to Eq. (22) from Eq. (21) by realizing that , and . The fact that all terms in which the same creation operator occurs more than once is also the reason we can write the infinite product as an exponential.
HT to Ismail Aydin and Matthias Degroote for pointing out some typos that have now been fixed!