Lax pair

It happens that sometimes integrable systems could generally be viewed as those equations that can be obtained from an overdetermined system of linear differential equations, with the original PDE playing the role of the compatibility condition. The overdetermined linear system is the Lax pair associated to the equation
This is by no means a general construction: in practice, there are no general methods of computing a Lax pair for a given system of PDEs. However, when a Lax pair for a system is known, it is enormously useful. One can use Lax pairs to derive infinitely many conserved quantities, describe large classes of solutions, and play a crucial role in the celebrated Inverse Scattering Method.

Note: it seems to me that to say that a PDE system admits a Lax pair we only need that the PDE system implies the Lax equation [L,M]=0. I guess this is enough to take advantage of the Lax pair...

Finite degree of freedom

Consider a nonlinear system of ODEs

x˙i=Fi,

for example one arising in Hamiltonian mechanics,

(1)p˙=Hq,q˙=Hp.

A Lax pair for this system is a pair of matrices L and M that satisfy the Lax equation:

(2)dLdt=[L,M]

where [L,M]=LMML is the commutator of M and L, and dLdt is the time derivative of L. The entries of L and M are typically expressed in terms of the variables xi, and we require that equation (1) is satisfied if and only if (2) is satisfied.

The exact form of L and M depends on the specific system under consideration, and is not unique. For instance, in the case of the simple harmonic oscillator with Hamiltonian H=p22m+12mω2q2, we can choose:

L=(p/mωqωqp/m),M=(0ω/2ω/20)

Then, the Lax equation dLdt=[L,M] reproduces the equations of motion for the harmonic oscillator:

p˙=mω2qq˙=pm

The key point is that the eigenvalues of the matrix L are conserved quantities, i.e., constants of motion (see isospectral property below), and the eigenvectors provide a "good" transformation of variables (????).

Isospectral property

The eigenvalues of L do not change with time, a property known as isospectrality.
Due to the cyclic property of the trace, we can show that the trace of all the powers of L are conserved quantities.
In effect, the trace of L is conserved, since

ddttr(L)=tr([L,M])=0.

To show ddttr(L2)=0, observe

ddttr(L2)=tr(ddtL2)=tr(LdLdt+dLdtL),

where we used the product rule. Substituting the Lax equation for dLdt gives:

ddttr(L2)=tr(L(LMML)+(LMML)L)==tr(L2MLML+LMLMLL)=tr(L2MMLL)=0.

The same is true for ddttr(L3)=0 and son on.

To see that the eigenvalues of L are conserved, only observe that the eigenvalues are determined by the trace of the powers of the matrix.

In the previous example of the harmonic oscillator we can find the eigenvalues λ and eigenvectors ψ of the L matrix. We solve the characteristic equation:

det(LλI)=0

where I is the identity matrix. This gives

det(p/mλωqωqp/mλ)=0

which simplifies to

(p2/m2+λ2)ω2q2=0

We can solve this equation for λ to get

λ=±(p2/m2+ω2q2)

These are the eigenvalues of the L matrix, which correspond to the given Hamiltonian.

Spectral parameter

In many cases, Lax pairs depend on an auxiliary variable, the so-called spectral parameter, which is not directly related to the dynamics of the model. The Lax pair L(u), M(u) then obeys the Lax equation at all values of uC:

ddtL(u)=[M(u),L(u)] for all uC.

Such a Lax pair, called nonisospectral, must be constructed, as always, such that this equation is equivalent to the complete set of equations of motion. As a functional equation, it is, in principle, much more constraining than the Lax equation without spectral parameter. This feature is useful for mechanical systems with infinitely many degrees of freedom whose equations of motion could thus be formulated by a finite-dimensional Lax pair.

Even for a finite-dimensional system, Lax pairs with spectral parameter often exist. While the spectral parameter is not essential to encode finitely many equations of motion, it is nevertheless useful in several respects.

Infinite degrees of freedom (fields)

Idea: see also this paragraph.

In this case we have PDEs, for example evolution equations like KdV

ut6uux+uxxx=0,

the transport equation, etc, we can apply similar ideas, but now L,M are operators on a Hilbert space instead of matrices.
For the KdV example we have:

L=x2+u(x,t)M=4x3+6ux+3ux

In this cases, Lax equation Lt=[L,M] can be replaced by [L,M]=0 if we replace M by M+t (simple computations).

Isospectral property

In this case, it can then be shown that the eigenvalues and more generally the spectrum of L are independent of t in the following way.

The matrices L(t) are all similar by virtue of the existence of matrices U(t,s) such that

L(t)=U(t,s)L(s)U(t,s)1

Observe that in this case, the eigenvalues of L(t) are the same as those of L(0).
The existence of U(t,s) is due to the existence of solution for the Cauchy problem

ddtU(t,s)=M(t)U(t,s),U(s,s)=I,

where I denotes the identity matrix. (I guess there is a kind of result for general operators!!??)

Indeed, observe that in that case

ddt(U(t,s)1L(t)U(t,s))==U(t,s)1dU(t,s)dtU(t,s)1L(t)U(t,s)+U(t,s)1dL(t)dtU(t,s)+U(t,s)1L(t)dU(t,s)dt=U(t,s)1M(t)L(t)U(t,s)+U(t,s)1dL(t)dtU(t,s)U(t,s)1L(t)M(t)U(t,s)=U(t,s)1[L(t),M(t)]U(t,s)+U(t,s)1dL(t)dtU(t,s)=U(t,s)1(dL(t)dt[L(t),M(t)])U(t,s)=0.

This means that the operator U(t,s)1L(t)U(t,s) does not change with time, and therefore, it must be equal to a constant matrix. Because U(s,s)=I, we have

U(s,s)1L(s)U(s,s)=L(s).

and then L(t)=U(t,s)L(s)U(t,s)1.

Lax equation as a compatibility condition

The Lax equation,

(1)dLdt=[L,M]

is equivalent to the two linear compatibility conditions,

(2)Lψ=λψ

and

(3)ψt=Mψ

Here, ψ=ψ(x,t) is a nonzero simultaneous solution of both of these two equations for some λC.

Let's see that (2) and (3) implies (1). First, by using the operator analogue of the product rule and then eliminating ψt from equation (3) gives

(4)t(Lψ)=dLdtψLMψ

Second, by using Lψ=λψ from equation (2) and the fact that λC is a constant, we get

(5)t(Lψ)=t(λψ)=λMψ=MLψ

Comparing equations (4) and (5) gives

(6)dLdtψLMψ=MLψ

so

(7)dLdtψ[L,M]ψ=(dLdt[L,M])ψ=0

From here, dividing by ψ (assuming ψ0 and that dLdt[L,M] is a multiplication operator, since it is supposed to be representing a given PDE like the KdV) gives back the Lax equation (1).

Now, to see that (1) implies (2) and (3), observe that we already know that Lax equation implies conserved eigenvalues. Differentiating the eigenvalue problem equation Lψ=λψ in time, we obtain:

dLdtψ+Ldψdt=λdψdt.

Using the Lax equation, this becomes:

(LMML)ψ+Ldψdt=λdψdt.

Rearranging terms and using Lψ=λψ, we get:

(Lλ)(dψdt+Mψ)=0.

Suppose, for simplicity, that the λ-eigenspace of L is one-dimensional, then dψdt+Mψ=βψ, which can be interpreted as a evolution equation for ψ:

dψdt=(M+β)ψ,

where β is a possibly time-dependent complex number. I think we can get rid of β by modifying M at the beginning.

If we replace L~=Lλ and M~=M+t then equation (1) becomes

[L~,M~]=0

since

[L~,M~]ψ=[Lλ,M+t]ψ=[L,M+t]ψ==0.

And equations (2) and (3) become

L~ψ=0,M~ψ=0.

As a zero-curvature condition

Pending task

Relation to QM

In Quantum Mechanics, observables are operators on a Hilbert space, just as L and M.
The Lax equation L˙=[L,M] bears a resemblance to the Heisenberg equation of motion for an operator Q in the Heisenberg picture of quantum mechanics:

idQdt=[Q,H],

where H is the Hamiltonian. If H and Q are finite dimensional matrices, then tr([Q,H])=0 so that tr(Q) is conserved. But often, operators in quantum mechanics are infinite dimensional and unbounded. The trace of the commutator of such operators may not vanish (or even be finite). In such cases, tr(Q) may not be a (finite) conserved quantity.

Open questions

Can we start with arbitrary operators L, M and then the condition [L,M]=0 is automatically an integrable system? This should be related with Inverse Scattering Method...
How do we obtain conserved quantities if we have no sense of "trace"?
What does it have to do with the zero-curvature representation?
What are the elements of the Hilbert space on which do these operator act?
What does it have to do with the inverse scattering transform?