Lax pair

Integrable systems can often be viewed as those equations that arise from an overdetermined system of linear differential equations, with the original PDE playing the role of the compatibility condition. The overdetermined linear system is the Lax pair associated with the equation.
This is by no means a general construction: in practice, there is no general method for computing a Lax pair for a given system of PDEs. However, when a Lax pair for a system is known, it is enormously useful. Lax pairs can be used to derive infinitely many conserved quantities and to describe large classes of solutions, and they play a crucial role in the celebrated Inverse Scattering Method.

Note: it seems to me that to say that a PDE system admits a Lax pair we only need that the PDE system implies the Lax equation $[L, M] = 0$ . I guess this is enough to take advantage of the Lax pair...

Finite degrees of freedom

Consider a nonlinear system of ODEs

{\dot{x}}_{i} = F_{i},

for example one arising in Hamiltonian mechanics,

\begin{matrix} (1) & \dot{p} = - \frac{\partial H}{\partial q}, \dot{q} = \frac{\partial H}{\partial p} . \end{matrix}

A Lax pair for this system is a pair of matrices $L$ and $M$ that satisfy the Lax equation:

\begin{matrix} (2) & \frac{d L}{d t} = [L, M] \end{matrix}

where $[L, M] = L M - M L$ is the commutator of $L$ and $M$ , and $\frac{d L}{d t}$ is the time derivative of $L$ . The entries of $L$ and $M$ are typically expressed in terms of the variables $x_{i}$ , and we require that the equations of motion (1) hold if and only if the Lax equation (2) holds.

The exact form of $L$ and $M$ depends on the specific system under consideration, and is not unique. For instance, in the case of the simple harmonic oscillator with Hamiltonian $H = \frac{p^{2}}{2 m} + \frac{1}{2} m ω^{2} q^{2}$ , we can choose:

L = (\begin{matrix} p / m & ω q \\ ω q & - p / m \end{matrix}), M = (\begin{matrix} 0 & ω / 2 \\ - ω / 2 & 0 \end{matrix})

Then, the Lax equation $\frac{d L}{d t} = [L, M]$ reproduces the equations of motion for the harmonic oscillator:

\dot{p} = - m ω^{2} q

\dot{q} = \frac{p}{m}
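The claim can be spot-checked numerically. The following is a minimal sketch, not a proof: it evaluates $\frac{d L}{d t}$ through Hamilton's equations at one arbitrary phase-space point and compares it with $[L, M]$ . The helper names (`matmul`, `commutator`, `lax_residual`) and the sample values are illustrative assumptions.

```python
# Pure-Python 2x2 helpers; matrices are tuples of rows.
def matmul(A, B):
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def commutator(A, B):
    AB, BA = matmul(A, B), matmul(B, A)
    return tuple(tuple(AB[i][j] - BA[i][j] for j in range(2)) for i in range(2))

def lax_residual(q, p, m, omega):
    """Return dL/dt - [L, M], with dL/dt evaluated via Hamilton's equations."""
    L = ((p / m, omega * q), (omega * q, -p / m))
    M = ((0.0, omega / 2), (-omega / 2, 0.0))
    q_dot = p / m                  # dq/dt =  dH/dp
    p_dot = -m * omega**2 * q      # dp/dt = -dH/dq
    dL = ((p_dot / m, omega * q_dot), (omega * q_dot, -p_dot / m))
    C = commutator(L, M)
    return tuple(tuple(dL[i][j] - C[i][j] for j in range(2)) for i in range(2))

# Arbitrary sample point and parameters (illustrative values only).
residual = lax_residual(q=0.7, p=-1.3, m=2.0, omega=1.5)
max_residual = max(abs(x) for row in residual for x in row)
```

Here `max_residual` should vanish up to floating-point rounding, for any choice of the sample point.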

The key point is that the eigenvalues of the matrix $L$ are conserved quantities, i.e., constants of motion (see isospectral property below), and the eigenvectors provide a "good" transformation of variables (????).

Isospectral property

The eigenvalues of $L$ do not change with time, a property known as isospectrality.
Due to the cyclic property of the trace, the traces of all powers of $L$ are conserved quantities.
Indeed, the trace of $L$ is conserved, since

\frac{d}{d t} tr (L) = tr ([L, M]) = 0.

To show $\frac{d}{d t} tr (L^{2}) = 0$ , observe

\frac{d}{d t} tr (L^{2}) = tr (\frac{d}{d t} L^{2}) = tr (L \frac{d L}{d t} + \frac{d L}{d t} L),

where we used the product rule. Substituting the Lax equation for $\frac{d L}{d t}$ gives:

\frac{d}{d t} tr (L^{2}) = tr (L (L M - M L) + (L M - M L) L) =

= tr (L^{2} M - L M L + L M L - M L L) = tr (L^{2} M - M L L) = 0.

The same argument shows that $\frac{d}{d t} tr (L^{3}) = 0$ , and so on for higher powers.
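For a general power $k \geq 1$ the argument fits in one line. This is a sketch that assumes $L$ and $M$ are finite matrices, so that the trace is cyclic:

\frac{d}{d t} tr (L^{k}) = \sum_{j = 0}^{k - 1} tr (L^{j} \frac{d L}{d t} L^{k - 1 - j}) = k \, tr (L^{k - 1} [L, M]) = k \, (tr (L^{k} M) - tr (L^{k - 1} M L)) = 0.

The second equality moves the factors $L^{k - 1 - j}$ to the front using cyclicity, and the last one uses $tr (L^{k - 1} M L) = tr (L^{k} M)$ .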

To see that the eigenvalues of $L$ are conserved, observe that for an $n \times n$ matrix the traces $tr (L^{k})$ , $k = 1, \dots, n$ , determine the characteristic polynomial (via Newton's identities), and hence the eigenvalues.

In the previous example of the harmonic oscillator we can find the eigenvalues $λ$ and eigenvectors $ψ$ of the $L$ matrix. We solve the characteristic equation:

det (L - λ I) = 0

where $I$ is the identity matrix. This gives

det (\begin{matrix} p / m - λ & ω q \\ ω q & - p / m - λ \end{matrix}) = 0

which simplifies to

- (p^{2} / m^{2} - λ^{2}) - ω^{2} q^{2} = 0

We can solve this equation for $λ$ to get

λ = \pm \sqrt{(p^{2} / m^{2} + ω^{2} q^{2})}

These are the eigenvalues of the $L$ matrix. Since $λ^{2} = p^{2} / m^{2} + ω^{2} q^{2} = 2 H / m$ , their conservation is equivalent to conservation of the energy $H$ .
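As a numerical illustration, one can follow the exact oscillator solution and watch the spectral data of $L$ stay fixed. This is a minimal sketch; the function names (`state`, `spectral_data`) and the initial data are assumptions chosen for the example.

```python
import math

def state(t, q0, p0, m, omega):
    """Exact harmonic-oscillator solution (q(t), p(t)) with initial data (q0, p0)."""
    c, s = math.cos(omega * t), math.sin(omega * t)
    q = q0 * c + p0 / (m * omega) * s
    p = p0 * c - m * omega * q0 * s
    return q, p

def spectral_data(q, p, m, omega):
    """Return (tr L, tr L^2, positive eigenvalue) for L = [[p/m, wq], [wq, -p/m]]."""
    a, b = p / m, omega * q
    tr_L = a + (-a)
    tr_L2 = 2 * (a * a + b * b)    # L^2 = (a^2 + b^2) * identity
    lam = math.sqrt(a * a + b * b)
    return tr_L, tr_L2, lam

q0, p0, m, omega = 0.7, -1.3, 2.0, 1.5      # illustrative values only
H = p0**2 / (2 * m) + 0.5 * m * omega**2 * q0**2
samples = [spectral_data(*state(t, q0, p0, m, omega), m, omega)
           for t in (0.0, 0.4, 1.1, 3.0)]
```

Every entry of `samples` should agree up to rounding, with the positive eigenvalue equal to `math.sqrt(2 * H / m)`.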

Spectral parameter

In many cases, Lax pairs depend on an auxiliary variable, the so-called spectral parameter, which is not directly related to the dynamics of the model. The Lax pair $L (u)$ , $M (u)$ then obeys the Lax equation at all values of $u \in \mathbb{C}$ :

\frac{d}{d t} L (u) = [L (u), M (u)] \quad \text{for all } u \in \mathbb{C} .

Such a Lax pair must be constructed, as always, so that this equation is equivalent to the complete set of equations of motion. As a functional equation in $u$ , it is, in principle, much more constraining than the Lax equation without spectral parameter. This feature is useful for mechanical systems with infinitely many degrees of freedom, whose equations of motion can thus be formulated by a finite-dimensional Lax pair.

Even for a finite-dimensional system, Lax pairs with spectral parameter often exist. While the spectral parameter is not essential to encode finitely many equations of motion, it is nevertheless useful in several respects.

Infinite degrees of freedom (fields)

Idea: see also this paragraph.

In this case we have PDEs, for example evolution equations such as the KdV equation

u_{t} - 6 u u_{x} + u_{x x x} = 0,

or the transport equation. Similar ideas apply, but now $L$ and $M$ are operators on a Hilbert space instead of matrices.
For the KdV example we have:

L = - \partial_{x}^{2} + u (x, t)

M = 4 \partial_{x}^{3} - 6 u \partial_{x} - 3 u_{x}

In this case, the Lax equation $L_{t} = [L, M]$ can be replaced by $[L, M] = 0$ if we replace $M$ by $M + \partial_{t}$ , since $[L, \partial_{t}] ψ = L ψ_{t} - (L ψ)_{t} = - L_{t} ψ$ .
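A quick way to test a candidate KdV Lax pair is to apply the commutator to polynomials, where all derivatives are exact. The sketch below takes $L = - \partial_{x}^{2} + u$ and $M = 4 \partial_{x}^{3} - 6 u \partial_{x} - 3 u_{x}$ (the signs that match the convention $L_{t} = [L, M]$ ) and checks that $[L, M] f = (6 u u_{x} - u_{x x x}) f$ , so that $L_{t} = [L, M]$ reads $u_{t} = 6 u u_{x} - u_{x x x}$ . It is a spot check for one arbitrary pair of polynomials $u$ , $f$ at a frozen time, not a proof; all helper names are assumptions.

```python
# Polynomials in x are lists of integer coefficients, lowest degree first.
def add(a, b):
    n = max(len(a), len(b))
    a, b = a + [0] * (n - len(a)), b + [0] * (n - len(b))
    return [x + y for x, y in zip(a, b)]

def scale(c, a):
    return [c * x for x in a]

def mul(a, b):
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def d(a, k=1):
    """k-th derivative with respect to x."""
    for _ in range(k):
        a = [i * a[i] for i in range(1, len(a))] or [0]
    return a

def trim(a):
    """Drop trailing zero coefficients so equal polynomials compare equal."""
    a = list(a)
    while len(a) > 1 and a[-1] == 0:
        a.pop()
    return a

def apply_L(u, f):
    # L f = -f_xx + u f
    return add(scale(-1, d(f, 2)), mul(u, f))

def apply_M(u, f):
    # M f = 4 f_xxx - 6 u f_x - 3 u_x f
    return add(add(scale(4, d(f, 3)), scale(-6, mul(u, d(f)))),
               scale(-3, mul(d(u), f)))

u = [1, 2, 0, 1]        # u(x) = 1 + 2x + x^3, an arbitrary test profile
f = [3, -1, 4, 0, 2]    # arbitrary test function

# [L, M] f = L(M f) - M(L f)
commutator_f = trim(add(apply_L(u, apply_M(u, f)),
                        scale(-1, apply_M(u, apply_L(u, f)))))
# (6 u u_x - u_xxx) f
expected_f = trim(mul(add(scale(6, mul(u, d(u))), scale(-1, d(u, 3))), f))
```

The two lists `commutator_f` and `expected_f` should coincide exactly, which shows that all derivative terms in the commutator cancel and only a multiplication operator survives.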

Isospectral property

In this case, it can be shown that the eigenvalues, and more generally the spectrum, of $L$ are independent of $t$ , as follows.

The matrices $L (t)$ are all similar by virtue of the existence of matrices $U (t, s)$ such that

L (t) = U (t, s) L (s) U (t, s)^{- 1}

Observe that in this case the eigenvalues of $L (t)$ are the same as those of $L (s)$ .
The existence of $U (t, s)$ follows from the existence of a solution to the Cauchy problem

\frac{d}{d t} U (t, s) = - M (t) U (t, s), U (s, s) = I,

where $I$ denotes the identity matrix. (I guess there is a kind of result for general operators!!??)

Indeed, observe that in that case

\frac{d}{d t} (U (t, s)^{- 1} L (t) U (t, s)) =

= - U (t, s)^{- 1} \frac{d U (t, s)}{d t} U (t, s)^{- 1} L (t) U (t, s) + U (t, s)^{- 1} \frac{d L (t)}{d t} U (t, s) + U (t, s)^{- 1} L (t) \frac{d U (t, s)}{d t}

= U (t, s)^{- 1} M (t) L (t) U (t, s) + U (t, s)^{- 1} \frac{d L (t)}{d t} U (t, s) - U (t, s)^{- 1} L (t) M (t) U (t, s)

= - U (t, s)^{- 1} [L (t), M (t)] U (t, s) + U (t, s)^{- 1} \frac{d L (t)}{d t} U (t, s)

= U (t, s)^{- 1} (\frac{d L (t)}{d t} - [L (t), M (t)]) U (t, s) = 0.

This means that the operator $U (t, s)^{- 1} L (t) U (t, s)$ does not depend on $t$ . Evaluating it at $t = s$ and using $U (s, s) = I$ , its constant value is

U (s, s)^{- 1} L (s) U (s, s) = L (s),

and therefore $L (t) = U (t, s) L (s) U (t, s)^{- 1}$ .
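For the harmonic oscillator pair from the finite-dimensional example, $M$ is constant, so the Cauchy problem can be solved in closed form: $U (t, 0) = e^{- M t}$ is a rotation by the angle $ω t / 2$ . The sketch below checks $L (t) = U (t, 0) L (0) U (t, 0)^{- 1}$ against the exact solution. Function names and sample values are illustrative assumptions.

```python
import math

def matmul(A, B):
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def lax_L(q, p, m, omega):
    return ((p / m, omega * q), (omega * q, -p / m))

def propagator(t, omega):
    """U(t, 0) = exp(-M t) for constant M = [[0, w/2], [-w/2, 0]]: a rotation by w t / 2."""
    th = omega * t / 2
    return ((math.cos(th), -math.sin(th)), (math.sin(th), math.cos(th)))

def conjugated_L(t, q0, p0, m, omega):
    U = propagator(t, omega)
    U_inv = ((U[0][0], U[1][0]), (U[0][1], U[1][1]))   # a rotation's inverse is its transpose
    return matmul(matmul(U, lax_L(q0, p0, m, omega)), U_inv)

def exact_L(t, q0, p0, m, omega):
    """L built from the exact oscillator solution at time t."""
    c, s = math.cos(omega * t), math.sin(omega * t)
    q = q0 * c + p0 / (m * omega) * s
    p = p0 * c - m * omega * q0 * s
    return lax_L(q, p, m, omega)

def max_difference(t, q0=0.7, p0=-1.3, m=2.0, omega=1.5):
    A, B = conjugated_L(t, q0, p0, m, omega), exact_L(t, q0, p0, m, omega)
    return max(abs(A[i][j] - B[i][j]) for i in range(2) for j in range(2))
```

Calling `max_difference(t)` for any `t` should return a value at the level of floating-point rounding.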

Lax equation as a compatibility condition

The Lax equation,

\begin{matrix} (1) & \frac{d L}{d t} = [L, M] \end{matrix}

is equivalent to the compatibility of the two linear equations

\begin{matrix} (2) & L ψ = λ ψ \end{matrix}

and

\begin{matrix} (3) & ψ_{t} = - M ψ \end{matrix}

Here, $ψ = ψ (x, t)$ is a nonzero simultaneous solution of both equations for some $λ \in \mathbb{C}$ .

Let's see that (2) and (3) imply (1). First, applying the operator analogue of the product rule and then eliminating $ψ_{t}$ by means of equation (3) gives

\begin{matrix} (4) & \frac{\partial}{\partial t} (L ψ) = \frac{d L}{d t} ψ - L M ψ \end{matrix}

Second, by using $L ψ = λ ψ$ from equation (2) and the fact that $λ \in \mathbb{C}$ is a constant, we get

\begin{matrix} (5) & \frac{\partial}{\partial t} (L ψ) = \frac{\partial}{\partial t} (λ ψ) = - λ M ψ = - M L ψ \end{matrix}

Comparing equations (4) and (5) gives

\begin{matrix} (6) & \frac{d L}{d t} ψ - L M ψ = - M L ψ \end{matrix}

\begin{matrix} (7) & \frac{d L}{d t} ψ - [L, M] ψ = (\frac{d L}{d t} - [L, M]) ψ = 0 \end{matrix}

From here, since $ψ \neq 0$ and assuming that $\frac{d L}{d t} - [L, M]$ is a multiplication operator (as it should be if it represents a given PDE such as KdV), we can divide by $ψ$ and recover the Lax equation (1).

Now, to see that (1) implies (2) and (3), recall that the Lax equation implies that the eigenvalues are conserved. Differentiating the eigenvalue equation $L ψ = λ ψ$ with respect to time, we obtain:

\frac{d L}{d t} ψ + L \frac{d ψ}{d t} = λ \frac{d ψ}{d t} .

Using the Lax equation, this becomes:

(L M - M L) ψ + L \frac{d ψ}{d t} = λ \frac{d ψ}{d t} .

Rearranging terms and using $L ψ = λ ψ$ , we get:

(L - λ) (\frac{d ψ}{d t} + M ψ) = 0.

Suppose, for simplicity, that the $λ$ -eigenspace of $L$ is one-dimensional. Then $\frac{d ψ}{d t} + M ψ = β ψ$ for some scalar $β$ , which can be interpreted as an evolution equation for $ψ$ :

\frac{d ψ}{d t} = (- M + β) ψ,

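where $β$ is a possibly time-dependent complex number. I think we can get rid of $β$ either by modifying $M$ at the beginning or by rescaling the eigenfunction: $\tilde{ψ} = e^{- \int β \, d t} ψ$ is still an eigenvector of $L$ and satisfies $\tilde{ψ}_{t} = - M \tilde{ψ}$ .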
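For the harmonic oscillator pair this can be made concrete with $β = 0$ . The sketch below, under that assumption, transports an eigenvector of $L (0)$ with $ψ_{t} = - M ψ$ (a rotation by $ω t / 2$ , since $M$ is constant) and checks that it remains an eigenvector of $L (t)$ with the same $λ$ . The names and sample values are illustrative, and the chosen eigenvector formula assumes $q_{0} \neq 0$ .

```python
import math

def matvec(A, v):
    return (A[0][0] * v[0] + A[0][1] * v[1], A[1][0] * v[0] + A[1][1] * v[1])

def lax_L(q, p, m, omega):
    return ((p / m, omega * q), (omega * q, -p / m))

def evolve_state(t, q0, p0, m, omega):
    """Exact oscillator solution (q(t), p(t))."""
    c, s = math.cos(omega * t), math.sin(omega * t)
    return q0 * c + p0 / (m * omega) * s, p0 * c - m * omega * q0 * s

def evolve_psi(t, psi0, omega):
    """Solve psi_t = -M psi for constant M = [[0, w/2], [-w/2, 0]]."""
    th = omega * t / 2
    c, s = math.cos(th), math.sin(th)
    return (c * psi0[0] - s * psi0[1], s * psi0[0] + c * psi0[1])

def eigen_residual(t, q0=0.7, p0=-1.3, m=2.0, omega=1.5):
    a, b = p0 / m, omega * q0
    lam = math.hypot(a, b)          # positive eigenvalue of L(0)
    psi0 = (b, lam - a)             # eigenvector of L(0) for lam, valid when b != 0
    q, p = evolve_state(t, q0, p0, m, omega)
    psi = evolve_psi(t, psi0, omega)
    L_psi = matvec(lax_L(q, p, m, omega), psi)
    return max(abs(L_psi[0] - lam * psi[0]), abs(L_psi[1] - lam * psi[1]))
```

`eigen_residual(t)` measures how far $L (t) ψ (t)$ is from $λ ψ (t)$ and should be zero up to rounding for every `t`.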

If we set $\tilde{L} = L - λ$ and $\tilde{M} = M + \partial_{t}$ , then equation (1) becomes

[\tilde{L}, \tilde{M}] = 0

since

[\tilde{L}, \tilde{M}] ψ = [L - λ, M + \partial_{t}] ψ = [L, M + \partial_{t}] ψ = ([L, M] - \frac{d L}{d t}) ψ = 0.

And equations (2) and (3) become

\tilde{L} ψ = 0, \tilde{M} ψ = 0.

As a zero-curvature condition

Pending task

Relation to QM

In Quantum Mechanics, observables are operators on a Hilbert space, just as $L$ and $M$ .
The Lax equation $\dot{L} = [L, M]$ bears a resemblance to the Heisenberg equation of motion for an operator $Q$ in the Heisenberg picture of quantum mechanics:

i ℏ \frac{d Q}{d t} = [Q, H],

where $H$ is the Hamiltonian. If $H$ and $Q$ are finite dimensional matrices, then $tr ([Q, H]) = 0$ so that $tr (Q)$ is conserved. But often, operators in quantum mechanics are infinite dimensional and unbounded. The trace of the commutator of such operators may not vanish (or even be finite). In such cases, $tr (Q)$ may not be a (finite) conserved quantity.

Open questions

Can we start with arbitrary operators $L$ , $M$ , so that the condition $[L, M] = 0$ automatically defines an integrable system? This should be related to the Inverse Scattering Method...
How do we obtain conserved quantities if we have no sense of "trace"?
What does it have to do with the zero-curvature representation?
What are the elements of the Hilbert space on which these operators act?
What does it have to do with the inverse scattering transform?