Lax pair introduction

Finite degree of freedom

Consider a nonlinear system of ODEs

\[ \dot{x}_i=F_i, \]

for example one arising in Hamiltonian mechanics,

\[ \dot{p} = -\frac{\partial H}{\partial q}, \quad \dot{q} = \frac{\partial H}{\partial p}. \tag{1} \]

A Lax pair for this system is a pair of matrices \(L\) and \(M\) that satisfy the Lax equation:

\[ \frac{dL}{dt} = [L, M] \tag{2} \]

where \([L, M] = LM - ML\) is the commutator of \(M\) and \(L\), and \(\frac{dL}{dt}\) is the time derivative of \(L\). The entries of \(L\) and \(M\) are typically expressed in terms of the variables \(x_i\), and we require that equation (1) is satisfied if and only if (2) is satisfied.

The exact form of \(L\) and \(M\) depends on the specific system under consideration, and is not unique. For instance, in the case of the simple harmonic oscillator with Hamiltonian \(H = \frac{p^2}{2m} + \frac{1}{2}m\omega^2 q^2\), we can choose:

\[ L = \begin{pmatrix} p/m & \omega q \\ \omega q & -p/m \end{pmatrix}, \quad M = \begin{pmatrix} 0 & \omega/2 \\ -\omega/2 & 0 \end{pmatrix} \]

Then, the Lax equation \(\frac{dL}{dt} = [L,M]\) reproduces the equations of motion for the harmonic oscillator:

\[ \dot{p} = -m\omega^2 q \] \[ \dot{q} =\frac{p}{m} \]

The key point is that the eigenvalues of the \(L\) matrix are conserved quantities (constants of motion), and the eigenvectors provide a "good" transformation of variables (????).

Isospectral property

The eigenvalues of \(L\) do not change with time, a property known as isospectrality.

Due to the cyclic property of the trace, we can show that the trace of all the powers of \(L\) are conserved quantities.

In effect, the trace of \(L\) is conserved, since

\[ \dfrac{d}{dt}tr(L)=tr([L,M])=0. \]

To show \(\frac{d}{dt} \mathrm{tr}(L^2) = 0\), observe

\[ \frac{d}{dt} \mathrm{tr}(L^2) = \mathrm{tr}\left(\frac{d}{dt} L^2\right) = \mathrm{tr}\left(L\frac{dL}{dt} + \frac{dL}{dt}L\right), \]

where we used the product rule. Substituting the Lax equation for \(\frac{dL}{dt}\) gives:

\[ \frac{d}{dt} \mathrm{tr}(L^2) = \mathrm{tr}\left(L(LM - ML) + (LM - ML)L\right) = \] \[ =\mathrm{tr}(L^2M - LML + LML - MLL) = \mathrm{tr}(L^2M - MLL)=0. \]

The same is true for \(\frac{d}{dt} \mathrm{tr}(L^3) = 0\) and son on.

To see that the eigenvalues of \(L\) are conserved, only observe that the eigenvalues are determined by the trace of the powers of the matrix.

In the previous example of the harmonic oscillator we can find the eigenvalues \(\lambda\) and eigenvectors \(\psi\) of the \(L\) matrix. We solve the characteristic equation:

\[ \text{det}(L - \lambda I) = 0 \]

where \(I\) is the identity matrix. This gives

\[ \text{det}\begin{pmatrix} p/m - \lambda & \omega q \\ \omega q & -p/m - \lambda \end{pmatrix} = 0 \]

which simplifies to

\[ -(p^2/m^2 + \lambda^2) - \omega^2 q^2 = 0 \]

We can solve this equation for \(\lambda\) to get

\[ \lambda = \pm\sqrt{(p^2/m^2 + \omega^2 q^2)} \]

These are the eigenvalues of the \(L\) matrix, which correspond to the given Hamiltonian.

Spectral parameter

In many cases, Lax pairs depend on an auxiliary variable, the so-called spectral parameter, which is not directly related to the dynamics of the model. The Lax pair \(L(u)\), \(M(u)\) then obeys the Lax equation at all values of \(u \in \mathbb{C}\):

\[ \frac{d}{dt} L(u) = [M(u), L(u)] \text{ for all }u \in \mathbb{C}. \]

Such a Lax pair, called nonisospectral, must be constructed as always such that this equation is equivalent to the complete set of equations of motion. As a functional equation, it is, in principle, much more constraining than the Lax equation without spectral parameter. This feature is useful for mechanical systems with infinitely many degrees of freedom whose equations of motion could thus be formulated by a finite-dimensional Lax pair.

Even for a finite-dimensional system, Lax pairs with spectral parameter often exist. While the spectral parameter is not essential to encode finitely many equations of motion, it is nevertheless useful in several respects.

Infinite degrees of freedom (fields)

In this case we have PDEs, for example evolution equations like KdV:

\[ u_t-6u u_x+u_{xxx}=0, \]

the transport equation, etc, we can apply similar ideas, but now \(L,M\) are operators on a Hilbert space instead of matrices.

For the KdV example we have:

\[ L=-\partial_x^2+u(x,t) \] \[ M=4\partial_x^3+6u\partial_x+3u_x \]

In this cases, Lax equation \(L_t=[L,M]\) can be replaced by \([L,M]=0\) if we replace \(M\) by \(M+\partial_t\) (simple computations).

Isospectral property

In this case, it can then be shown that the eigenvalues and more generally the spectrum of \(L\) are independent of \(t\) in the following way.

The matrices \(L(t)\) are all similar by virtue of the existence of matrices \(U(t,s)\) such that

\[ L(t)=U(t,s) L(s) U(t,s)^{-1} \]

Observe that in this case, the eigenvalues of \(L(t)\) are the same as those of \(L(0)\).

The existence of \(U(t,s)\) is due to the existence of solution for the Cauchy problem

\[ \frac{d}{dt} U(t,s) = -M(t) U(t,s), \quad U(s,s) = I, \]

where \(I\) denotes the identity matrix. (I guess there is a kind of result for general operators!!??)

Indeed, observe that in that case

\[ \frac{d}{dt} \left( U(t,s)^{-1} L(t) U(t,s) \right) = \] \[ =-U(t,s)^{-1} \frac{d U(t,s)}{dt} U(t,s)^{-1} L(t) U(t,s) + U(t,s)^{-1} \frac{dL(t)}{dt} U(t,s) + U(t,s)^{-1} L(t) \frac{d U(t,s)}{dt} \] \[ =U(t,s)^{-1} M(t) L(t) U(t,s) + U(t,s)^{-1} \frac{dL(t)}{dt} U(t,s) - U(t,s)^{-1} L(t) M(t) U(t,s) \] \[ =-U(t,s)^{-1} [L(t),M(t)] U(t,s) + U(t,s)^{-1} \frac{dL(t)}{dt} U(t,s) \] \[ =U(t,s)^{-1} \left(\frac{dL(t)}{dt}-[L(t),M(t)] \right)U(t,s) =0. \]

This means that the operator \(U(t,s)^{-1} L(t) U(t,s)\) does not change with time, and therefore, it must be equal to a constant matrix. Because \(U(s, s) = I\), we have

\[ U(s,s)^{-1} L(s) U(s,s) = L(s). \]

and then \(L(t) = U(t,s) L(s) U(t,s)^{-1}\).

Lax equation as a compatibility condition

The Lax equation,

\[ \frac{dL}{dt} = [L, M] \tag{1} \]

is equivalent to the two linear compatibility conditions,

\[ L\psi = \lambda\psi \tag{2} \]

and

\[ \psi_t = -M\psi \tag{3} \]

Here, \(\psi = \psi(x, t)\) is a nonzero simultaneous solution of both of these two equations for some \(\lambda \in \mathbb{C}\).

Let's see that (2) and (3) implies (1). First, by using the operator analogue of the product rule and then eliminating \(\psi_t\) from equation (3) gives

\[ \frac{\partial}{\partial t} (L\psi) = \frac{dL}{dt} \psi - LM\psi \tag{4} \]

Second, by using \(L\psi = \lambda\psi\) from equation (2) and the fact that \(\lambda \in \mathbb{C}\) is a constant, we get

\[ \frac{\partial}{\partial t} (L\psi) = \frac{\partial}{\partial t} (\lambda\psi) = -\lambda M\psi = -ML\psi \tag{5} \]

Comparing equations (4) and (5) gives

\[ \frac{dL}{dt} \psi - LM\psi = -ML\psi \tag{6} \]

so

\[ \frac{dL}{dt} \psi - [L, M]\psi =\left(\frac{dL}{dt} - [L, M]\right)\psi= 0 \tag{7} \]

From here, dividing by \(\psi\) (assuming \(\psi \neq 0\) and that \(\frac{dL}{dt} - [L, M]\) is a multiplication operator, since it is supposed to be representing a given PDE like the KdV) gives back the Lax equation (1).

Now, to see that (1) implies (2) and (3), observe that we already know that Lax equation implies conserved eigenvalues. Differentiating the eigenvalue problem equation \(L\psi=\lambda \psi\) in time, we obtain:

\[ \frac{dL}{dt}\psi + L\frac{d \psi}{dt} = \lambda\frac{d \psi}{dt}. \]

Using the Lax equation, this becomes:

\[ (LM - ML)\psi + L\frac{d \psi}{dt} = \lambda\frac{d \psi}{dt}. \]

Rearranging terms and using \(L\psi = \lambda\psi\), we get:

\[ (L - \lambda)(\frac{d \psi}{dt} + M\psi) = 0. \]

Suppose, for simplicity, that the \(\lambda\)-eigenspace of \(L\) is one-dimensional, then \(\frac{d \psi}{dt} + M\psi=\beta\psi\), which can be interpreted as a evolution equation for \(\psi\):

\[ \frac{d \psi}{dt} = (−M + \beta)\psi, \]

where \(\beta\) is a possibly time-dependent complex number. I think we can get rid of \(\beta\) by modifying \(M\) at the beginning.

If we replace \(\tilde{L}=L-\lambda\) and \(\tilde{M}=M+\partial_t\) then equation (1) becomes

\[ [\tilde{L},\tilde{M}]=0 \]

since

\[ [\tilde{L},\tilde{M}]\psi=[L-\lambda, M+\partial_t]\psi=[L,M+\partial_t]\psi=\cdots=0. \]

And equations (2) and (3) become

\[ \tilde{L}\psi=0,\quad \tilde{M}\psi=0. \]

As a zero-curvature condition

Pending task

Relation to QM

In Quantum Mechanics, observables are operators on a Hilbert space, just as in the case of \(L\) of \(M\).

The Lax equation \(\dot{L} = [L, M]\) bears a resemblance to the Heisenberg equation of motion for an operator \(Q\) in the Heisenberg picture of quantum mechanics: \(i\hbar \frac{dQ}{dt} = [Q, H]\), where \(H\) is the Hamiltonian. If \(H\) and \(Q\) are finite dimensional matrices, then \(\text{tr}([Q, H]) = 0\) so that \(\text{tr}(Q)\) is conserved. But often, operators in quantum mechanics are infinite dimensional and unbounded. The trace of the commutator of such operators may not vanish (or even be finite). In such cases, \(\text{tr}(Q)\) may not be a (finite) conserved quantity.

Open questions

Can we start with arbitrary operators \(L\), \(M\) and then the condition \([L,M]=0\) is automatically an integrable system?

How do we obtain conserved quantities if we have no sense of "trace"?

What does it have to do with the zero-curvature representation?

What are the elements of the Hilbert space do these operator act on?

What does it have to do with the inverse scattering transform?

Comments

Popular posts from this blog

About the definition of angle, from Geometric Algebra

Spherical, hyperbolic and parabolic geometry

4-dimensional spheres, or haunting an ant