In this post I’ll write up the notes I took while trying to follow this “short and simple” linear algebra result which made a few headlines in 2019, sent to me by Juanhe (as so many other cool things are, thank you!)
The general thrust of the result is that simply knowing the eigenvalues of a Hermitian matrix \(A\) and its principal minors, you can construct its eigenvectors.
EDIT, 2019-12-14: the authors of the above paper have reuploaded a much more extensive version replete with multiple proofs and a historical analysis of the identity’s history. The below is now Section 2.5, “Proof using a Cauchy-Binet type formula”; it seems like the simplest of the provided proofs.
Lemma
The lemma asserts that for \(\small n \times n\) Hermitian \(A\) with eigenvalues \(\lambda_i\) and associated eigenvectors \(v_i\) with some eigenvalue \(\lambda_k=0\) and any \(\small n \times n-1\) matrix \(B\), we have:
⎝⎜⎛i=1,i=k∏nλi(A)⎠⎟⎞∣det(B∣∣vk)∣2=det(B∗AB)
where \(B \mathbf{||} v_k\) denotes the \(\small n \times n\) matrix formed by augmenting the \(\small n \times n-1\) matrix \(B\) with the \(\small n \times1\) eigenvector \(v_k\).
Begin by assuming \(A\) is diagonal and that \(k=n\) which by construction means:
A=⎣⎢⎢⎢⎢⎢⎡λ1λ20⋱0λn−10⎦⎥⎥⎥⎥⎥⎤=⎣⎢⎡A′0⎦⎥⎤
vi=ei,∀i
where \(A’\) is the upper-left diagonal \(\small n − 1 \times n − 1\) submatrix and \(e_i\) is the standard basis vector with a \(1\) in the \(i\)th column and \(0\) elsewhere.
Similarly, write \(B = \begin{bmatrix} B’ \\ X \end{bmatrix} = \begin{bmatrix} \beta_1’ & \beta_2’ & \dots & \beta_{n-1}’ \\ x_1 & x_2 & \dots & x_{n-1} \end{bmatrix}\) where \(B’\) is the upper \(\small n − 1 \times n − 1\) submatrix and \(X\) is the lower \(\small 1 \times n − 1\) row vector, with each \(\beta_i’\) a \(\small n-1 \times 1\) column vector and \(X = \begin{bmatrix}—x_i—\end{bmatrix}\).
We have on the one hand that:
det(B∣∣vn)=det(⎣⎢⎡B′X01⎦⎥⎤)=0+0+⋯+detB′
by considering the Laplace expansion along the last row (each submatrix but the last contains the \(0\)-vector on the top right, hence each cofactor but the last is also \(0\)), which implies:
⇒(i=1∏n−1λi(A))∣det(B∣∣vn)∣2=(i=1∏n−1λi(A))∣detB′∣2
On the other hand, note that since the last row and column of \(A\) is identically zero, we have:
AB=[A′β1′0……A′βn−1′0]
⇒B∗AB=⎣⎢⎡β1′…βn−1′x1…xn−1⎦⎥⎤[A′β1′0……A′βn−1′0]=⎣⎢⎡β1′A′β1′…βn−1′A′β1………β1′A′βn−1′…βn−1′A′βn−1⎦⎥⎤
where the last row of the rightmost matrix being zero eliminates all of the \(x_i\) from the product. This last \(\small n-1 \times n-1\) matrix we recognize as simply the conjugation:
B′∗A′B′
and hence:
det(B∗AB)=det(B′∗A′B′)=det(A′)∣det(B′)∣2=(i=1∏n−1λi(A))∣detB′∣2
as desired.
To complete the lemma, when \(A\) is Hermitian we can find an eigendecomposition \(A = VDV^*\) where the \(V = \begin{bmatrix} v_1 & \dots & v_n \end{bmatrix}\) is orthogonal, and \(D\) is the diagonal matrix of eigenvalues \(\lambda_i\), and moreover we can find such a decomposition where \(\lambda_n\) is the \(0\) eigenvalue. Then:
det(B∗AB)=det(B∗VDV∗B)=(i=1∏n−1λi)∣det(V∗B∣∣en)∣2=(i=1∏n−1λi)∣det(V∗B∣∣V∗vn)∣2=(i=1∏n−1λi)∣det(B∣∣vn)∣2
where on the first line we use the diagonal case of the lemma, on the second we use the orthogonality of the rows \(v_i\) of \(V^*\), and on the third we extract \(\det(V^*) = 1\) again by orthogonality.
Main result
The main result applies to Hermitian \(A\), again with eigenvalues/vectors \(\lambda_i, v_i\). Letting \(M_j\) be \(A\) with the \(j\)-th row and column removed, and \(\mu_i\) be its eigenvalues, it says that \(\forall 1\leq i, j\leq n\):
∣vi,j∣2k=1,k=i∏n(λk−λi)=k=1∏n−1(μk−λi)
It’s actually pretty simple once we already have the Lemma. Fixing \(i, j\) (the paper argues WLOG to consider \(i=n, j=1\)), note that \(A’ = A - \lambda_i I_n\) has eigenvalues \(\lambda_k’ = \lambda_k - \lambda_i, \forall k\) with the same eigenvectors \(v_k’ = v_k\); letting \(M_j’\) be the analogous submatrix similarly \(\mu_k’ = \mu_k - \lambda_i, \forall k\). Substituting in the above, the proof reduces to showing:
∣vi,j∣2k=1,k=i∏nλk′=k=1∏n−1μk′
Our new matrix \(A’\) has \(\lambda_k’ = 0\) so we may apply the Lemma with \(B\) the \(\small n \times n-1\) matrix:
bu,v=⎩⎪⎪⎨⎪⎪⎧110u<j,u=vu>j,u=v+1otherwise⟶B=⎣⎢⎢⎢⎢⎢⎢⎢⎢⎢⎡1010⋱0…0⋱0101⎦⎥⎥⎥⎥⎥⎥⎥⎥⎥⎤
that is, \(B\) is \(I_{n-1}\) with the zero vector \(0_{1, n-1}\) inserted at the \(j\)-th row.
Note \(|\det(B || v_i)|^2\) is precisely \(|v_{i,j}|^2\) again by considering the Laplace expansion along the \(j\)-th row. And:
det(B∗A′B)=det⎝⎜⎜⎜⎛B∗⎣⎢⎢⎢⎡a1,1′a2,1′…an,1′…………a1,j−1′a2,j−1′…an,j−1′a1,j+1′a2,j+1′…an,j+1′…………a1,n′a2,n′…an,n′⎦⎥⎥⎥⎤⎠⎟⎟⎟⎞=det⎝⎜⎜⎜⎜⎜⎜⎜⎛⎣⎢⎢⎢⎢⎢⎢⎢⎡a1,1′…aj−1,1′aj+1,1′…an,1′………………a1,j−1′…aj−1,j−1′aj+1,j−1′…an,j−1′a1,j+1′…aj−1,j+1′aj+1,j+1′…an,j+1′………………a1,n′…aj−1,n′aj+1,n′…an,n′⎦⎥⎥⎥⎥⎥⎥⎥⎤⎠⎟⎟⎟⎟⎟⎟⎟⎞=det(Mj′)=k=1∏n−1μk′
so that the Lemma immediately gives our result:
⇒⎝⎜⎛k=1,k=i∏nλk(A′)⎠⎟⎞∣det(B∣∣vi)∣2=det(B∗A′B)⎝⎜⎛k=1,k=i∏nλk′⎠⎟⎞∣vi,j∣2=k=1∏n−1μk′