
Diagonalizing matrices


In this topic, we will finally delve into the practical procedure of diagonalizing matrices. Our focus is to provide you with the necessary tools and steps to successfully carry out the diagonalization process.

But first, let's develop the necessary theoretical results. Please remember that a square matrix $A$ of dimension $n$ is diagonalizable if there exists a diagonal matrix $D$ and an invertible matrix $P$ such that $A = PDP^{-1}$.

The surprising connection with eigenvectors

Let us first carefully analyze the definition of diagonalizability. Let's write the columns of $P$ as $\mathbf{v}_1, \dots, \mathbf{v}_n$ and the diagonal entries of $D$ as $d_1, \dots, d_n$.

If we multiply the equation $A = PDP^{-1}$ on the right by $P$, then we simply have that:

$$AP = PD$$

For these two matrices to be equal, it is enough that their columns are equal, so we will focus on them. In fact, we only need to consider the first column, since the reasoning is the same for all the others.

First, from the properties of matrix multiplication, we know that the first column of the product $AP$ is simply the product of $A$ with the first column of $P$, that is, $A\mathbf{v}_1$. On the other hand, we can follow the same reasoning for the first column of $PD$; but since we also know that $D$ is diagonal, we obtain that the first column of $PD$ is $d_1\mathbf{v}_1$. Therefore, we are looking for:

$$A\mathbf{v}_1 = d_1 \mathbf{v}_1$$

But wait just a moment... that equation looks like one we already know... what that expression means is that $\mathbf{v}_1$ is an eigenvector of $A$ with eigenvalue $d_1$!
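
Writing the same argument for every column at once makes the connection even clearer; this is just the column-by-column expansion of $AP = PD$:

$$AP = \begin{bmatrix} A\mathbf{v}_1 \,|\, \cdots \,|\, A\mathbf{v}_n \end{bmatrix}, \qquad PD = \begin{bmatrix} d_1\mathbf{v}_1 \,|\, \cdots \,|\, d_n\mathbf{v}_n \end{bmatrix}, \qquad \text{so } A\mathbf{v}_i = d_i\mathbf{v}_i \text{ for every } i.$$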

Thus, $A$ is diagonalizable if and only if the columns of the invertible matrix $P$ are eigenvectors of $A$ whose corresponding eigenvalues are the diagonal entries of $D$.

Thanks to our surprising result, to construct $P$ and $D$ all we need is to calculate all the eigenvalues of $A$ together with their eigenvectors:

  • Put the eigenvalues as the entries of the diagonal of $D$.

  • Put the respective eigenvectors as the columns of $P$.

How to detect diagonalizability

Not every matrix is diagonalizable, but the connection with eigenvectors is so deep that it allows us to fully understand when a matrix can be diagonalized.

First, remember that each eigenvalue has an associated eigenspace (the subspace spanned by its eigenvectors), which has a certain dimension. We call this number the geometric multiplicity of the eigenvalue. The characterization theorem is:

The following are equivalent:

a) $A$ is diagonalizable.

b) There exists a basis consisting of eigenvectors of $A$.

c) The sum of the geometric multiplicities of the eigenvalues of $A$ is $n$.

The simplest case in which diagonalizability is guaranteed is when an $n \times n$ matrix has exactly $n$ distinct eigenvalues. If not, then you have to compute the sum of the geometric multiplicities of the eigenvalues.

In short, when you don't have $n$ distinct eigenvalues, you need to calculate all the geometric multiplicities and check if they sum to $n$.
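
As an illustration, here is a minimal sketch of that check in Python with NumPy; the library choice and the helper name `geometric_multiplicity` are just for this example. It uses the fact that the geometric multiplicity of an eigenvalue $\lambda$ is $n$ minus the rank of $A - \lambda I$.

```python
import numpy as np

def geometric_multiplicity(A, eigenvalue, tol=1e-10):
    """Dimension of the eigenspace of `eigenvalue`, i.e. n - rank(A - eigenvalue * I)."""
    n = A.shape[0]
    return n - np.linalg.matrix_rank(A - eigenvalue * np.eye(n), tol=tol)
```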

Steps to diagonalize

Combining everything we discovered, we now have a straightforward and elegant procedure for diagonalizing any square matrix, as long as it is diagonalizable (see the sketch right after this list):

  1. Find all the eigenvalues of the matrix.

  2. Determine the eigenvectors corresponding to each eigenvalue.

  3. Find the dimension of each eigenspace.

  4. Add the geometric multiplicities of each eigenvalue. If the sum is exactly $n$, then the matrix is diagonalizable, and we can go to the next step. Otherwise, the matrix is not diagonalizable.

  5. Build $P$: its columns are the eigenvectors.

  6. Build $D$: the entries on its diagonal are the eigenvalues corresponding to the eigenvectors, in the same order. If an eigenvalue has more than one eigenvector in the basis, it is repeated as many times as it has eigenvectors (its geometric multiplicity).
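
To make the procedure concrete, here is a minimal NumPy sketch; the function name `diagonalize` and the use of `numpy.linalg.eig` are just one possible choice, and in exact arithmetic you would check the multiplicities symbolically rather than with a numerical rank test.

```python
import numpy as np

def diagonalize(A, tol=1e-10):
    """Return (P, D) with A = P D P^{-1}, or None if A is not diagonalizable."""
    n = A.shape[0]
    eigenvalues, P = np.linalg.eig(A)   # steps 1-2: eigenvalues and eigenvectors (columns of P)
    # Steps 3-4, numerically: A is diagonalizable exactly when the eigenvector
    # matrix P is invertible, i.e. when its columns form a basis.
    if np.linalg.matrix_rank(P, tol=tol) < n:
        return None
    D = np.diag(eigenvalues)            # steps 5-6: eigenvalues on the diagonal, same order as P's columns
    return P, D
```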

Let's put our brand-new procedure into practice!

A simple diagonalization

Let's start with a simple matrix:

$$A=\begin{bmatrix} 3 & 1 \\ 2 & 4 \end{bmatrix}$$

The first thing you have to do is find its eigenvalues. For this, calculate its characteristic polynomial $\chi(\lambda) = \det(A-\lambda I) = \lambda^2 - 7\lambda + 10$. The roots of the polynomial are the eigenvalues, so we get that they are $5$ and $2$. Thanks to our shortcut, we notice that there are as many distinct eigenvalues as the dimension of $A$, so it is diagonalizable!

Now you have to find the eigenvectors. Starting with $5$, you need the solutions of the system of equations $(A - 5I)\mathbf{v} = \mathbf{0}$, that is:

$$\begin{bmatrix} -2 & 1 \\ 2 & -1 \end{bmatrix} \begin{bmatrix} x \\ y \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix}$$

A possible solution would be $\mathbf{v}_1 = (1, 2)^T$. Now, for the eigenvalue $2$, you need to solve $(A - 2I)\mathbf{v} = \mathbf{0}$:

$$\begin{bmatrix} 1 & 1 \\ 2 & 2 \end{bmatrix} \begin{bmatrix} x \\ y \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix}$$

An easy solution is $\mathbf{v}_2 = (-1, 1)^T$. All ready! You only have to create $D$, whose diagonal entries are the eigenvalues, and $P$, whose columns are the eigenvectors in the same order:

$$D = \begin{bmatrix} 5 & 0 \\ 0 & 2 \end{bmatrix} \qquad \text{and} \qquad P = \begin{bmatrix} \mathbf{v}_1 \,|\, \mathbf{v}_2 \end{bmatrix} = \begin{bmatrix} 1 & -1 \\ 2 & 1 \end{bmatrix}$$

Feel free to verify that $A = PDP^{-1}$.
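
If you prefer a numerical check, here is a small NumPy verification of this example; the tolerance-based comparison via `numpy.allclose` is just one way to account for floating-point error.

```python
import numpy as np

A = np.array([[3.0, 1.0],
              [2.0, 4.0]])
D = np.diag([5.0, 2.0])      # eigenvalues on the diagonal
P = np.array([[1.0, -1.0],
              [2.0,  1.0]])  # eigenvectors as columns, in the same order

# Verify that A = P D P^{-1} up to floating-point error.
print(np.allclose(A, P @ D @ np.linalg.inv(P)))  # True
```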

A more challenging diagonalization

Let's analyze the following matrix:

$$A=\begin{bmatrix} 5 & 0 & -3 \\ -3 & 2 & 3 \\ 6 & 0 & -4 \end{bmatrix}$$

Calculating its characteristic polynomial $\chi(\lambda) = \det(A-\lambda I) = -\lambda^3 + 3\lambda^2 - 4$, we see that the eigenvalues are $2$ and $-1$. So far, you cannot know whether $A$ is diagonalizable; you must find all the eigenvectors.

Starting with $2$, the eigenvectors are the solutions to the system:

$$\begin{bmatrix} 3 & 0 & -3 \\ -3 & 0 & 3 \\ 6 & 0 & -6 \end{bmatrix} \begin{bmatrix} x \\ y \\ z \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}$$

You know that the solution set of a homogeneous system of linear equations is a vector subspace; in this case, it is of the form:

$$\{x (1,0,1)^T + y (0,1,0)^T \, | \, x, y \in \mathbb{R} \}$$

According to our characterization theorem, you need a basis, so you can take $\mathbf{v}_1 = (1,0,1)^T$ and $\mathbf{v}_2 = (0,1,0)^T$.

Finally, for the eigenvalue $-1$, you must solve the system:

$$\begin{bmatrix} 6 & 0 & -3 \\ -3 & 3 & 3 \\ 6 & 0 & -3 \end{bmatrix} \begin{bmatrix} x \\ y \\ z \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}$$

A solution would be $\mathbf{v}_3 = (1, -1, 2)^T$. The geometric multiplicities add up to $2 + 1 = 3$, so $A$ is diagonalizable, and now you can build $D$ and $P$:

$$D = \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & -1 \end{bmatrix} \qquad \text{and} \qquad P = \begin{bmatrix} \mathbf{v}_1 \,|\, \mathbf{v}_2 \,|\, \mathbf{v}_3 \end{bmatrix} = \begin{bmatrix} 1 & 0 & 1 \\ 0 & 1 & -1 \\ 1 & 0 & 2 \end{bmatrix}$$
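
Again, a quick NumPy check of this example is possible; here the identity $AP = PD$ is verified directly, together with the geometric multiplicities computed as $n$ minus the rank of $A - \lambda I$ (one possible way to do it).

```python
import numpy as np

A = np.array([[ 5.0, 0.0, -3.0],
              [-3.0, 2.0,  3.0],
              [ 6.0, 0.0, -4.0]])
D = np.diag([2.0, 2.0, -1.0])
P = np.array([[1.0, 0.0,  1.0],
              [0.0, 1.0, -1.0],
              [1.0, 0.0,  2.0]])  # v1, v2, v3 as columns

print(3 - np.linalg.matrix_rank(A - 2 * np.eye(3)))  # geometric multiplicity of 2: prints 2
print(3 - np.linalg.matrix_rank(A + np.eye(3)))      # geometric multiplicity of -1: prints 1
print(np.allclose(A @ P, P @ D))                     # A P = P D: prints True
```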

Not all matrices are diagonalizable

Not every matrix is diagonalizable. Think about this simple one:

$$A=\begin{bmatrix} 2 & 1 \\ 0 & 2 \end{bmatrix}$$

Its only eigenvalue is $2$, but the dimension of its eigenspace is $1$, so the matrix is not diagonalizable. But if we still built

$$D=\begin{bmatrix} 2 & 0 \\ 0 & 2 \end{bmatrix}$$

we would get that:

$$A = PDP^{-1} = P(2I)P^{-1} = 2PIP^{-1} = 2PP^{-1} = 2I$$

This is a clear contradiction, since $A$ is certainly not $2I$!
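
The same rank computation as before exposes the problem numerically (a small sketch, assuming NumPy):

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [0.0, 2.0]])

# The only eigenvalue is 2, but its eigenspace is one-dimensional:
# the geometric multiplicities add up to 1 < 2, so A is not diagonalizable.
print(2 - np.linalg.matrix_rank(A - 2 * np.eye(2)))  # prints 1
```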

Conclusion

  • A square matrix $A$ of dimension $n$ is diagonalizable if there exists a diagonal matrix $D$ and an invertible matrix $P$ such that $A = PDP^{-1}$.

  • The diagonal entries of $D$ are the eigenvalues of $A$, while the columns of $P$ are its eigenvectors.

  • The matrix $A$ is diagonalizable if and only if there is a basis formed by its eigenvectors, which is also equivalent to the fact that the geometric multiplicities of its eigenvalues add up to $n$.

  • If $A$ has $n$ distinct eigenvalues, then it is diagonalizable.

  • To diagonalize $A$, first calculate its eigenvalues, then the associated eigenvectors, and finally get $D$ and $P$.
