Systems of linear equations occur everywhere: not only in math itself, but also in physics, chemistry and economics! There are different ways of solving them, but the most general way is based on matrices. In this way of solving, the main stage in the solution process is the reduction of the matrix to an echelon form (also called row echelon form). Let's see how to do this reduction.
Echelon matrix
First, let's define the echelon matrix. It is a matrix that has three properties:
If any column contains the leading entry (first non-zero in the row), then below it can be only zero elements.
The leading entry of the row above the current one must be to the left of the leading entry of the current row.
All rows that contain only zeros must be below all other rows.
Below are matrices reduced to the echelon form:
Note that a zero matrix is in echelon form!
Elementary row and column operations
Before learning how to reduce a matrix to an echelon form, you have to focus on elementary row and column operations, since they're the core of this process. After applying one of these operations, you'll get a matrix equivalent to the original one. Why is it so? It's connected with solving systems of linear equations using Gaussian elimination. It is better explored in the corresponding topic. Now, let's just focus on the types of these operations.
There are three types of elementary row and column operations:
Interchange of any two rows or columns
Multiplication of any row or column by a non-zero number
Multiplication of any row or column by a non-zero number and adding to another row or column
If these operations are performed on rows, they are called elementary row operations. If they are performed on columns, they are called elementary column operations.
Here's a picture that illustrates all elementary row operations:
Reducing a matrix to echelon form: the algorithm
And now you're ready to learn how to reduce a matrix to the echelon form. Suppose you have a matrix Here's the algorithm of reduction to echelon form:
Find a non-zero element in the first column. This element is called a pivot, and the row containing it is called the pivot row. If the row with the pivot is not the first row, interchange it with the first row. Otherwise, don't change anything. If you were unable to find a non-zero element of the first column, just leave it untouched and move on to the second column
For every row except the first, do the following: multiply the first row by and add the result to the -th row. Now all the elements in the first column, but not in the first row, have become zero. You'll get a matrix of the form:
Leave the pivot row and column untouched and move on to the second, third, etc. column and do the same thing as described in . Every time, you'll work with a smaller matrix from the bottom-right corner. For the second column, it will be
Note that the echelon form of any one-row matrix will be this matrix itself.
Example
Assume you have a matrix:
The first element of the first row is , so it is a pivot. Now you have to add multipliers of the first row to the second and the third row in order to make all other elements in the first column equal to . How can you do it? As you know, and . So, let's multiply the first row by and add to the second one. You can do it, because . You'll get the following matrix As you know, , so let's multiply the first row by and add it to the third row. You also can do it, because . You'll get the following matrix:
According to our algorithm, you should move on to the second column. Here, add a multiplier of the second row to the third one to make the second element of the third row equal to . You should also apply the fact that . So, let's multiply the second row by and add it to the third row:
The matrix is now in echelon form.
Reduced echelon form
A matrix is in reduced echelon form if it satisfies one extra condition: each leading entry of non-zero rows is , and it is the only non-zero element in its column. For instance, these matrices are in reduced echelon form:
As you can see, in each matrix, there's a row with only one element, and this is why such a form of a matrix is the desired result when you solve a system of linear equations. Since only one element is present, it means that the only variable is left, and it's easy to find it. You'll learn more about this process in the next topic on Gaussian elimination.
If you have a matrix reduced to an echelon form, you can easily reduce it to the reduced echelon form.Just multiply all rows by the inverse of their pivots and perform some more elementary operations to make all non-pivot elements of columns equal to . Coming back to the matrix from the previous paragraph, you have
You have to multiply the first row by and the third one by , so you'll get:
And now, you have to do something with the last element of the first and the second column. You'll make them equal to , if you just multiply the third row by and add it to them:
Conclusion
Let's sum up all the most important facts you've learned so far:
An echelon matrix has three properties:
If any column contains the leading entry (first non-zero in the row), then below it can be only zero elements.
The leading entry of the row above the current one must be to the left of the leading entry of the current row.
All rows that contain only zeros must be below all other rows.
There are three elementary row and column operations:
Interchange of any two rows or columns
Multiplication of any row or column by a non-zero number
Multiplication of any row or column by a non-zero number and adding to another row or column
The algorithm of reduction of a matrix to a row echelon form is based on elementary row and column operations
There's also a reduced echelon form of a matrix, where each leading entry of non-zero rows is , and it is the only non-zero element in its column