Lesson Explainer: The General Solution of a System of Linear Equations | Nagwa Lesson Explainer: The General Solution of a System of Linear Equations | Nagwa

Lesson Explainer: The General Solution of a System of Linear Equations Mathematics

Join Nagwa Classes

Attend live Mathematics sessions on Nagwa Classes to learn more about this topic from an expert teacher!

In this explainer, we will learn how to find the general solution of a system of linear equations whether it has a unique solution, an infinite number of solutions, or no solution.

When working with a system of linear equations, there are 3 possible categories of solution that are possible to find. The most familiar case will be a system of linear equations which has a unique solution. Take, for example, the system of linear equations 2𝑥+3𝑦=1,4𝑥+𝑦=9.

Solving this system of linear equations using any method will give 𝑥=2 and 𝑦=1. Given that both of the variables 𝑥 and 𝑦 have only one possible value, we would say that the solution is unique.

Suppose that we instead have the system of linear equations 𝑥3𝑦=2,3𝑥+9𝑦=6.

In this instance we can see that the bottom equation is essentially just a copy of the first equation, with every term multiplied by 3. In fact, we can multiply each term in the left-hand side by 13 without changing the solution and the system would become 𝑥3𝑦=2,𝑥3𝑦=2.

The first and second equations are identical, which means that there is no benefit of writing them both out. We lose no information by writing the system of linear equations more concisely as 𝑥3𝑦=2.

This shows that, unlike the previous example where we had a unique solution, it is now possible for 𝑥 and 𝑦 to take infinitely many values whilst still solving the original system of linear equations. If we were to solve the above equation for 𝑥, then we would find 𝑥=2+3𝑦. If we picked the example value of 𝑦=1, then we would find 𝑥=5 and we could check that this pair of values do actually solve the original system. Equally, we would pick 𝑦=3, which would give 𝑥=7. It is also the case that this pair of values solve the original system of equations.

Given that we can pick any value of 𝑦 and find a corresponding value of 𝑥 from the equation 𝑥=2+3𝑦, we therefore have infinitely many possible solutions. It is this type of solution that we will discuss in this explainer.

For completion, we will briefly mention the final possible solution type. We consider the system of linear equations 3𝑥+𝑦=2,6𝑥+2𝑦=6.

We can see that the second equation is very similar to the first equation. We multiply all terms in the second equation by 12, giving 3𝑥+𝑦=2,3𝑥+𝑦=3.

Now we have a rather curious pair of equations, which have the same left-hand sides but which have different right-hand sides. Reading the two equations above, we have to find values of 𝑥 and 𝑦 that when combined together in exactly the same way, somehow give different outputs. This is patently ludicrous and is not possible to achieve, meaning that the given system of linear equations does not have a solution. We would say that the system is inconsistent or insoluble. Naturally enough, we are not generally interested in these types of systems of linear equations.

The method and difficulty for solving a system of equations with an infinite number of solutions are not notably different to the situation where there is a unique solution. The method is largely the same and does not require advanced knowledge of the solution type, relying on the Gauss–Jordan elimination method for completing the calculations. It is only partway through the calculations, when the solution type has been identified, that the method’s approach begins to deviate. No matter what the solution type of a system of linear equations is, we may always find it by using the same approach to grouping together the coefficients of this system into a particular matrix, which we then manipulate with row operations to find the solution.

Definition: Augmented Coefficient Matrix

Consider a general system of linear equations in the variables 𝑥,𝑥,,𝑥 and the coefficients 𝑎: 𝑎𝑥+𝑎𝑥𝑎𝑥=𝑏,𝑎𝑥+𝑎𝑥𝑎𝑥=𝑏,𝑎𝑥+𝑎𝑥𝑎𝑥=𝑏.

Then the system of linear equations is also described by the matrix equation 𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑎𝑥𝑥𝑥=𝑏𝑏𝑏.

The augmented coefficient matrix of the system is 𝑎𝑎𝑎𝑏𝑎𝑎𝑎𝑏𝑎𝑎𝑎𝑏.

Suppose that we wished to solve the system of linear equations defined as 𝑥2𝑦=3,5𝑥10𝑦=15.

By this point it is already apparent that the second equation is the same as the first equation, except for every term being multiplied by 5. We can reasonably anticipate that there will be an infinite number of solutions, but we will initially proceed with no such assumptions. The augmented coefficient matrix of this system of equations is 12351015.

The second row is a copy of the first row after every entry has been multiplied by 5. The reason for writing the system of linear equations in this form is that it is easier to then manipulate the matrix into the form that represents the solution of the system. This can be understood by the following theorem.

Theorem: Augmented Coefficient Matrix and Reduced Echelon Form

If a system of linear equations has the augmented coefficient matrix 𝑎𝑎𝑎𝑏𝑎𝑎𝑎𝑏𝑎𝑎𝑎𝑏, then the solution to the system of equations is encoded by the reduced echelon form of this matrix.

We will demonstrate this theorem using the system of linear equations the we had above: 𝑥2𝑦=3,5𝑥10𝑦=15, which has the augmented coefficient matrix 12351015.

We have highlighted the pivots in the row above because we will use these as a reference to begin the process of putting the matrix in reduced echelon form. For every pivot, we need to ensure that all other entries in the same column have a value of zero. This means that we can choose to remove the pivot in the first row or the pivot in the second row. It makes most sense to keep the pivot in the top row, so we use the elementary row operation 𝑟𝑟5𝑟 to transform the matrix to 123000.

This matrix is now in reduced echelon form and, by the theorem above, it tells us the solution to the original system of linear equations. We could choose to write the solution out in the form 𝑥2𝑦=3, as we did above, but this convention will not be sufficient when we move onto systems of linear equations which have more than two variables, so we choose to write the solution in a particular format that will be useful to us later.

When expressing the solution of a system of linear equations, it is helpful to identify all variables that correspond to a pivot in the reduced echelon form of the matrix and then solve every outputted equation for these variables. In our case we only have one pivot variable, 𝑥, which we express in terms of the nonpivot variable 𝑦 as 𝑥=3+2𝑦.

Although it may seem trivial, we then write the nonpivot variables in terms of themselves: 𝑦=𝑦.

Aligning these two equations gives 𝑥=3+2𝑦𝑦=𝑦.

Although it may appear highly convoluted, this is an entirely adequate statement of the solution to the original system of equations. We can actually go further and group the terms in vector form as 𝑥𝑦=30+𝑦21.

It is generally considered a better practice to write any variables on the right-hand side as an independent parameter, so we instead write the solution as 𝑥𝑦=30+𝑡21.

Now that we have all of the tools that we need to solve any system of linear equations which has infinitely many solutions, we will apply the techniques to a series of questions which involve larger augmented coefficient matrices.

Example 1: A System of Linear Equations with 3 Variables and 3 Equations

Find a general solution to the linear system 012101125𝑥𝑦𝑧=111.

Answer

We first create the augmented coefficient matrix of the given system of linear equations: 012110111251.

We will use elementary row operations to reduce the matrix to reduced echelon form, thereby giving the solution to the original system of equations. To help with this, we first highlight the pivots of each row, which are the first nonzero entries: 012110111251.

It will be easier to reach reduced echelon form if we can keep a value of 1 in the top-left entry, so we first perform the row swap 𝑟𝑟, giving 125110110121.

We need to remove any nonzero entry that is above or below the pivot in the first row. There is only one such entry, which is the pivot in the second row. This can be eliminated with the row operation 𝑟𝑟𝑟, which produces the matrix 125102420121.

Whenever feasible, it is generally wisest to remove constant factors from entire rows if we can do this without introducing fractions into our calculations. In the matrix, every entry in the second row is divisible by 2, so we scale this row using the operation 𝑟12𝑟, which gives the simpler matrix 125101210121.

Now we must remove the nonzero entry that is below the pivot in the second row with the row operation 𝑟𝑟+𝑟. This gives the matrix 125101210000, which now has a zero row in the third row, meaning that there will be infinitely many solutions to the system, provided that it is not inconsistent. To remove the nonzero entry above the pivot in the second row, we perform 𝑟𝑟+2𝑟, which gives the reduced echelon form 101101210000.

Now we effectively have the solution to the original system of linear equations. Note that the variables corresponding to the pivots are 𝑥 and 𝑦, and the variable that does not correspond to any pivots is 𝑧. We write out the matrix in terms of the original variables, not including the zero row: 𝑥+𝑧=1𝑦2𝑧=1.

Solving both of these equations for the pivot variables gives 𝑥=1𝑧𝑦=1+2𝑧.

We also have the nonpivot variable 𝑧=𝑧. Writing these 3 equations together gives 𝑥=1𝑧𝑦=1+2𝑧𝑧=𝑧.

We could have equivalently expressed this solution in the form 𝑥𝑦𝑧=110+𝑡121, where 𝑡 is an independent parameter.

Example 2: A System of Linear Equations with 3 Variables and 3 Equations

Find a general solution to the linear system 012121145𝑥𝑦𝑧=111.

Answer

We first create the augmented coefficient matrix of the system of linear equations, highlighting the pivot entries: 012112111451.

We will need to put this matrix in reduced echelon form. To place a 1 in the top-left entry, we first swap rows 𝑟𝑟: 145112110121.

We cannot put the matrix in reduced echelon form if there are any pivots below the pivot in the top row, so we must remove the pivot that currently appears in the second row. The elementary row operation 𝑟𝑟𝑟 will achieve this: 145102420121.

It will be helpful if the pivot in the second row is made equal to 1, which in this case can be achieved without introducing fractions into the row, by using the row operation 𝑟12𝑟 to give 145101210121.

Now it is clear that the third row is just a copy of the second row, except for a sign change in every entry. We can therefore eliminate the entire third row with the row operation 𝑟𝑟+𝑟: 145101210000.

Reduced echelon form is then achieved with the final row operation 𝑟𝑟+4𝑟: 103301210000.

The variables 𝑥 and 𝑦 refer to the two columns which contain pivots and the variable 𝑧 does not. Writing out the corresponding equations 𝑥3𝑧=3𝑦2𝑧=1 and solving both for the pivot variables gives 𝑥=3+3𝑧𝑦=1+2𝑧.

Combining this with the nonpivot variable 𝑧=𝑧 gives 𝑥=3+3𝑧𝑦=1+2𝑧𝑧=𝑧.

This is the full solution and we also choose to write it in matrix form as 𝑥𝑦𝑧=310+𝑡321.

All of the techniques practiced above can be used immediately with a larger system of linear equations. As long as we are proficient with row operations and have a full understanding of reduced echelon form, the method is little different to either of the previous examples that we practiced. In the following example, we will show how exactly the same method can be applied to find the solution of a larger system of equations.

Example 3: A System of Linear Equations with 4 Variables and 4 Equations

Find a general solution to the linear system 1011111031323303𝑥𝑦𝑧𝑤=1243.

Answer

We create the augmented coefficient matrix and highlight the pivot entries: 10111111023132433033.

We need to remove all of the nonzero entries below the pivot in the first row. We can do this with the three elementary row operations 𝑟𝑟𝑟, 𝑟𝑟3𝑟, and 𝑟𝑟3𝑟. The resulting matrix is 10111010110101103300.

Given that the third row is a copy of the second row, we can remove the whole row with the operation 𝑟𝑟𝑟. This gives a matrix where the third row is a zero row: 10111010110000003300, which we move to the bottom of the matrix with the row swap operation 𝑟𝑟, giving 10111010110330000000.

The bottom two pivot variables can be scaled to give a value of 1, which is generally more convenient to work with, by the row operations 𝑟𝑟 and 𝑟13𝑟. This gives 10111010110110000000.

Now we need to remove the pivot entry in the third row because this is directly below the pivot in the second row. The row operation 𝑟𝑟𝑟 achieves this: 10111010110011100000.

In the penultimate step, we scale the pivot entry in the third row with 𝑟𝑟, which gives 10111010110011100000 and allows us to remove the remaining nonzero entry above the pivot in the third row, with the the row operation 𝑟𝑟𝑟: 10002010110011100000.

The matrix is now in reduced echelon form, with the pivots corresponding to the variables 𝑥, 𝑦, and 𝑧. The remaining variable 𝑤 is represented by the fourth column, which does not contain any pivots. Writing out the corresponding equations gives 𝑥=2𝑦+𝑤=1𝑧+𝑤=1.

We now solve the three equations for the pivot variables, giving 𝑥=2𝑦=1𝑤𝑧=1𝑤.

Considering the nonpivot variable 𝑤, the full solution is 𝑥=2𝑦=1𝑤𝑧=1𝑤𝑤=𝑤.

Written in vector form, we have 𝑥𝑦𝑧𝑤=2110+𝑡0111.

Although it was not strictly necessary, we chose to represent the above solutions in a very particular form, in terms of vectors and also in terms of some arbitrary parameter(s). The reasons for this are not entirely superficial, as it is often helpful to have a solution to a system of linear equations that is written in this vector form, encoding the key information that defines the associated vector space. Unlike a solution that is written in terms of equations, it is much easier to understand the key components of a vector space when they are presented in vector form.

So far in this explainer we have not discussed how we would write the solution in terms of vectors if there is more than one pivot variable. As an example, suppose that we had been given a system of linear equations, from which we had already created the augmented coefficient matrix and found the reduced echelon form 11022001130000000000.

Assuming that the original variables were 𝑥, 𝑦, 𝑧, and 𝑤, we would have the corresponding equations 𝑥𝑦+2𝑤=2,𝑧+𝑤=3.

The pivot variables would be 𝑥 and 𝑧 and the nonpivot variables would be 𝑦 and 𝑤. We solve the two above equations for the pivot variables: 𝑥=2+𝑦2𝑤,𝑧=3𝑤.

We now include the two trivial equations 𝑦=𝑦 and 𝑤=𝑤 for the nonpivot variables 𝑥=2+𝑦2𝑤𝑦=𝑦𝑧=3𝑤𝑤=𝑤.

Group these together as vectors and, using two independent parameters, we have the full solution 𝑥𝑦𝑧𝑤=2030+𝑠1100+𝑡2011.

In the above example we required 3 vectors to fully express the solution. Of course it is possible for there to be more than 3 vectors that must be used in order to express the full solution and the number of these vectors is related to the rank and nullity of a matrix, both of which are key concepts in linear algebra. It is also possible for there to be only one vector required to express the full solution, although this would mean that the solution is unique and that no parameters are needed to express the solution space.

Key Points

  • The augmented coefficient matrix of a system of linear equations can be operated on using rowoperations. When in reduced echelon form, the solution of the original system is found.
  • It is usually helpful to express the solution in terms of the pivot variables, which are the variables that correspond to the columns of a matrix that contain pivots when in reduced echelon form.
  • If there are infinitely many solutions to a system of linear equations, then these solutions should generally be expressed in terms of vectors with arbitrary parameters.

Join Nagwa Classes

Attend live sessions on Nagwa Classes to boost your learning with guidance and advice from an expert teacher!

  • Interactive Sessions
  • Chat & Messaging
  • Realistic Exam Questions

Nagwa uses cookies to ensure you get the best experience on our website. Learn more about our Privacy Policy