Gauss–Seidel method
   HOME

TheInfoList



OR:

In numerical linear algebra, the Gauss–Seidel method, also known as the Liebmann method or the method of successive displacement, is an
iterative method In computational mathematics, an iterative method is a Algorithm, mathematical procedure that uses an initial value to generate a sequence of improving approximate solutions for a class of problems, in which the ''i''-th approximation (called an " ...
used to solve a
system of linear equations In mathematics, a system of linear equations (or linear system) is a collection of two or more linear equations involving the same variable (math), variables. For example, : \begin 3x+2y-z=1\\ 2x-2y+4z=-2\\ -x+\fracy-z=0 \end is a system of th ...
. It is named after the German
mathematician A mathematician is someone who uses an extensive knowledge of mathematics in their work, typically to solve mathematical problems. Mathematicians are concerned with numbers, data, quantity, mathematical structure, structure, space, Mathematica ...
s
Carl Friedrich Gauss Johann Carl Friedrich Gauss (; ; ; 30 April 177723 February 1855) was a German mathematician, astronomer, geodesist, and physicist, who contributed to many fields in mathematics and science. He was director of the Göttingen Observatory and ...
and Philipp Ludwig von Seidel. Though it can be applied to any matrix with non-zero elements on the diagonals, convergence is only guaranteed if the matrix is either strictly diagonally dominant, or
symmetric Symmetry () in everyday life refers to a sense of harmonious and beautiful proportion and balance. In mathematics, the term has a more precise definition and is usually used to refer to an object that is invariant under some transformations ...
and positive definite. It was only mentioned in a private letter from Gauss to his student Gerling in 1823. A publication was not delivered before 1874 by Seidel.


Description

Let \mathbf A\mathbf x = \mathbf b be a square system of linear equations, where: \mathbf A = \begin a_ & a_ & \cdots & a_ \\ a_ & a_ & \cdots & a_ \\ \vdots & \vdots & \ddots & \vdots \\a_ & a_ & \cdots & a_ \end, \qquad \mathbf = \begin x_ \\ x_2 \\ \vdots \\ x_n \end , \qquad \mathbf = \begin b_ \\ b_2 \\ \vdots \\ b_n \end. When \mathbf A and \mathbf b are known, and \mathbf x is unknown, the Gauss–Seidel method can be used to iteratively approximate \mathbf x. The vector \mathbf x^ denotes the initial guess for \mathbf x, often \mathbf x^_i=0 for i=1,2,...,n. Denote by \mathbf^ the k-th approximation or iteration of \mathbf, and by \mathbf^ the approximation of \mathbf at the next (or k+1-th) iteration.


Matrix-based formula

The solution is obtained iteratively via \mathbf L \mathbf^ = \mathbf - \mathbf U \mathbf^, where the matrix \mathbf A is decomposed into a lower triangular component \mathbf L, and a strictly upper triangular component \mathbf U such that \mathbf = \mathbf L + \mathbf U . More specifically, the decomposition of A into L_* and U is given by: \mathbf A = \underbrace_ + \underbrace_ .


Why the matrix-based formula works

The system of linear equations may be rewritten as: :\begin \mathbf A \mathbf x &= \mathbf b \\ (\mathbf L + \mathbf U) \mathbf x &= \mathbf b \\ \mathbf L \mathbf x+ \mathbf U \mathbf x &= \mathbf b \\ \mathbf L \mathbf &= \mathbf - \mathbf U \mathbf \end The Gauss–Seidel method now solves the left hand side of this expression for \mathbf, using the previous value for \mathbf on the right hand side. Analytically, this may be written as \mathbf ^ = \mathbf L^ \left(\mathbf - \mathbf U \mathbf^\right).


Element-based formula

However, by taking advantage of the triangular form of \mathbf L, the elements of \mathbf^ can be computed sequentially for each row i using forward substitution: x^_i = \frac \left(b_i - \sum_^a_x^_j - \sum_^a_x^_j \right),\quad i=1,2,\dots,n. Notice that the formula uses two summations per iteration which can be expressed as one summation \sum_ a_x_j that uses the most recently calculated iteration of x_j. The procedure is generally continued until the changes made by an iteration are below some tolerance, such as a sufficiently small residual.


Discussion

The element-wise formula for the Gauss–Seidel method is related to that of the (iterative) Jacobi method, with an important difference: In Gauss-Seidel, the computation of \mathbf^ uses the elements of \mathbf^ that have already been computed, and only the elements of \mathbf^ that have not been computed in the (k+1)-th iteration. This means that, unlike the Jacobi method, only one storage vector is required as elements can be overwritten as they are computed, which can be advantageous for very large problems. However, unlike the Jacobi method, the computations for each element are generally much harder to implement in parallel, since they can have a very long critical path, and are thus most feasible for
sparse matrices In numerical analysis and scientific computing, a sparse matrix or sparse array is a matrix (mathematics), matrix in which most of the elements are zero. There is no strict definition regarding the proportion of zero-value elements for a matrix ...
. Furthermore, the values at each iteration are dependent on the order of the original equations. Gauss-Seidel is the same as successive over-relaxation with \omega=1.


Convergence

The convergence properties of the Gauss–Seidel method are dependent on the matrix \mathbf A. Namely, the procedure is known to converge if either: * \mathbf A is symmetric positive-definite, or * \mathbf A is strictly or irreducibly diagonally dominant. The Gauss–Seidel method may converge even if these conditions are not satisfied. Golub and Van Loan give a theorem for an algorithm that splits \mathbf A into two parts. Suppose \mathbf A = \mathbf M - \mathbf N is nonsingular. Let r = \rho (\mathbf M^ \mathbf N) be the
spectral radius ''Spectral'' is a 2016 Hungarian-American military science fiction action film co-written and directed by Nic Mathieu. Written with Ian Fried (screenwriter), Ian Fried & George Nolfi, the film stars James Badge Dale as DARPA research scientist Ma ...
of \mathbf M^ \mathbf N. Then the iterates \mathbf x^ defined by \mathbf M \mathbf x^ = \mathbf N \mathbf x^ + \mathbf b converge to \mathbf x = \mathbf A^ \mathbf b for any starting vector \mathbf x^ if \mathbf M is nonsingular and r < 1.


Algorithm

Since elements can be overwritten as they are computed in this algorithm, only one storage vector is needed, and vector indexing is omitted. The algorithm goes as follows: algorithm Gauss–Seidel method is inputs: , repeat until convergence for from 1 until do for from 1 until do if ≠ then end if end (-loop) end (-loop) check if convergence is reached end (repeat)


Examples


An example for the matrix version

A linear system shown as \mathbf A \mathbf = \mathbf is given by: \mathbf A= \begin 16 & 3 \\ 7 & -11 \\ \end \quad \text \quad \mathbf b = \begin 11 \\ 13 \end. Use the equation \mathbf^ = \mathbf L^ (\mathbf - \mathbf U \mathbf^) in the form \mathbf^ = \mathbf T \mathbf^ + \mathbf c where: :\mathbf T = - \mathbf L ^ \mathbf U \quad \text \quad \mathbf c = \mathbf L ^ \mathbf. Decompose \mathbf A into the sum of a lower triangular component \mathbf L and a strict upper triangular component U: \mathbf L= \begin 16 & 0 \\ 7 & -11 \\ \end \quad \text \quad \mathbf U = \begin 0 & 3 \\ 0 & 0 \end. The inverse of \mathbf L is: \mathbf L^ = \begin 16 & 0 \\ 7 & -11 \end^ = \begin 0.0625 & 0.0000 \\ 0.0398 & -0.0909 \\ \end. Now find: \begin \mathbf T & = - \begin 0.0625 & 0.0000 \\ 0.0398 & -0.0909 \end \begin 0 & 3 \\ 0 & 0 \end = \begin 0.000 & -0.1875 \\ 0.000 & -0.1194 \end, \\ ex \mathbf c & = \begin 0.0625 & 0.0000 \\ 0.0398 & -0.0909 \end \begin 11 \\ 13 \end = \begin 0.6875 \\ -0.7439 \end. \end With \mathbf T and \mathbf c the vectors \mathbf can be obtained iteratively. First of all, choose \mathbf^, for example \mathbf x^ = \begin 1.0 \\ 1.0 \end. The closer the guess to the final solution, the fewer iterations the algorithm will need. Then calculate: \begin \mathbf x^ &= \begin 0.000 & -0.1875 \\ 0.000 & -0.1193 \end \begin 1.0 \\ 1.0 \end + \begin 0.6875 \\ -0.7443 \end = \begin 0.5000 \\ -0.8636 \end. \\ ex \mathbf x^ &= \begin 0.000 & -0.1875 \\ 0.000 & -0.1193 \end \begin 0.5000 \\ -0.8636 \end + \begin 0.6875 \\ -0.7443 \end = \begin 0.8494 \\ -0.6413 \end. \\ ex \mathbf x^ &= \begin 0.000 & -0.1875 \\ 0.000 & -0.1193 \end \begin 0.8494 \\ -0.6413 \\ \end + \begin 0.6875 \\ -0.7443 \end = \begin 0.8077 \\ -0.6678 \end. \\ ex\mathbf x^ &= \begin 0.000 & -0.1875 \\ 0.000 & -0.1193 \end \begin 0.8077 \\ -0.6678 \end + \begin 0.6875 \\ -0.7443 \end = \begin 0.8127 \\ -0.6646 \end. \\ ex\mathbf x^ & = \begin 0.000 & -0.1875 \\ 0.000 & -0.1193 \end \begin 0.8127 \\ -0.6646 \end + \begin 0.6875 \\ -0.7443 \end = \begin 0.8121 \\ -0.6650 \end. \\ ex \mathbf x^ & = \begin 0.000 & -0.1875 \\ 0.000 & -0.1193 \end \begin 0.8121 \\ -0.6650 \end + \begin 0.6875 \\ -0.7443 \end = \begin 0.8122 \\ -0.6650 \end. \\ ex\mathbf x^ & = \begin 0.000 & -0.1875 \\ 0.000 & -0.1193 \end \begin 0.8122 \\ -0.6650 \end + \begin 0.6875 \\ -0.7443 \end = \begin 0.8122 \\ -0.6650 \end. \end As expected, the algorithm converges to the solution: \mathbf = \mathbf A ^ \mathbf \approx \begin 0.8122\\ -0.6650 \end. In fact, the matrix is strictly diagonally dominant, but not positive definite.


Another example for the matrix version

Another linear system shown as \mathbf A \mathbf = \mathbf is given by: \mathbf A= \begin 2 & 3 \\ 5 & 7 \\ \end \quad \text \quad \mathbf b = \begin 11 \\ 13 \\ \end. Use the equation \mathbf^ = \mathbf L^ (\mathbf - \mathbf U \mathbf^) in the form \mathbf^ = \mathbf T \mathbf^ + \mathbf c where: :\mathbf T = - \mathbf L ^ \mathbf U \quad \text \quad \mathbf c = \mathbf L^ \mathbf. Decompose \mathbf A into the sum of a lower triangular component \mathbf L and a strict upper triangular component \mathbf U: \mathbf L = \begin 2 & 0 \\ 5 & 7 \\ \end \quad \text \quad \mathbf U = \begin 0 & 3 \\ 0 & 0 \\ \end. The inverse of \mathbf L is: \mathbf L ^ = \begin 2 & 0 \\ 5 & 7 \\ \end^ = \begin 0.500 & 0.000 \\ -0.357 & 0.143 \\ \end . Now find: \begin \mathbf T & = - \begin 0.500 & 0.000 \\ -0.357 & 0.143 \\ \end \begin 0 & 3 \\ 0 & 0 \\ \end = \begin 0.000 & -1.500 \\ 0.000 & 1.071 \\ \end, \\ ex \mathbf c & = \begin 0.500 & 0.000 \\ -0.357 & 0.143 \\ \end \begin 11 \\ 13 \\ \end = \begin 5.500 \\ -2.071 \\ \end. \end With \mathbf T and \mathbf c the vectors \mathbf can be obtained iteratively. First of all, we have to choose \mathbf^, for example \mathbf ^ = \begin 1.1 \\ 2.3 \end Then calculate: \begin \mathbf ^ &= \begin 0 & -1.500 \\ 0 & 1.071 \\ \end \begin 1.1 \\ 2.3 \\ \end + \begin 5.500 \\ -2.071 \\ \end = \begin 2.050 \\ 0.393 \\ \end. \\ ex \mathbf ^ &= \begin 0 & -1.500 \\ 0 & 1.071 \\ \end \begin 2.050 \\ 0.393 \\ \end + \begin 5.500 \\ -2.071 \\ \end = \begin 4.911 \\ -1.651 \end. \\ ex \mathbf ^ &= \cdots. \end In a test for convergence we find that the algorithm diverges. In fact, the matrix \mathbf A is neither diagonally dominant nor positive definite. Then, convergence to the exact solution \mathbf = \mathbf A ^ \mathbf = \begin -38\\ 29 \end is not guaranteed and, in this case, will not occur.


An example for the equation version

Suppose given n equations and a starting point \mathbf _0. At any step in a Gauss-Seidel iteration, solve the first equation for x_1 in terms of x_, \dots, x_n; then solve the second equation for x_2 in terms of x_1 just found and the remaining x_, \dots, x_n; and continue to x_n. Then, repeat iterations until convergence is achieved, or break if the divergence in the solutions start to diverge beyond a predefined level. Consider an example: \begin 10x_1 &- x_2 &+ 2x_3 & & = 6, \\ -x_1 &+ 11x_2 &- x_3 &+ 3x_4 & = 25, \\ 2x_1 &- x_2 &+ 10x_3 &- x_4 & = -11, \\ & 3x_2 &- x_3 &+ 8x_4 & = 15. \end Solving for x_1, x_2, x_3 and x_4 gives: \begin x_1 & = x_2/10 - x_3/5 + 3/5, \\ x_2 & = x_1/11 + x_3/11 - 3x_4/11 + 25/11, \\ x_3 & = -x_1/5 + x_2/10 + x_4/10 - 11/10, \\ x_4 & = -3x_2/8 + x_3/8 + 15/8. \end Suppose is the initial approximation, then the first approximate solution is given by: \begin x_1 & = 3/5 = 0.6, \\ x_2 & = (3/5)/11 + 25/11 = 3/55 + 25/11 = 2.3272, \\ x_3 & = -(3/5)/5 +(2.3272)/10-11/10 = -3/25 + 0.23272-1.1 = -0.9873,\\ x_4 & = -3(2.3272)/8 +(-0.9873)/8+15/8 = 0.8789. \end Using the approximations obtained, the iterative procedure is repeated until the desired accuracy has been reached. The following are the approximated solutions after four iterations. The exact solution of the system is .


An example using Python and NumPy

The following iterative procedure produces the solution vector of a linear system of equations: import numpy as np ITERATION_LIMIT = 1000 # initialize the matrix A = np.array( [10.0, -1.0, 2.0, 0.0 [-1.0, 11.0, -1.0, 3.0">0.0,_-1.0,_2.0,_0.0.html" ;"title=" [10.0, -1.0, 2.0, 0.0"> [10.0, -1.0, 2.0, 0.0 [-1.0, 11.0, -1.0, 3.0 [2.0, -1.0, 10.0, -1.0], [0.0, 3.0, -1.0, 8.0], ] ) # initialize the RHS vector b = np.array([6.0, 25.0, -11.0, 15.0]) print("System of equations:") for i in range(A.shape : row = [f"*x" for j in range(A.shape ] print("[] = []".format(" + ".join(row), b[i])) x = np.zeros_like(b, np.float_) for it_count in range(1, ITERATION_LIMIT): x_new = np.zeros_like(x, dtype=np.float_) print(f"Iteration : ") for i in range(A.shape : s1 = np.dot(A , :i x_new i s2 = np.dot(A , i + 1 : x + 1 : x_new = (b - s1 - s2) / A , i if np.allclose(x, x_new, rtol=1e-8): break x = x_new print(f"Solution: ") error = np.dot(A, x) - b print(f"Error: ") Produces the output: System of equations: 10*x1 + -1*x2 + 2*x3 + 0*x4= 6 -1*x1 + 11*x2 + -1*x3 + 3*x4= 25 2*x1 + -1*x2 + 10*x3 + -1*x4= 11 0*x1 + 3*x2 + -1*x3 + 8*x4= 15Iteration 1: 0. 0. 0. 0.Iteration 2: 0.6 2.32727273 -0.98727273 0.87886364Iteration 3: 1.03018182 2.03693802 -1.0144562 0.98434122Iteration 4: 1.00658504 2.00355502 -1.00252738 0.99835095Iteration 5: 1.00086098 2.00029825 -1.00030728 0.99984975Iteration 6: 1.00009128 2.00002134 -1.00003115 0.9999881 Iteration 7: 1.00000836 2.00000117 -1.00000275 0.99999922Iteration 8: 1.00000067 2.00000002 -1.00000021 0.99999996Iteration 9: 1.00000004 1.99999999 -1.00000001 1. Iteration 10: 1. 2. -1. 1.Solution: 1. 2. -1. 1.Error: 2.06480930e-08 -1.25551054e-08 3.61417563e-11 0.00000000e+00


Program to solve arbitrary number of equations using Matlab

The following code uses the formula x^_i = \frac \left(b_i - \sum_a_x^_j - \sum_a_x^_j \right),\quad \begin i=1,2,\ldots,n \\ k=0,1,2,\ldots \end function x = gauss_seidel(A, b, x, iters) for i = 1:iters for j = 1:size(A,1) x(j) = (b(j) - sum(A(j,:)'.*x) + A(j,j)*x(j)) / A(j,j); end end end


See also

* Conjugate gradient method * Gaussian belief propagation * Iterative method: Linear systems * Kaczmarz method (a "row-oriented" method, whereas Gauss-Seidel is "column-oriented." See, for example
this paper
) * Matrix splitting * Richardson iteration


Notes


References

* . * . *


External links

*
Gauss–Seidel from www.math-linux.com
From Holistic Numerical Methods Institute
BicksonMatlab code
{{DEFAULTSORT:Gauss-Seidel Method Numerical linear algebra Articles with example pseudocode Relaxation (iterative methods) Articles with example Python (programming language) code Articles with example MATLAB/Octave code