The set ℳm,n of all m × n matrices under the field of either real or complex numbers is a vector space of dimension m · n.
In order to determine how close two matrices are, and in order to define the convergence of sequences of matrices, a special concept of matrix norm is employed, with notation \( \| {\bf A} \| . \) A norm
is a function from a real or complex vector space to the nonnegative real numbers that satisfies the following conditions:
Since the set of all matrices admits the operation of multiplication in addition to the basic operation of addition (which is included in the definition of vector spaces), it is natural to require that matrix norm satisfies the special property:
Once a norm is defined, it is the most natural way of measure distance between two matrices A and B as d(A, B) = ‖A − B‖ = ‖B − A‖. However, not all distance functions have a corresponding norm. For example, a trivial distance that has no equivalent norm is d(A, A) = 0 and d(A, B) = 1 if A ≠ B.
The norm of a matrix may be thought of as its magnitude or length because it is a nonnegative number.
Their definitions are summarized below for an \( m \times n \) matrix A, to which corresponds a self-adjoint (m+n)×(m+n) matrix B:
This matrix norm is called the operator norm or induced norm.
The term "induced" refers to the fact that the definition of a norm for vectors such as A x and x is what enables the definition above of a matrix norm.
This definition of matrix norm is not computationally friendly, so we use other options. The most important norms are as follow
✼
The operator norm corresponding to the p-norm for vectors, p ≥ 1, is:
Theorem 5:
Let ‖ ‖ be any matrix norm, and let matrix
I + B is singular, where
I is the identity matrix.
Then ‖B‖ ≥ 1 for every matrix norm.
Mathematica has a special command for evaluating norms:
Norm[A] = Norm[A,2] for evaluating the Euclidean norm of the matrix A; Norm[A,1] for evaluating the 1-norm; Norm[A, Infinity] for evaluating the ∞-norm; Norm[A, "Frobenius"] for evaluating the Frobenius norm.
A = {{1, 2, 3}, {4, 5, 6}, {7, 8, 9}}
Norm[A]
Sqrt[3/2 (95 + Sqrt[8881])]
N[%]
16.8481
Example 3:
Evaluate the norms of the matrix
\( {\bf A} = \left[ \begin{array}{cc} \phantom{-}1 & -7 & 4 \\ -2 & -3 & 1\end{array} \right] . \)
The absolute column sums of A are \( 1 + | -2 | =3 \) , \( |-7| + | -3 | =10 , \) and \( 4+1 =5 . \)
The larger of these is 10 and therefore \( \| {\bf A} \|_1 = 10 . \)
Norm[A, 1]
10
The absolute row sums of A are \( 1 + | -7 | + 4 =12 \) and
\( | -2 | + |-3| + 1 = 6 ; \) therefore, \( \| {\bf A} \|_{\infty} = 12 . \)
Norm[Transpose[A], 1]
12
The Euclidean norm of A
is the largest singular value. So we calculate
This matrix\( {\bf A}\,{\bf A}^{\ast} \) has two eigenvalues
\( 40 \pm \sqrt{1205} . \) Hence, the Euclidean norm of the matrix A is
\( \sqrt{40 + \sqrt{1205}} \approx 8.64367 . \)
These matrices S and M have the same eigenvalues. Therefore, we found the Euclidean (operator) norm of A to be approximately 16.8481. Mathematica knows this norm:
Norm[A]
Sqrt[3/2 (95 + Sqrt[8881])]
The spectral radius of A is the largest eigenvalue:
The set ℳm,n of all m × n matrices under the field of either real or complex numbers is a vector space of dimension m · n.
In order to determine how close two matrices are, or to define the convergence of sequences of matrices, a special concept of matrix norm is employed, with notation \( \| {\bf A} \| . \) A norm
is a function from a real or complex vector space to the nonnegative real numbers that satisfies the following conditions:
The norm of a matrix may be thought of as its magnitude or length because it is a nonnegative number. There are known three kinds of matrix norms:
The operator norms are norms induced by a matrix considered as a linear operator from ℝm into ℝn for real scalars or ℂm into ℂn for complex scalars.
The entrywise norms treat an m-by-n matrix as the vector of length m · n. Therefore, these norms are directly related to norms in a vector space.
The norm notation ‖ · ‖ is heavily overloaded, obliging readers to disambiguate norms by paying close attention to the linguistic type and context of each norm’s argument. Therefore, the main norm notation ‖ · ‖ is subject to some indices, depending on author's preference. Moreover, the reader should be aware that different kinds of norms may lead to the same definition (for example, the Euclidean norm is actually the spectral norm in Schatten's sense).
For a rectangular m-by-n matrix A and given norms \( \| \ \| \)
in \( \mathbb{R}^n \mbox{ and } \mathbb{R}^m , \) the norm of A is defined as follows:
This matrix norm is called the operator norm or induced norm.
The term "induced" refers to the fact that the definition of a norm for vectors such as A x and x is what enables the definition above of a matrix norm.
This definition of matrix norm is not computationally friendly, so we use other options. The following popular norms are listed below.
For a rectangular m×n matrix A, there are known the following operator norms.
‖A‖₁ is the maximum absolute column sum of the matrix:
‖A‖₂
is the Euclidean norm, the greatest singular value of
A, which is the square root of the greatest eigenvalue of \( {\bf A}^{\ast} {\bf A} , \)
i.e., its spectral radius
where \( \lambda_{\max} \left( {\bf A}^{\ast} {\bf A} \right) \) is is the maximal
eigenvalue of \( {\bf A}^{\ast} {\bf A} , \) also is known as the spectral radius, and
\( \sigma_{\max} \left( {\bf A} \right) \) is the maximal singular value of A.
The induced matrix norms constitute a large and important part of possible matrix norms---there are known many non-induced norms.
The following very important non-induced norm is called after Ferdinand Georg Frobenius (1849--1917).
The Frobenius
norm\( \| \cdot \|_F : \mathbb{C}^{m\times n} \to \mathbb{R}_{+} \) is defined for a rectangular m-by-n matrix A by
where A* is the adjoint matrix to A.
Recall that the trace function returns the sum of diagonal entries of a square matrix.
Mathematica has a dedicated command for evaluating the Frobenius norm: Norm[A, "Frobenius"]
One can think of the Frobenius norm as taking the columns of the matrix, stacking them on top of each other to create a vector of size m×n, and then taking the vector 2-norm of the result.
The Frobenius norm is the matrix norm that is unitary invariant, i.e., it is conserved or invariant under a unitary transformation (such as a rotation).
For a norm to be unitary invariant, it should depend solely upon the singular values of the matrix.
So if B = R*A R with a unitary (orthogonal if real) matrix R satisfying R* = R-1, then
Besides the Frobenius norm, there are known other non-induced norms that treat an m-by-n matrix as the vector of length m · n. For example, the following "entrywise" norms are also widely used.
There is this unfortunate but unavoidable overuse of the notation.
\( \| {\bf A} \|_1 \) is the absolute sum of all elements of A:
Robert Schatten (1911--1977) suggested to define a matrix norm based on the singular values σi or eigenvalues. Therefore, these norms are called after him. We present three of them, with overlapping previously used notations.
We denote by r the rank of the rectangular m×n matrix A.
Again, we double check this answer with two approaches. First, we calculate the sum of eigenvalues of matrices A₃ and A₄:
N[Sum[Sqrt[Eigenvalues[A4][[i]]], {i, 1, 4}]]
15.556136510657433
and then repeat with matrix A₃:
N[Sum[Sqrt[Eigenvalues[A3][[i]]], {i, 1, 3}]]
15.556136510657433
Since the sum of eigenvalues of a square matrix is its trace, we find first of of the roots of matrix A₃. To determine its square root, we use Sylvester's method:
Then find its eigenvalues and add them. You may want to verify that the sum of absolute values of eigenvalues of 3×3 matrix A₃ is exactly the same as the corresponding sum for 4×4 matrix A₄.
The square of the Schatten 2-norm is the square of the Frobenius norm