In the previous subsection, we saw that any m-by-n matrix A represents a linear transformation from the n-dimensional column vector space 𝔽n,1 to the m-dimensional column space 𝔽m,1. In this subsection, we show that, conversely, any linear transformation T : V ≌ 𝔽n,1 ⇾ W ≌ 𝔽m,1 can be interpreted as matrix multiplication from the left.

A linear transformation T : 𝔽n×1 ⇾ 𝔽m×1 is represented by a matrix A when T can be computed using multiplication by matrix A: \[ T({\bf x}) = \mathbf{A}\,\mathbf{x} , \qquad \forall \mathbf{x} \in \mathbb{F}^{n\times 1} . \]

Matrices of Linear Transformations

First, though, we show how to find the matrix that represents a given linear map between any two finite-dimensional vector spaces.

For any vector x ∈ V ≌ 𝔽n and ordered basis α = [e₁, e₂, … , en] of V, we have

\[ \mathbb{F}^n \ni \mathbf{x} = x_1 {\bf e}_1 + x_2 {\bf e}_2 + \cdots + x_n {\bf e}_n = \sum_{i=1}^n x_i {\bf e}_i , \]
where the coordinates (x₁, x₂, … , xn) identify the vector x uniquely. Applying the linear transformation T to x, we get
\[ T(\mathbf{x}) = T \left( \sum_{i=1}^n x_i \mathbf{e}_i \right) = \sum_{i=1}^n x_i T\left( \mathbf{e}_i \right) . \]
Since every vector T(ei) ∈ W ≌ 𝔽m can be expanded uniquely with respect to ordered basis β = [ε₁, ε₂, … , εm], we obtain
\[ T\left(\mathbf{e}_i \right) = a_{i,1} \varepsilon_1 + a_{i,2} \varepsilon_2 + \cdots + a_{i,m} \varepsilon_m = \sum_{j=1}^m a_{i,j} \varepsilon_j , \]
where the coefficients 𝑎i,j constitute the transformation matrix ⟦T⟧. This matrix defines the matrix multiplication operator (3) once the coordinates of the vectors x and T(x) are written in column form. This reveals a powerful fact:
Observation: If we know where a linear transformation T : V ≌ 𝔽n ⇾ W ≌ 𝔽m sends the ordered basis vectors, we can find its effect on any input vector x ∈ V.
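This observation translates directly into code. Below is a minimal Python/NumPy sketch (used here instead of Mathematica); the images T(eᵢ) are chosen to match the concrete map T(x₁, x₂, x₃) = (x₁ + x₂, x₂ − x₃), and no matrix is ever formed — only the values of T on the basis are used.

```python
import numpy as np

# Images of the standard basis vectors under T(x1,x2,x3) = (x1+x2, x2-x3):
T_e = [np.array([1.0, 0.0]),    # T(e1)
       np.array([1.0, 1.0]),    # T(e2)
       np.array([0.0, -1.0])]   # T(e3)

def T(x):
    """Compute T(x) from the basis images alone: T(x) = sum_i x_i T(e_i)."""
    return sum(xi * Tei for xi, Tei in zip(x, T_e))

x = np.array([2.0, -1.0, 3.0])
print(T(x))   # T(2,-1,3) = (2-1, -1-3) = (1, -4)
```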
   
Example 8: Let us consider the linear transformation defined by
\[ T \left( {\bf x} \right) = T \left( x_1 , x_2 , x_3 \right) = \left( x_1 + x_2 , x_2 - x_3 \right) . \tag{8.1} \]
In the standard ordered basis [i, j, k], linear transformation (8.1) is represented by the matrix
\[ {\bf A} = \begin{bmatrix} 1 & 1 & \phantom{-}0 \\ 0 & 1 & -1 \end{bmatrix} . \tag{8.2} \]
To find the null space of matrix A, suppose that x ∈ ker(T); then
\[ x_1 + x_2 = 0 \qquad\mbox{and} \qquad x_2 - x_3 = 0 . \]
Setting the free variable x₃ = t ∈ ℝ, we get
\[ x_1 = -t , \qquad x_2 = t , \qquad x_3 = t , \qquad t \in \mathbb{R} . \]
Hence ker(T) is the one-dimensional subspace of ℝ³ spanned by the vector (−1, 1, 1).
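A quick numerical check (sketched in Python/NumPy rather than Mathematica) confirms that this vector is annihilated by matrix (8.2):

```python
import numpy as np

A = np.array([[1, 1, 0],
              [0, 1, -1]])     # matrix (8.2)
v = np.array([-1, 1, 1])       # proposed spanning vector of ker(T)

print(A @ v)                   # -> [0 0]
```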

Let S be the subspace of ℝ³ spanned by i and k. If xS, then x must be of the form (𝑎, 0, b), and hence T(x) = (𝑎, −b). Clearly, T(S) = ℝ². Since the image of the subspace S is all of ℝ², it follows that the entire range of T must be ℝ².    ■

End of Example 8
Let T : V ⇾ W be a linear transformation, where V ≌ 𝔽n and W ≌ 𝔽m. Let α = [e₁, e₂, … , en] be an ordered basis (not necessarily standard) in V and β = [ε₁, ε₂, … , εm] be an ordered basis in W. The matrix representation of T with respect to α and β is \[ [\![ T ]\!]_{\alpha \to \beta} = \begin{bmatrix} \left[ T(\mathbf{e}_1) \right]_{\beta} & \left[ T(\mathbf{e}_2) \right]_{\beta} & \cdots & \left[ T(\mathbf{e}_n) \right]_{\beta} \end{bmatrix} , \] where [T(ei)]β is the column of coordinates in the expansion \[ T(\mathbf{e}_i) = a_{i,1} \varepsilon_1 + a_{i,2} \varepsilon_2 + \cdots +a_{i,m} \varepsilon_m . \] When V = W and α = β, we write the transformation matrix as \[ [\![ T ]\!]_{\alpha} = \begin{bmatrix} \left[ T(\mathbf{e}_1) \right]_{\alpha} & \left[ T(\mathbf{e}_2) \right]_{\alpha} & \cdots & \left[ T(\mathbf{e}_n) \right]_{\alpha} \end{bmatrix} . \] When α is the standard basis of 𝔽n, the subscript α is dropped.

   
Example 9: Let T : ℝ² ⇾ ℝ³ be the linear transformation defined by \[ T \left( {\bf x} \right) = T \left( x_1 , x_2 \right) = \left( x_1 - x_2 , x_2 , x_1 + x_2 \right) . \tag{9.1} \] Find the matrix representations of T with respect to the ordered bases α = {u₁ , u₂} and β = {b₁ , b₂, b₃}, where \[ {\bf u}_1 = \begin{pmatrix} 2 \\ 1 \end{pmatrix} , \qquad {\bf u}_2 = \begin{pmatrix} -1 \\ \phantom{-}3 \end{pmatrix} \] and \[ {\bf b}_1 = \begin{pmatrix} 1 \\ 1 \\ 0 \end{pmatrix} , \qquad {\bf b}_2 = \begin{pmatrix} -1 \\ \phantom{-}0 \\ \phantom{-}1 \end{pmatrix} , \qquad {\bf b}_3 = \begin{pmatrix} \phantom{-}0 \\ -1 \\ \phantom{-}1 \end{pmatrix} . \]

Solution: We must compute T(u₁) and T(u₂) and then transform the augmented matrix [b₁, b₂, b₃ | T(u₁), T(u₂)] to reduced row echelon form. Since \[ T({\bf u}_1 ) = \begin{pmatrix} 1 \\ 1 \\ 3 \end{pmatrix} , \qquad T({\bf u}_2 ) = \begin{pmatrix} -4 \\ 3 \\ 2 \end{pmatrix} , \] we get the augmented matrix \[ \left[ \begin{array}{ccc|cc} 1 & -1 & \phantom{-}0 & 1 & -4 \\ 1 & \phantom{-}0 & -1 & 1 & \phantom{-}3 \\ 0 & \phantom{-}1 & \phantom{-}1 & 3 & \phantom{-}2 \end{array} \right] \,\sim \,\left[ \begin{array}{ccc|cc} 1 & -1 & \phantom{-}0 &1 & -4 \\ 0 & \phantom{-}1 & -1 & 0 & \phantom{-}7 \\ 0 & \phantom{-}1 & \phantom{-}1 & 3 & \phantom{-}2 \end{array} \right] \,\sim \,\left[ \begin{array}{ccc|cc} 1 & -1 & \phantom{-}0 &1 & -4 \\ 0 & \phantom{-}1 & -1 & 0 & \phantom{-}7 \\ 0 & \phantom{-}0 & \phantom{-}2 & 3 & -5 \end{array} \right] \] Next, we eliminate the entries above the diagonal: \[ \left[ \begin{array}{ccc|cc} 1 & -1 & \phantom{-}0 & 1 & -4 \\ 0 & \phantom{-}1 & -1 & 0 & \phantom{-}7 \\ 0 & \phantom{-}0 & \phantom{-}2 & 3 & -5 \end{array} \right] \,\sim \,\left[ \begin{array}{ccc|cc} 1 & -1 & 0 &1 & -4 \\ 0 & \phantom{-}1 & 0 & 1.5 & \phantom{-}4.5 \\ 0 & \phantom{-}0 & 1 & 1.5 & -2.5 \end{array} \right] \,\sim \, \left[ \begin{array}{ccc|cc} 1 & 0 & 0 & 2.5 & \phantom{-}0.5 \\ 0 & 1 & 0 & 1.5 & \phantom{-}4.5 \\ 0 & 0 & 1 & 1.5 & -2.5 \end{array} \right] \] The matrix representing T with respect to the given ordered bases is \[ {\bf A} = \begin{bmatrix} 2.5 & \phantom{-}0.5 \\ 1.5 & \phantom{-}4.5 \\ 1.5 & -2.5 \end{bmatrix} . \] You may want to verify that \[ T({\bf u}_1 ) = \begin{pmatrix} 1 \\ 1 \\ 3 \end{pmatrix} = 2.5 \,{\bf b}_1 + 1.5 {\bf b}_2 + 1.5 {\bf b}_3 \]

b1 = {1, 1, 0}; b2 = {-1, 0, 1}; b3 = {0, -1, 1};
2.5*b1 + 1.5*b2 + 1.5*b3
{1., 1., 3.}
and \[ T({\bf u}_2 ) = \begin{pmatrix} -4 \\ 3 \\ 2 \end{pmatrix} = 0.5 {\bf b}_1 + 4.5 {\bf b}_2 - 2.5{\bf b}_3 . \]
0.5*b1 + 4.5*b2 - 2.5*b3
{-4., 3., 2.}
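Both coordinate vectors can also be found in one step by solving B X = [T(u₁) | T(u₂)] for the coordinate matrix, where B = [b₁ b₂ b₃]. A short cross-check in Python/NumPy (used here as an alternative to the Mathematica cells), with map (9.1) re-entered as a function:

```python
import numpy as np

B = np.column_stack(([1, 1, 0], [-1, 0, 1], [0, -1, 1]))  # [b1 b2 b3]

def T(x1, x2):
    return np.array([x1 - x2, x2, x1 + x2])               # map (9.1)

TU = np.column_stack((T(2, 1), T(-1, 3)))   # [T(u1) | T(u2)]
A = np.linalg.solve(B, TU)                  # beta-coordinates of the images
print(A)   # rows: [2.5, 0.5], [1.5, 4.5], [1.5, -2.5]
```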
   ■
End of Example 9
Theorem 3: Let T : 𝔽n,1 ⇾ 𝔽m,1 be a linear transformation. Then there exists a unique matrix A such that
\begin{equation} \label{EqTransform.4} T \left( {\bf x} \right) = {\bf A}\, {\bf x} \qquad\mbox{for all } {\bf x} \in \mathbb{F}^{n,1} . \end{equation}
In fact, A is the m × n matrix whose j-th column is the vector T(ej), where e1, e2, … , en is the list of standard basis vectors for 𝔽n:
\begin{equation} \label{EqTransform.5} {\bf A} = [\![ T ]\!] = \left[ T \left( {\bf e}_1 \right) , T \left( {\bf e}_2 \right) , \cdots , T \left( {\bf e}_n \right) \right] . \end{equation}

Write \( {\bf x} = {\bf I}_n {\bf x} = \left[ {\bf e}_1 \ \cdots \ {\bf e}_n \right] {\bf x} = x_1 {\bf e}_1 + \cdots + x_n {\bf e}_n , \) and use the linearity of T to compute
\begin{align*} T \left( {\bf x} \right) &= T \left( x_1 {\bf e}_1 + \cdots + x_n {\bf e}_n \right) = x_1 T \left( {\bf e}_1 \right) + \cdots + x_n T \left( {\bf e}_n \right) \\ &= \left[ T \left( {\bf e}_1 \right) \ \cdots \ T \left( {\bf e}_n \right) \right] \begin{bmatrix} x_1 \\ \vdots \\ x_n \end{bmatrix} = {\bf A} \, {\bf x} . \end{align*}
This representation is unique: if B is another matrix with T(x) = B x for all x, then A ej = B ej for every j, so the columns of A and B coincide and hence A = B.
   
Example 10: The transformation T from \( \mathbb{R}^4 \) to \( \mathbb{R}^3 \) defined by the equations
\begin{eqnarray*} 3\, x_1 -2\, x_2 + 5\, x_3 - 7\, x_4 &=& b_1 , \\ x_1 + 7\, x_2 -3\, x_3 + 5\, x_4 &=& b_2 , \\ 4\, x_1 -3\, x_2 + x_3 -6\, x_4 &=& b_3 \tag{10.1} \end{eqnarray*}
can be represented in matrix form as
\[ \begin{bmatrix} 3 & -2 & \phantom{-}5 & -7 \\ 1 & \phantom{-}7 & -3 & \phantom{-}5 \\ 4 & -3 & \phantom{-}1 & -6 \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \end{bmatrix} = \begin{bmatrix} b_1 \\ b_2 \\ b_3 \end{bmatrix} , \]
from which we see that the transformation can be interpreted as matrix multiplication from left by
\[ {\bf A} = \begin{bmatrix} 3 & -2 & \phantom{-}5 & -7 \\ 1 & \phantom{-}7 & -3 & \phantom{-}5 \\ 4 & -3 & \phantom{-}1 & -6 \end{bmatrix} . \tag{10.2} \]
Although the image under the transformation TA of any vector x in ℝ4 could be computed directly from system of equations (10.1), it is preferable to use matrix (10.2). Remember that you need to interpret vectors as column vectors and convert the final answer into a 3-tuple. For example, if
\[ {\bf x} = \begin{bmatrix} -1 \\ \phantom{-}2 \\ \phantom{-}3 \\ -4 \end{bmatrix} , \]
then
\[ T_{\bf A} \left( {\bf x} \right) = {\bf A}\, {\bf x} = \begin{bmatrix} 3 & -2 & \phantom{-}5 & -7 \\ 1 & \phantom{-}7 & -3 & \phantom{-}5 \\ 4 & -3 & \phantom{-}1 & -6 \end{bmatrix} \begin{bmatrix} -1 \\ \phantom{-}2 \\ \phantom{-}3 \\ -4 \end{bmatrix} = \begin{bmatrix} \phantom{-}36 \\ -16 \\ \phantom{-}17 \end{bmatrix} \in \mathbb{R}^{3,1} . \]
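The product above is easy to confirm numerically; a quick check in Python/NumPy (used here instead of Mathematica):

```python
import numpy as np

A = np.array([[3, -2, 5, -7],
              [1, 7, -3, 5],
              [4, -3, 1, -6]])   # matrix (10.2)
x = np.array([-1, 2, 3, -4])

print(A @ x)   # the image is (36, -16, 17)
```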
   ■
End of Example 10

Theorem 4: Every linear transformation from 𝔽n to 𝔽m is a matrix transformation, and conversely, every matrix transformation from 𝔽n,1 to 𝔽m,1 is a linear transformation.

From Theorem 3, we get the first part of the theorem for free:
for any linear transformation T : 𝔽n ⇾ 𝔽m, there exists a matrix A ∈ 𝔽m,n such that T(x) = A x, ∀x ∈ 𝔽n.

For the second part, suppose that T(x) = A x. Then we have:

  • T(x + y) = A(x + y) = A x + A y = T(x) + T(y).
  • T(αx) = A(αx) = α A x = α T(x).
These two properties are precisely the definition of a linear transformation, so every matrix transformation is linear.
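Both properties can be spot-checked numerically for any particular matrix; a minimal Python/NumPy sketch with an arbitrary randomly generated matrix (not tied to any example above):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 4))          # any matrix defines T(x) = A x
x, y = rng.standard_normal(4), rng.standard_normal(4)
alpha = 2.5

T = lambda v: A @ v
print(np.allclose(T(x + y), T(x) + T(y)))        # additivity: True
print(np.allclose(T(alpha * x), alpha * T(x)))   # homogeneity: True
```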
   
Example 11: The vector space ℂ of all complex numbers, considered over the field of complex numbers, has complex dimension 1 because its basis consists of the single element \( \{ 1 \} . \)

On the other hand, ℂ considered over the field of real numbers is a vector space of real dimension 2 because its basis consists of the two elements \( \{ 1, {\bf j} \} . \)

Let us consider the transformation: \[ T \left( {\bf z}_1 , {\bf z}_2 \right) = \left( {\bf z}_1 +{\bf j}\, {\bf z}_2 , - {\bf j}\,{\bf z}_1 + {\bf z}_2 \right) , \tag{11.1} \] where j is the imaginary unit, so j² = −1. The matrix corresponding to transformation (11.1) is \[ \left[ T \right] = \begin{bmatrix} 1 & {\bf j} \\ -{\bf j} & 1 \end{bmatrix} . \]    ■

End of Example 11

Figure: TA maps vectors to vectors.
     

Theorem 5: If TA : 𝔽n ⇾ 𝔽m and TB : 𝔽n ⇾ 𝔽m and TA(v) = TB(v) for every vector v ∈ 𝔽n, then A = B.

To say that \( T_{\bf A} \left( {\bf v} \right) = T_{\bf B} \left( {\bf v} \right) \) for every vector in \( \mathbb{F}^n \) is the same as saying that
\[ {\bf A}\,{\bf v} = {\bf B}\,{\bf v} \]
for every vector v in \( \mathbb{F}^n . \) This will be true, in particular, if v is any of the standard basis vectors \( {\bf e}_1 , {\bf e}_2 , \ldots , {\bf e}_n \) for \( \mathbb{F}^n ; \) that is,
\[ {\bf A}\,{\bf e}_j = {\bf B}\,{\bf e}_j \qquad (j=1,2,\ldots , n) . \]
Since every entry of ej is 0 except for the j-th, which is 1, it follows that Aej is the j-th column of A and Bej is the j-th column of B. Thus, \( {\bf A}\,{\bf e}_j = {\bf B}\,{\bf e}_j \) implies that corresponding columns of A and B are the same, and hence A = B.
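The key step of the proof, that A ej reads off the j-th column of A, is easy to confirm numerically; a small Python/NumPy sketch with an arbitrary illustrative matrix:

```python
import numpy as np

A = np.array([[3, -2, 5],
              [1, 7, -3]])      # an arbitrary illustrative matrix

n = A.shape[1]
I = np.eye(n)                   # columns of I are e_1, ..., e_n
for j in range(n):
    # multiplying by e_j extracts the j-th column of A
    assert np.array_equal(A @ I[:, j], A[:, j])
print("each A e_j equals column j of A")
```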
   
Example 12: We consider the Chebyshev differential operator of the third kind: \[ L_n \left[ x, \texttt{D} \right] = \left( 1- x^2 \right) \texttt{D}^2 - \left( 2x-1 \right) \texttt{D} + n \left( n+1 \right) \texttt{I} , \qquad \texttt{D} = \frac{\text d}{{\text d}x} , \] where I is the identity operator. For the particular case n = 3, we apply the Chebyshev operator L₃ to polynomials from the space ℝ≤3[x]. This vector space has standard basis β = [1, x, x², x³]. Applying L₃ to the members of basis β yields \begin{align*} L_3 \left[ x, \texttt{D} \right] 1 &= 12 , \\ L_3 \left[ x, \texttt{D} \right] x &= 1 - 2x + 12x = 1 + 10\,x , \\ L_3 \left[ x, \texttt{D} \right] x^2 &= 2 + 2 x + 6 x^2 , \\ L_3 \left[ x, \texttt{D} \right] x^3 &= 6 x + 3 x^2 . \end{align*} We check with Mathematica:
Expand[(1 - x^2 )*D[x^2 , x,x] + (1- 2*x)* D[ x^2 , x] + 12* x^2]
2 + 2 x + 6 x^2
Expand[(1 - x^2 )*D[x^3 , x,x] + (1- 2*x)* D[ x^3 , x] + 12* x^3]
6 x + 3 x^2
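The full coefficient matrix can also be assembled programmatically; a sketch in Python with SymPy (used here as an alternative to the Mathematica cells), reading off the β-coordinates of L₃ applied to each basis monomial:

```python
import sympy as sp

x = sp.symbols('x')

def L3(p):
    """Chebyshev operator of the third kind for n = 3: (1-x^2)D^2 + (1-2x)D + 12I."""
    return sp.expand((1 - x**2) * sp.diff(p, x, 2)
                     + (1 - 2*x) * sp.diff(p, x) + 12 * p)

basis = [sp.Integer(1), x, x**2, x**3]
# column i holds the beta-coordinates of L3 applied to the i-th basis element
M = sp.Matrix([[L3(b).coeff(x, row) for b in basis] for row in range(4)])
print(M)
```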
The corresponding matrix of this differential operator becomes \[ [\![ L_3 ]\!] = \begin{bmatrix} 12 & 1 & 2 & 0 \\ 0 & 10& 2 & 6 \\ 0 & 0 & 6 & 3 \\ 0 & 0 & 0 & 0 \end{bmatrix} . \]    ■
End of Example 12

  The above theorem tells us that there is a one-to-one correspondence between m-by-n matrices and matrix transformations from 𝔽n ≌ 𝔽n×1 to 𝔽m ≌ 𝔽m×1: every m×n matrix A generates exactly one matrix transformation (multiplication by A) from 𝔽n to 𝔽m, and every matrix transformation from 𝔽n,1 to 𝔽m,1 arises from exactly one m × n matrix. We call that matrix the standard matrix for the transformation; it is given by the formula:

\[ [\![T]\!] = \left[ T \left( {\bf e}_1 \right) \,|\, T \left( {\bf e}_2 \right) \,|\, \cdots \,| T \left( {\bf e}_n \right) \right] , \]
where e1, e2, … , en is the list of standard basis vectors for 𝔽n. This suggests the following procedure for finding standard matrices.

Algorithm for finding the standard matrix of a linear transformation:
T : V ≌ 𝔽nW ≌ 𝔽m.
Step 1: Find the images T(ei) of the standard basis vectors \( {\bf e}_1 , {\bf e}_2 , \ldots , {\bf e}_n \) for 𝔽n.
Step 2: Construct the matrix ⟦T⟧ that has the images obtained in Step 1 as its successive columns. This matrix is the standard matrix for the transformation.
   
Example 13: Find the standard matrix A of the linear transformation T : ℝ³ ⇾ ℝ² defined by:
\[ T \left( \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix} \right) = \begin{bmatrix} 3\,x_1 -2\,x_3 \\ 2\,x_2 + 5\,x_3 \end{bmatrix} . \]

To answer the question, we apply the linear transformation T to each standard basis vector:

\[ T \left( {\bf e}_1 \right) = 3\, {\bf i} = \begin{bmatrix} 3 \\ 0 \end{bmatrix} , \quad T \left( {\bf e}_2 \right) = 2\, {\bf j} = \begin{bmatrix} 0 \\ 2 \end{bmatrix} , \quad T \left( {\bf e}_3 \right) = -2\, {\bf i} + 5\,{\bf j} = \begin{bmatrix} -2 \\ 5 \end{bmatrix} . \]
Therefore, the standard matrix becomes
\[ T \left( \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix} \right) = {\bf A}\,{\bf x} = \begin{bmatrix} 3&0&-2 \\ 0&2&5 \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix} . \]
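The two-step algorithm translates directly into code; a Python/NumPy sketch (used here instead of Mathematica), with the function T re-implementing the formula of this example:

```python
import numpy as np

def T(x):
    x1, x2, x3 = x
    return np.array([3*x1 - 2*x3, 2*x2 + 5*x3])

# Step 1: images of the standard basis vectors (rows of the identity matrix);
# Step 2: assemble them as successive columns of the standard matrix.
A = np.column_stack([T(e) for e in np.eye(3)])
print(A)   # the standard matrix [[3, 0, -2], [0, 2, 5]]
```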
   ■
End of Example 13

 

