Duality is a complex notion having ramifications in many topics of mathematics and physics. The notion of a dual space is useful because it lets us phrase many important concepts in linear algebra without introducing additional structure. In particular, we will see that we can formulate many notions involving inner products in a way that does not require an inner product, because ⟨φ|v⟩ can be thought of as the action of a linear functional ⟨φ| on a ket vector |v⟩. In addition, duality allows us to view subspaces as solution sets of systems of linear equations and vice versa.

When we speak about duality, a natural question arises: duality with respect to what? This section gives an introduction to this important topic and mostly considers basic duality concepts with respect to the field of scalars 𝔽. To some extent, a duality can be viewed as an operation that, when applied twice, returns an object similar to the original, analogous to a mirror image. For instance, in real life, when a person is married, his wife can be considered as a dual image with respect to the "marriage operation;" however, her marriage leads to a second duality---a husband, who is the same person if it is her first marriage, but may be a different husband in a second marriage. De Morgan's laws provide another example of duality.

Involutions provide a special type of duality: operations J : VV with J² = JJ = I, the identity operation. So marriage is not always an involution, but complex conjugation is. When an inner product is employed, it equips the vector space with a special structure that should be taken into account by duality. We discuss duality of Euclidean spaces in Part V.

Dual Spaces

Recall that the set of all linear transformations from one vector space V into another vector space W is denoted by ℒ(V, W). A particular case arises when we choose W = 𝔽, regarded as a one-dimensional coordinate vector space over itself.
Let V be a vector space over a field 𝔽 (which is either ℝ, ℂ, or ℚ). A linear functional (also known as a linear form or covector) on V is a linear map V ⇾ 𝔽. In other words, a linear functional T on V is a scalar-valued function that satisfies
\[ T \left( \alpha\,{\bf u} + \beta\,{\bf v} \right) = \alpha\,T \left( {\bf u} \right) + \beta\,T \left( {\bf v} \right) \]
for any vectors u, vV and any scalars α, β ∈ 𝔽.

Linear functionals can be thought of as giving us snapshots of vectors—knowing the value of φ(v) tells us what v looks like from one particular direction or angle (just like having a photograph tells us what an object looks like from one side), but not necessarily what it looks like as a whole. Alternatively, linear forms can be thought of as the building blocks that make up more general linear transformations. Indeed, every linear transformation into an n-dimensional vector space can be thought of as being made up of n linear forms (one for each of the n output dimensions).

Our first example, which may seem trivial, clarifies the concept of duality. Let V = ℂ be the set of all complex numbers over the field ℂ (itself). We consider the involution operation

\[ J\, : \ \mathbb{C} \,\mapsto \,\mathbb{C} , \qquad J \left( x + {\bf j}\,y \right) = z^{\ast} = x - {\bf j}\,y . \]
So J maps the complex plane into itself by reflecting it across the abscissa (called the real axis in ℂ). Note that we denote the complex conjugate by z* instead of the overline notation \( \displaystyle \overline{z} = \overline{x + {\bf j}\,y} = x - {\bf j}\,y , \) which is common in the mathematics literature. As you see, the asterisk notation is in agreement with the notation for dual spaces. When V is considered as a complex vector space, complex conjugation is not a linear operation because
\[ J \left( c\,z \right) = c^{\ast} z^{\ast} , \qquad c \in \mathbb{C} . \]
However, when V = ℂ is considered as a vector space over the field of real numbers, J is a linear transformation.
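These claims are easy to verify in Mathematica; here is a minimal sketch with concrete numbers chosen only for illustration (note that Mathematica writes the imaginary unit as I rather than j):

J[z_] := Conjugate[z];
Simplify[J[J[z]] == z]   (* J is an involution: applying it twice returns z *)
True
J[5 (3 - 4 I)] == 5 J[3 - 4 I]   (* J respects multiplication by the real scalar 5 *)
True
J[I (3 - 4 I)] == I J[3 - 4 I]   (* but not by the complex scalar c = I *)
False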

Example 1: Let us consider a fruit inventory in a supermarket, assuming that there are αᵃ apples, αᵇ bananas, αᶜ coconuts, and so on. We designate it as a vector
\[ {\bf x} = \alpha^{a} \hat{e}_a + \alpha^{b} \hat{e}_b + \alpha^{c} \hat{e}_c + \cdots \qquad (\mbox{finite number of distinct fruits}). \]
Fruit inventories like these form a space V that resembles a vector space: it is closed under addition and scalar multiplication (by nonnegative rational numbers). V is not truly a vector space because we do not define subtraction or multiplication by negative numbers. Basis elements \( \hat{e}_i \) may be considered as different containers because fruits are not allowed to be mixed.

Next, we consider a particular purchase price function, $p. Its values yield the cost of any fruit inventory x:

\begin{align*} \$_p ({\bf x}) &= \$_p \left( \alpha^{a} \hat{e}_a + \alpha^{b} \hat{e}_b + \alpha^{c} \hat{e}_c + \cdots \right) \\ &= \alpha^{a} \underbrace{\$_p \left( \hat{e}_a \right)}_{\mbox{price per apple}} + \alpha^{b} \underbrace{\$_p \left( \hat{e}_b \right)}_{\mbox{price per banana}} + \alpha^{c} \underbrace{\$_p \left( \hat{e}_c \right)}_{\mbox{price per coconut}} + \cdots . \end{align*}
Also consider another cost of fruit inventory y:
\[ \$_p ({\bf y}) = \$_p \left( \beta^{a} \hat{e}_a + \beta^{b} \hat{e}_b + \beta^{c} \hat{e}_c + \cdots \right) . \]
For any real (or rational) number k, we have
\begin{align*} \$_p ({\bf x} + k {\bf y}) &= \$_p \left( \left( \alpha^a + k \beta^{a} \right) \hat{e}_a + \left( \alpha^b + k \beta^{b} \right) \hat{e}_b + \left( \alpha^c + k \beta^{c} \right) \hat{e}_c + \cdots \right) \\ &= \$_p ({\bf x}) + k\, \$_p ({\bf y}) . \end{align*}
Thus, this purchase price function is a linear functional:
\[ \$_p \, : \, V \mapsto \mathbb{R} , \]
which provides a measurement (in dollars) of any fruit inventory (vector xV).    ■
End of Example 1
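To make Example 1 concrete, here is a minimal Mathematica sketch; the prices and inventory counts are invented purely for illustration:

prices = {1/2, 1/4, 3};   (* hypothetical price per apple, banana, coconut, in dollars *)
price[v_] := prices.v;    (* the purchase price functional acting on an inventory *)
x = {4, 6, 2};  y = {1, 12, 0};   (* two fruit inventories *)
price[x + 2 y] == price[x] + 2 price[y]   (* linearity of the price measurement *)
True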
Example 2: We consider several familiar vector spaces and linear functionals on them.

Let V be ℝ², a Cartesian product of two real lines. We consider a functional T : V ↦ ℝ, defined by

\[ T({\bf u}) = T(x,y) = x + 2\,y . \]
We leave it to the reader to verify that T is a linear functional.

Let ℭ[𝑎, b] be the set of all continuous functions on the closed interval [𝑎, b]. For any function f ∈ ℭ[𝑎, b], we define a functional T by

\[ T(f) = \int_a^b f(x)\,{\text d} x . \]
When you studied calculus, you learned that T is a linear functional.

You can define another linear functional on ℭ[𝑎, b]:

\[ T(f) = f(s) , \qquad s \in [a, b], \]
where s is an arbitrary (but fixed) point from the interval.

This functional can be generalized to obtain a sampling function. Let { s1, s2, … , sn } ⊂ [𝑎, b] be a specified collection of points in [𝑎, b], and let { k1, k2, … , kn } be a set of scalars. Then the function

\[ T(f) = \sum_{i=1}^n k_i f(s_i ) \]
is a linear functional on ℭ[𝑎, b].

Let us consider the set of all square matrices over some field 𝔽, which we denote by V = 𝔽n,n. Then evaluating the trace of a square matrix is a linear functional on V.

Let us consider the set ℘[t] of all polynomials in the variable t, which is a vector space over the field of constants 𝔽. Let k1, k2, … , kn be any n scalars and let t1, t2, … , tn be any n real numbers. Then the formula

\[ T(p) = \sum_{i=1}^n k_i p(t_i ) \]
defines a linear functional on ℘[t].    ■
End of Example 2
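The trace and integration functionals from Example 2 can be tested directly in Mathematica; a small sketch with inputs chosen only for illustration:

A = {{1, 2}, {3, 4}};  B = {{0, 5}, {6, 7}};
Tr[2 A + 3 B] == 2 Tr[A] + 3 Tr[B]   (* the trace is a linear functional on square matrices *)
True
Integrate[2 t^2 + 3 Sin[t], {t, 0, 1}] == 2 Integrate[t^2, {t, 0, 1}] + 3 Integrate[Sin[t], {t, 0, 1}]   (* definite integration is linear *)
True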

We observe that for any linear functional φ acting on any vector space V,

\[ \varphi (0) = \varphi (0\cdot 0) = 0 \cdot \varphi (0) = 0 . \]
That is why a linear functional is sometimes called homogeneous. In particular, for any vector v = (v1, v2,… , vn) ∈ ℂn and any set of n complex numbers k1, k2,… , kn ∈ ℂ, the formula
\[ \varphi ({\bf v}) = k_1 v_1 + k_2 v_2 + \cdots + k_n v_n \]
defines a linear functional, but
\[ \varphi ({\bf v}) = k_1 v_1 + k_2 v_2 + \cdots + k_n v_n + \alpha \]
does not when α ≠ 0.
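The failure of linearity when α ≠ 0 is easy to see in code; here is a tiny Mathematica check in which the weights and the shift α = 5 are arbitrary:

f[v_] := {1, 2}.v + 5;   (* a shifted map with α = 5 ≠ 0 *)
f[{0, 0}]                (* a linear functional must vanish at the zero vector *)
5
f[{1, 0} + {0, 1}] == f[{1, 0}] + f[{0, 1}]   (* additivity fails *)
False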

The set of all linear functionals from a vector space V into its field of scalars 𝔽, ℒ(V, 𝔽), deserves a special label.

The dual space of V, denoted by V* or V′, is the vector space of all linear functionals on V, i.e., V* = ℒ(V, 𝔽).
Note:    Unfortunately, there is no consistency in the notation for the dual space---some authors use an asterisk, others use a prime. Strictly speaking, we should use the prime because it corresponds to the transpose operation. However, all textbooks on linear algebra use the symbol "T" for the transpose and denote the dual space (in the finite-dimensional case) by V*.    ■

The dimension of ℒ(U, V) is the product of the dimensions of the vector spaces U and V. Since 𝔽 is a vector space of dimension one over itself, dimV* = dimV.

When dealing with linear functionals, it is convenient to follow Paul Dirac (1902--1984) and use his bra-ket notation (established in 1939). In quantum mechanics, a vector v is written in abstract ket form as |v⟩. A linear functional φ is written in bra form as ⟨φ|; it is also frequently called a covector. Then the functional φ acting on a vector v is written as

\[ \varphi ({\bf v}) = \left\langle \varphi \,\vert\, {\bf v} \right\rangle \]
rather than the traditional form φ(v) used in mathematics. So a bra vector acts according to some linear law on a ket vector to give a scalar output. For any ground field 𝔽, the bra-ket notation establishes the bilinear mapping
\begin{equation} \label{EqDual.1} \left\langle \cdot\,\vert \, \cdot \right\rangle : V^{\ast} \times V \,\mapsto \,\mathbb{F} . \end{equation}
Dirac's notation \eqref{EqDual.1} provides a duality between a vector space V and its dual space V*. The most important example of a linear functional is provided by the dot product when 𝔽 = ℝ:
\[ \delta_{\bf v} ({\bf u}) = {\bf v} \bullet {\bf u} = v_1 u_1 + v_2 u_2 + \cdots + v_n u_n . \]
This is actually the “standard” example of a linear form, and the one that we should keep in mind as our intuition builder. We will see shortly that every linear functional on a finite-dimensional vector space can be written in this way.

Note:    Although the Dirac symbol \eqref{EqDual.1} is analogous to the inner product, the vectors inside the bra-kets are from different spaces! Later, in Part 5, you will learn the Riesz representation theorem, which establishes an isomorphism between the two spaces V and V* (see Dual transformations in Part 5).    ▣

It is common to consider kets as column vectors and bras as row vectors. Then you can multiply the 1×n matrix ⟨φ| (a row vector) by the n×1 matrix |v⟩ (a column vector) to obtain a 1×1 matrix, which is isomorphic to a scalar. Strictly speaking, this operation should be written as ⟨φ|·|v⟩ or, dropping the dot for multiplication, ⟨φ| |v⟩, but the double vertical lines are replaced by a single one. Moreover, Dirac's notation allows us to combine bras, kets, and linear operators (which are matrices in the finite-dimensional case) together and interpret them using matrix multiplication:

\[ \left\langle \varphi\, \vert\, {\bf A}\,\vert\, {\bf v} \right\rangle , \]
where A is an operator acting either on the ket vector v (from left to right) or on the covector (= bra) ⟨φ| (from right to left). This naturally leads to the inner product (see the corresponding section in Part 5). Bra-ket notation is also known as Dirac notation, despite the notation having a precursor in Hermann Grassmann's use of [ ϕ ∣ ψ ] for inner products nearly 100 years earlier.
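In matrix terms, the combination ⟨φ|A|v⟩ is just row · matrix · column; here is a short Mathematica illustration with entries chosen arbitrarily:

bra = {{1, 0, 2}};                        (* ⟨φ| as a 1×3 row matrix *)
A = {{1, 2, 0}, {0, 1, 1}, {3, 0, 1}};    (* a linear operator as a 3×3 matrix *)
ket = {{4}, {5}, {6}};                    (* |v⟩ as a 3×1 column matrix *)
bra.A.ket                                 (* a 1×1 matrix, isomorphic to a scalar *)
{{50}}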

Theorem 1: The set V* of all linear functionals on a vector space V is itself a vector space.
The dual space V* is obviously closed under the two operations of addition of functionals and multiplication by a scalar (a small Mathematica sketch after the list below illustrates this closure). Therefore, we need only check the eight axioms used in the definition of a vector space.

  1. φ + ψ = ψ + φ for φ, ψ ∈ V*;
  2. φ + (ψ + χ) = (φ + ψ) + χ for φ, ψ, χ ∈ V*;
  3. the zero element is the constant zero functional;
  4. the additive inverse of ψ ∈ V* is −ψ ∈ V*;
  5. (ks)ψ = k(sψ) for ψ ∈ V* and k, s ∈ 𝔽;
  6. 1ψ = ψ for ψ ∈ V*;
  7. k(φ + ψ) = kφ + kψ for φ, ψ ∈ V* and k ∈ 𝔽;
  8. (k + s)ψ = kψ + sψ for k, s ∈ 𝔽 and ψ ∈ V*.
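Here is the promised Mathematica sketch of the closure properties: the sum of two functionals and a scalar multiple of a functional are again linear (the coefficient vectors are arbitrary choices):

phi[v_] := {1, 0, 2}.v;  psi[v_] := {3, -1, 0}.v;   (* two functionals on ℝ³ *)
sum[v_] := phi[v] + psi[v];                          (* their pointwise sum *)
u = {u1, u2, u3};  w = {w1, w2, w3};
Expand[sum[a u + b w]] == Expand[a sum[u] + b sum[w]]   (* the sum is again linear *)
True
Expand[7 phi[a u + b w]] == Expand[a (7 phi[u]) + b (7 phi[w])]   (* so is the scalar multiple 7φ *)
True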
Example 3: Let the space V be ℝn; what is its dual? To answer this question, we first consider the trivial case n = 1. Then the dual space ℝ* consists of all linear transformations from ℝ into ℝ, which is denoted by ℒ(ℝ, ℝ). We know that linear real functions are all of the form f(x) = c·x, a scalar multiple of x, with c ∈ ℝ. Therefore, we conclude that the dual of ℝ is isomorphic (ℝ* ∋ f ↦ c ∈ ℝ) to ℝ itself.

In the general case n > 1, any linear functional on ℝn has the form \[ \mathbb{R}^n \ni {\bf x} = \left( x_1 , x_2 , \ldots , x_n \right) \,\mapsto \,a_1 x_1 + a_2 x_2 + \cdots + a_n x_n \in \mathbb{R} , \tag{3.1} \] for some real numbers 𝑎1, 𝑎2, … , 𝑎n. Since there are exactly n real parameters 𝑎1, 𝑎2, … , 𝑎n defining a linear functional on ℝn, the dual space is isomorphic to ℝn.

It remains to prove that any linear functional on ℝn is represented by formula (3.1). Let φ ∈ ℒ(ℝn, ℝ) be an arbitrary linear functional (or linear form). Because of its additivity, \[ \varphi \left( x_1 , x_2 , \ldots , x_n \right) = \varphi \left( x_1 , 0, \ldots , 0 \right) + \varphi \left( 0, x_2 , 0, \ldots , 0 \right) + \cdots + \varphi \left( 0, 0, \ldots , 0, x_n \right) . \] On every component xi ∈ ℝ, the linear form φ acts linearly, so there exists a constant 𝑎i such that \[ \varphi \left( 0, 0, \ldots , 0, x_i , 0 , \ldots , 0 \right) = a_i x_i \] because it is actually a functional acting on the one-dimensional space ℝ.

So, the dual of ℝn is isomorphic to ℝn itself. The same holds true for ℚn or ℂn, of course, as well as for 𝔽n, where 𝔽 is an arbitrary field. Since the space V over a field 𝔽 (we use only either ℚ or ℝ or ℂ) of dimension n is isomorphic to 𝔽n, and the dual to 𝔽n is isomorphic to 𝔽n, we can conclude that the dual V* is isomorphic to V.

Now let us discuss some convenient isomorphic representations of covectors acting on ℝn. We replace the direct product ℝn = ℝ × ℝ × ⋯ × ℝ by its isomorphic image ℝn×1 of column vectors. So instead of n-tuples we consider column vectors, which are matrices of size n × 1. Then a linear transformation T : ℝn×1 ⇾ ℝm×1 is represented by multiplication by an m × n matrix. Therefore, it is convenient to identify ℝn with the n-dimensional column vector space ℝn×1; then a linear functional on ℝn ≌ ℝn×1 (i.e., a linear transformation φ : ℝn ⇾ ℝ) is given by a 1 × n matrix (a row), which we denote by a bra vector. The collection of all such rows is isomorphic to ℝn×1 (the isomorphism is given by taking the transpose of a ket vector).

Remember that identifying the direct product 𝔽n with the column vector space 𝔽n×1 and its dual space with the row vector space 𝔽1×n is just a convenient convention that is reminiscent of matrix multiplication. However, any matrix is a list of lists, so a 1×1 matrix is a list containing one scalar, which is not the same as that scalar. From a mathematical and computational point of view, a 1 × 1 matrix is not a scalar, but it is isomorphic to one. Mathematica distinguishes these two objects:

a = {{1, 2}}.{3, 1}   (* a 1×2 row matrix times a length-2 vector *)
{5}
SameQ[a, 5]           (* the length-1 list {5} is not the scalar 5 *)
False
For a bra vector ⟨φ∣ = [φ1, φ2, … , φn], the action on a ket vector ∣v⟩ = [v1, v2, … , vn] ∈ 𝔽n is given by the dot product
\[ \varphi ({\bf v}) = \langle \varphi \mid {\bf v} \rangle = \sum_{i=1}^n \varphi_i v^i , \]
which resembles the matrix multiplication of a row vector (bra) by a column vector (ket).    ■
End of Example 3

A definite state of a physical system is represented as a state ket or state vector. However, we can obtain physical information about the system only upon some measurement, which is achieved by applying a bra vector to the state ket. In short, the state ket as such yields relevant information about the system only upon measurement of observables, which is accomplished by taking products of bras and kets.

We consider here only finite-dimensional spaces because for infinite-dimensional spaces the dual space consists not of all but only of the so-called bounded linear functionals. Without giving the precise definition, let us only mention that in the finite-dimensional case (when both the domain and the target space are finite-dimensional) all linear transformations are bounded, so we do not need to mention the word bounded (or continuous).

Lemma 1: Let V be a vector space over 𝔽. For any nonzero vector vV , there exists a linear functional φ ∈ V* such that φ(v) = ⟨φ∣v⟩ ≠ 0.
Since v0 in V, one can extend it to a basis β = {e1 = v, e2, … , en} of V. Then every vector uV has a unique expansion \[ {\bf u} = c_1 {\bf e}_1 + c_2 {\bf e}_2 + \cdots + c_n {\bf e}_n \] with scalars c1, c2, … , cn depending on u. We define a linear functional φ : V ⇾ 𝔽 by φ(u) = c1. Clearly, φ is well-defined and linear. Also, φ(v) = 1 ≠ 0 by construction.
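The construction in this proof can be carried out explicitly; here is a Mathematica sketch for one particular nonzero vector in ℝ³, where the basis extension is an arbitrary choice among many:

v = {2, 1, 0};
basis = {v, {0, 1, 0}, {0, 0, 1}};               (* extend v to a basis of ℝ³ *)
coords[u_] := LinearSolve[Transpose[basis], u];  (* coordinates of u in this basis *)
phi[u_] := First[coords[u]];                     (* φ picks the first coordinate *)
{phi[v], phi[{0, 1, 0}], phi[{7, 8, 9}]}
{1, 0, 7/2}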
Example 4: Let us consider the set of all polynomials with real coefficients. Any nonzero polynomial p(x) ∈ ℝ[x] contains a term cₚxᵖ with cₚ ≠ 0 for some integer p: \[ p(x) = c_0 + c_1 x + c_2 x^2 + \cdots + c_m x^m , \qquad c_p \ne 0 . \] We define a covector that assigns to every polynomial p ∈ ℝ[x] its p-th coefficient: \[ \varphi_p \left( c_0 + c_1 x + c_2 x^2 + \cdots + c_m x^m \right) = c_p \in \mathbb{R} . \] This is a linear functional on the space ℝ[x] that takes a nonzero value on every polynomial containing the monomial cₚxᵖ with cₚ ≠ 0.    ■
End of Example 4
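A hedged Mathematica version of this covector, using the built-in Coefficient to read off the p-th coefficient (here p = 2, and the test polynomials are made up):

phi2[q_] := Coefficient[q, x, 2];   (* the covector φ₂: extract the coefficient of x² *)
phi2[3 + 5 x^2 - x^4]
5
phi2[4 (3 + 5 x^2 - x^4) + 10 (x - x^2)] == 4*5 + 10*(-1)   (* linearity check *)
True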
Exercises

  1. Let us consider the set V = ℂ of complex numbers as a real vector space. In this space, addition is defined as usual, and multiplication of a complex number z = x + jy by a real number k is defined as kz = kx + jky. Here j is the unit imaginary vector on the complex plane, so j² = −1.

    Your task is to identify which of the following functions are linear functionals on V = ℂ.

    1. T(z) = T(x + jy) = y;
    2. T(z) = T(x + jy) = x + y;
    3. T(z) = T(x + jy) = x²;
    4. T(z) = T(x + jy) = x − jy;
    5. T(z) = T(x + jy) = x² + y²;
    6. \( T(z) = T(x + {\bf j}y ) = \sqrt{x^2 + y^2} . \)
  2. Identify which of the following formulas define a linear functional acting on a polynomial x(t).
    1. \( T(x) = \int_0^2 x(t)\,{\text d} t ; \)
    2. \( T(x) = \int_0^2 t^3 x(t)\,{\text d} t ; \)
    3. \( T(x) = \int_0^2 x(t^3 )\,{\text d} t ; \)
    4. T(x) = x(0);
    5. \( T(x) =\frac{{\text d} x}{{\text d} t} ; \)
    6. \( T(x) = \left. \frac{{\text d} x}{{\text d} t} \right\vert_{t=0} . \)
  3. Let v1, v2,… , vn+1 be a system of vectors in an 𝔽-vector space V such that there exists a dual system v¹, v²,… , vⁿ⁺¹ of linear functionals satisfying \[ {\bf v}^j \left( {\bf v}_i \right) = \langle {\bf v}^j \, | \, {\bf v}_i \rangle = \delta_{i,j} . \]
    1. Show that the system v1, v2,… , vn+1 is linearly independent.
    2. Show that if the system v1, v2,… , vn+1 is not generating, then the “biorthogonal” system v¹, v²,… , vⁿ⁺¹ is not unique.
  4. Define a non-zero linear functional φ on ℚ³ such that if x₁ = (1, 2, 3) and x₂ = (−1, 1, −2), then ⟨φ | x₁⟩ = ⟨φ | x₂⟩ = 0.
  5. The vectors x₁ = (3, 2, 1), x₂ = (−2, 1, −1), and x₃ = (−1, 3, 2) form a basis in ℝ³. If { e¹, e², e³ } is the dual basis, and if x = (1, 2, 3), find ⟨e¹ | x⟩ and ⟨e² | x⟩.
  6. Prove that if φ is a linear functional on an n-dimensional vector space V, then the set of all those vectors x for which ⟨φ | x⟩ = 0 is a subspace of V; what is the dimension of that subspace?
  7. If φ(x) = x₁ + 2x₂ + 3x₃ whenever x = (x₁, x₂, x₃) ∈ ℝ³, then φ is a linear functional on ℝ³. Find a basis of the subspace consisting of all those vectors x for which ⟨φ | x⟩ = 0.
  8. If R and S are subspaces of a vector space V, and if RS, prove that S⁰ ⊂ R⁰.
  9. Prove that if S is any subset of a finite-dimensional vector space, then S⁰⁰ coincides with the subspace spanned by S.
  10. Prove that if R and S are subspaces of a finite-dimensional vector space V, then (RS)⁰ = R⁰ + S⁰ and (R + S)⁰ = R⁰ ∩ S⁰.
  11. Prove the converse: given any basis β* = { φ1, φ2, … , φn } of V*, we can construct a dual basis { e1, e2, … , en } of V so that the functionals φ1, φ2, … , φn serve as coordinate functions for this basis.
  12. Consider ℝ³ with basis β = {v₁, v₂, v₃}, where v₁ = (−1, 4, 3), v₂ = (3, 2, −2), v₃ = (3, 2, 0). Find the dual basis β*.

