Mathematical Methods of Physics/Vector Spaces

As is no doubt seen in elementary Physics, the notion of vectors, quantities that have a "magnitude" and a "direction" (whatever these may be) is very convenient in several parts of Physics. Here, we wish to put this idea on the rigorous foundation of Linear Algebra, to facilitate its further use in Physics. The interested reader is encouraged to look up the Wikibook Linear Algebra for details regarding the intricacies of the topic.

Vector Spaces
Let $$F$$ be field and let $$V$$ be a set. $$V$$ is said to be a Vector Space over $$F$$ along with the binary operations of addition and scalar product iff

$$\forall \mathbf{A},\mathbf{B},\mathbf{C}\in V$$

(i)  $$\mathbf{A}+\mathbf{B}=\mathbf{B}+\mathbf{A}$$  ...(Commutativity)

(ii) $$\mathbf{A}+(\mathbf{B}+\mathbf{C})=(\mathbf{A}+\mathbf{B})+\mathbf{C} $$ ...(Associativity)

(iii) $$\exists\mathbf{0}\in V$$ such that $$\mathbf{A}+\mathbf{0}=\mathbf{A}$$ ...(Identity)

(iv)$$\exists (\mathbf{-A})\in V$$ such that $$\mathbf{A}+\mathbf{-A}=\mathbf{0}$$ ...(Inverse)

$$\forall$$ $$a,b\in F$$

(v) $$(a+b)\mathbf{A}=a\mathbf{A}+b\mathbf{A}$$

(vi)$$a(\mathbf{A}+\mathbf{B})=a\mathbf{A}+a\mathbf{B}$$

(vii)$$a(b\mathbf{A})=(ab)\mathbf{A}$$

The elements of $$V$$ are called vectors while the elements of $$F$$ are called scalars. In most problems of Physics, the field $$F$$ of scalars is either the set of real numbers $$\mathbb{R}$$ or the set of complex numbers $$\mathbb{C}$$.

Examples of vector spaces:

(i) The set $$\mathbb{R}^n$$ over $$\mathbb{R}$$ can be visualised as the space of ordinary vectors "arrows" of elementary Physics.

(ii) The set of all real polynomials $$P(x)=a_0+a_1x+\ldots+a_nx^n$$ is a vector space over $$\mathbb{R}$$

(iii) Indeed, the set of all functions $$f:[a,b]\to\mathbb{R}$$ is also a vector spaces over $$\mathbb{R}$$, with addition and scalar multiplication defined as is usual.

Although the idea of vectors as "arrows" works well in most examples of vector spaces and is useful in solving problems, the latter two examples were deliberately provided as cases where this intuition fails to work.

Basis
A set $$E\subset V$$ is said to be linearly independent if and only if

$$a_1\mathbf{E}_1+a_2\mathbf{E}_2+\ldots+a_n\mathbf{E}_n=\mathbf{0}$$ implies that $$a_1=a_2=\ldots=0$$, whenever $$\mathbf{E}_1,\mathbf{E}_2,\ldots,\mathbf{E}_n\in E$$

A set $$E\subset V$$ is said to cover $$V$$ if for every $$\mathbf{A}\in V$$ there exist $$e_1,e_2,\ldots$$ such that $$\mathbf{A}=e_1\mathbf{E}_1+e_2\mathbf{E}_2+\ldots$$. (we leave the question of finiteness of the number of terms open at this point)

A set $$B\subset V$$ is said to be a basis for $$V$$ if $$B$$ is linearly independent and if $$B$$ covers $$V$$.

If a vector space has a finite basis with $$n$$ elements, the vector space is said to be n-dimensional

As an example, we can consider the vector space $$\mathbb{R}^3$$ over reals. The vectors $$(1,0,0);(0,1,0);(0,0,1)$$ form one of the several possible basis for $$\mathbb{R}^3$$. These vectors are often denoted as $$\hat{i},\hat{j},\hat{k}$$ or as $$\hat{x},\hat{y},\hat{z}$$

Theorem
Let $$V$$ be a vector space and let $$B=\{\mathbf{b}_1,\mathbf{b}_2,\ldots,\mathbf{b}_n\}$$ be a basis for $$V$$. Then any subset of $$V$$ with $$n+1$$ elements is linearly dependent.

Proof
Let $$E\subset V$$ with $$E=\{\mathbf{u}_1,\mathbf{u}_2,\ldots,\mathbf{u}_{n+1}\}$$

By definition of basis, there exist scalars $$a_{1i},a_{2i},\ldots,a_{ni}$$ such that $$u_i=\displaystyle\sum_m a_{mi}\mathbf{b}_m$$

Hence we can write $$\displaystyle\sum_{i=1}^{n+1}c_i\mathbf{u}_i=\mathbf{0}$$ as $$\displaystyle\sum_{m=1}^n\sum_{i=1}^{n+1}c_ia_{mi}\mathbf{b}_m=\mathbf{0}$$ that is

$$\displaystyle\sum_{m=1}^nc_1a_{1m}=0$$

$$\displaystyle\sum_{m=1}^nc_2a_{2m}=0$$

$$\ldots$$

$$\displaystyle\sum_{m=1}^nc_{n+1}a_{3(n+1)}=0$$

Which has a nontrivial solution for $$c_i$$. Hence $$E$$ is linearly dependent.

If a vector space has a finite basis of $$n$$ elements, we say that the vector space is n-dimensional

Inner Product
An in-depth treatment of inner-product spaces will be provided in the chapter on Hilbert Spaces. Here we wish to provide an introduction to the inner product using a basis.

Let $$V$$ be a vector space over $$\mathbb{R}$$ and let $$B=\{\mathbf{b}_1,\mathbf{b}_2,\ldots,\mathbf{b}_n\}$$ be a basis for $$V$$. Thus for every member $$\mathbf{u}$$ of $$V$$, we can write $$\mathbf{u}=\displaystyle\sum_{i=1}^n b_i\mathbf{b}_i$$. $$b_i$$ are called the components of $$\mathbf{u}$$ with respect to the basis $$B$$.

We define the inner product as a binary operation $$(\cdot) :V\times V\to\mathbb{R}$$ as $$\mathbf{x}\cdot\mathbf{y}=\displaystyle\sum_i x_iy_i$$, where $$x_i,y_i$$ are the components of $$\mathbf{x},\mathbf{y}$$ with respect to $$B$$

Note here that the inner product so defined is intrinsically dependent on the basis. Unless otherwise mentioned, we will assume the basis $$\hat{x},\hat{y},\hat{z}$$ while dealing with inner product of ordinary "vectors".

Linear Transformations
Let $$U$$, $$V$$ be vector spaces over $$F$$. A function $$T:U\to V$$ is said to be a Linear transformation if for all $$\mathbf{u}_1,\mathbf{u}_2\in U$$ and $$c\in F$$ if

(i)$$T(\mathbf{u}_1+\mathbf{u}_2)=T(\mathbf{u}_1)+T(\mathbf{u}_2)$$

(ii)$$T(c\mathbf{u}_1)=cT(\mathbf{u}_1)$$

Now let $$E=\{\mathbf{e}_1,\mathbf{e}_2,\ldots,\mathbf{e}_m\}$$ and $$F=\{\mathbf{f}_1,\mathbf{f}_2,\ldots,\mathbf{f}_n\}$$ be bases for $$U,V$$ respectively.

Let $$\mathbf{e'}_i=T(\mathbf{e}_i)$$. As $$F$$ is a basis, we can write $$\mathbf{e'}_i=\sum_j a_{ij}\mathbf{f}_j$$.

Thus, by linearity we can say that if $$T(\mathbf{u})=\mathbf{v}$$, we can write the components $$v_j$$ of $$\mathbf{v}$$ in terms of those of $$\mathbf{u}$$ as $$v_j=\sum_i u_ia_{ij}$$

The collection of coefficients $$a_{ij}$$ is called a matrix, written as

$$\mathbf{A}=\begin{pmatrix} a_{11} & a_{12} & a_{13} & \ldots & a_{1m} \\ a_{21} & a_{22} & a_{23} & \ldots & a_{2m} \\ a_{31} & a_{32} & a_{33} & \ldots & a_{3m} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & a_{n3} & \ldots & a_{nm} \\ \end{pmatrix}$$ and we can say that $$T$$ can be represented as a matrix $$\mathbf{A}$$ with respect to the bases $$E,F$$

Eigenvalue Problems
Let $$V$$ be a vector space over reals and let $$T:V\to V$$ be a linear transformations.

Equations of the type $$T\mathbf{u}=\lambda\mathbf{u}$$, to be solved for $$\mathbf{u}\in V$$ and $$\lambda\in\mathbb{R}$$ are called eigenvalue problems. The solutions $$\lambda$$ are called eigenvalues of $$T$$ while the corresponding $$\mathbf{u}$$ are called eigenvectors or eigenfunctions. (Here we take $$T\mathbf{u}=T(\mathbf{u})$$)