
Mathematical Methods of Physics/Vector Spaces

As is no doubt familiar from elementary Physics, the notion of vectors, that is, quantities that have a "magnitude" and a "direction" (whatever these may be), is very convenient in several parts of Physics. Here, we wish to put this idea on the rigorous foundation of Linear Algebra, to facilitate its further use in Physics. The interested reader is encouraged to look up the Wikibook Linear Algebra for details regarding the intricacies of the topic.

Vector Spaces

Let F be a field and let V be a set. V, together with the binary operations of addition (V\times V\to V) and scalar multiplication (F\times V\to V), is said to be a vector space over F if and only if

\forall \mathbf{A},\mathbf{B},\mathbf{C}\in V

(i) \mathbf{A}+\mathbf{B}=\mathbf{B}+\mathbf{A} ...(Commutativity)

(ii) \mathbf{A}+(\mathbf{B}+\mathbf{C})=(\mathbf{A}+\mathbf{B})+\mathbf{C} ...(Associativity)

(iii) \exists\mathbf{0}\in V such that \mathbf{A}+\mathbf{0}=\mathbf{A} ...(Identity)

(iv) \exists(-\mathbf{A})\in V such that \mathbf{A}+(-\mathbf{A})=\mathbf{0} ...(Inverse)

\forall a,b\in F and \forall\mathbf{A},\mathbf{B}\in V

(v) (a+b)\mathbf{A}=a\mathbf{A}+b\mathbf{A} ...(Distributivity over scalar addition)

(vi) a(\mathbf{A}+\mathbf{B})=a\mathbf{A}+a\mathbf{B} ...(Distributivity over vector addition)

(vii) a(b\mathbf{A})=(ab)\mathbf{A} ...(Compatibility of scalar multiplication)

(viii) 1\mathbf{A}=\mathbf{A}, where 1 is the multiplicative identity of F ...(Scalar identity)

The elements of V are called vectors while the elements of F are called scalars. In most problems of Physics, the field F of scalars is either the set of real numbers \mathbb{R} or the set of complex numbers \mathbb{C}.
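
As a quick numerical sanity check, the axioms can be spot-checked for \mathbb{R}^3 over \mathbb{R} with componentwise operations. The following is only a sketch: the random vectors stand in for \mathbf{A},\mathbf{B},\mathbf{C}, and the scalars are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)
A, B, C = rng.standard_normal((3, 3))  # three arbitrary vectors in R^3
a, b = 2.0, -1.5                       # two arbitrary scalars from R

print(np.allclose(A + B, B + A))               # (i)   commutativity
print(np.allclose(A + (B + C), (A + B) + C))   # (ii)  associativity
print(np.allclose((a + b) * A, a*A + b*A))     # (v)   distributivity
print(np.allclose(a * (b * A), (a * b) * A))   # (vii) compatibility
```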


Examples of vector spaces:

(i) The set \mathbb{R}^n over \mathbb{R} can be visualised as the space of the ordinary vectors ("arrows") of elementary Physics.

(ii) The set of all real polynomials P(x)=a_0+a_1x+\ldots+a_nx^n is a vector space over \mathbb{R}.

(iii) Indeed, the set of all functions f:[a,b]\to\mathbb{R} is also a vector space over \mathbb{R}, with addition and scalar multiplication defined pointwise in the usual way.

Although the idea of vectors as "arrows" works well in most examples of vector spaces and is useful in solving problems, the latter two examples were deliberately provided as cases where this intuition fails to work.
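
To make the third example concrete, here is a minimal Python sketch in which functions are treated as vectors under pointwise operations; the helpers add and scale are ours, purely for illustration.

```python
# Functions f: [a, b] -> R form a vector space under pointwise operations.
def add(f, g):
    """Vector addition: (f + g)(x) = f(x) + g(x)."""
    return lambda x: f(x) + g(x)

def scale(c, f):
    """Scalar multiplication: (c f)(x) = c * f(x)."""
    return lambda x: c * f(x)

zero = lambda x: 0.0  # the zero vector: the function that is identically zero

f = lambda x: x**2
g = lambda x: 3*x + 1

h = add(f, scale(2.0, g))  # the "vector" f + 2g
print(h(1.5))              # f(1.5) + 2*g(1.5) = 2.25 + 11.0 = 13.25
print(add(f, zero)(1.5))   # adding the zero vector changes nothing: 2.25
```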

Basis

A set E\subset V is said to be linearly independent if and only if

a_1\mathbf{E}_1+a_2\mathbf{E}_2+\ldots+a_n\mathbf{E}_n=\mathbf{0} implies that a_1=a_2=\ldots=a_n=0, for any finite collection of distinct vectors \mathbf{E}_1,\mathbf{E}_2,\ldots,\mathbf{E}_n\in E and scalars a_1,a_2,\ldots,a_n\in F.


A set E\subset V is said to span V if for every \mathbf{A}\in V there exist scalars e_1,e_2,\ldots\in F and vectors \mathbf{E}_1,\mathbf{E}_2,\ldots\in E such that \mathbf{A}=e_1\mathbf{E}_1+e_2\mathbf{E}_2+\ldots. (We leave the question of the finiteness of the number of terms open at this point.)


A set B\subset V is said to be a basis for V if B is linearly independent and B spans V.

If a vector space has a finite basis with n elements, the vector space is said to be n-dimensional.


As an example, consider the vector space \mathbb{R}^3 over the reals. The vectors (1,0,0);(0,1,0);(0,0,1) form one of several possible bases for \mathbb{R}^3. These vectors are often denoted as \hat{i},\hat{j},\hat{k} or as \hat{x},\hat{y},\hat{z}.
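
Numerically, the components of a vector with respect to a basis are found by solving a linear system. The following NumPy sketch uses an arbitrarily chosen basis of \mathbb{R}^3; the rank check confirms linear independence.

```python
import numpy as np

# Columns of M are three (arbitrarily chosen) basis vectors of R^3.
M = np.column_stack([(1.0, 1.0, 0.0),
                     (0.0, 1.0, 1.0),
                     (1.0, 0.0, 1.0)])

print(np.linalg.matrix_rank(M))  # 3: the columns are linearly independent

# Components c of v with respect to this basis: solve M c = v.
v = np.array([2.0, 3.0, 5.0])
c = np.linalg.solve(M, v)
print(np.allclose(M @ c, v))     # True: v = c_1 b_1 + c_2 b_2 + c_3 b_3
```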

Theorem

Let V be a vector space and let B=\{\mathbf{b}_1,\mathbf{b}_2,\ldots,\mathbf{b}_n\} be a basis for V. Then any subset of V with n+1 elements is linearly dependent.

Proof

Let E\subset V with E=\{\mathbf{u}_1,\mathbf{u}_2,\ldots,\mathbf{u}_{n+1}\}

By the definition of a basis, for each i there exist scalars a_{1i},a_{2i},\ldots,a_{ni} such that \mathbf{u}_i=\displaystyle\sum_m a_{mi}\mathbf{b}_m.

Hence we can write \displaystyle\sum_{i=1}^{n+1}c_i\mathbf{u}_i=\mathbf{0} as \displaystyle\sum_{m=1}^n\left(\sum_{i=1}^{n+1}a_{mi}c_i\right)\mathbf{b}_m=\mathbf{0}. Since the \mathbf{b}_m are linearly independent, the coefficient of each \mathbf{b}_m must vanish, that is

\displaystyle\sum_{i=1}^{n+1}a_{1i}c_i=0
\displaystyle\sum_{i=1}^{n+1}a_{2i}c_i=0
\vdots
\displaystyle\sum_{i=1}^{n+1}a_{ni}c_i=0

This is a homogeneous system of n linear equations in the n+1 unknowns c_i, and so it has a nontrivial solution. Hence E is linearly dependent.
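
The theorem is easy to check numerically. In the NumPy sketch below (the four vectors are arbitrary), the columns of A are four vectors in \mathbb{R}^3, and a nontrivial solution c of the homogeneous system is read off from the singular value decomposition.

```python
import numpy as np

# Four arbitrary vectors in R^3 (n = 3), stored as the columns of A.
A = np.array([[1.0, 0.0, 2.0, 1.0],
              [0.0, 1.0, 1.0, 3.0],
              [1.0, 1.0, 0.0, 2.0]])

# Three homogeneous equations in four unknowns: rank(A) <= 3 < 4.
print(np.linalg.matrix_rank(A))  # at most 3

# A nontrivial null-space direction from the SVD: the last right-singular
# vector satisfies A @ c = 0 up to rounding.
_, _, Vt = np.linalg.svd(A)
c = Vt[-1]
print(np.allclose(A @ c, 0))     # True: sum_i c_i u_i = 0 with c nonzero
```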


Inner Product

An in-depth treatment of inner-product spaces will be provided in the chapter on Hilbert Spaces. Here we wish to provide an introduction to the inner product using a basis.

Let V be a vector space over \mathbb{R} and let B=\{\mathbf{b}_1,\mathbf{b}_2,\ldots,\mathbf{b}_n\} be a basis for V. Thus for every member \mathbf{u} of V, we can write \mathbf{u}=\displaystyle\sum_{i=1}^n u_i\mathbf{b}_i. The scalars u_i are called the components of \mathbf{u} with respect to the basis B.

We define the inner product as a binary operation (\cdot):V\times V\to\mathbb{R} given by \mathbf{x}\cdot\mathbf{y}=\displaystyle\sum_i x_iy_i, where x_i,y_i are the components of \mathbf{x},\mathbf{y} with respect to B.

Note here that the inner product so defined is intrinsically dependent on the basis. Unless otherwise mentioned, we will assume the basis \hat{x},\hat{y},\hat{z} while dealing with inner products of ordinary "vectors".
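
A short NumPy illustration of the component inner product (the vectors are arbitrary; np.dot computes exactly the sum \sum_i x_iy_i):

```python
import numpy as np

# Components of x and y with respect to the basis x-hat, y-hat, z-hat.
x = np.array([1.0, 2.0, 3.0])
y = np.array([4.0, -1.0, 0.5])

print(np.dot(x, y))                          # sum_i x_i y_i = 4 - 2 + 1.5 = 3.5
print(sum(xi * yi for xi, yi in zip(x, y)))  # the same sum, written out
```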

Linear Transformations

Let U, V be vector spaces over F. A function T:U\to V is said to be a linear transformation if for all \mathbf{u}_1,\mathbf{u}_2\in U and c\in F

(i)T(\mathbf{u}_1+\mathbf{u}_2)=T(\mathbf{u}_1)+T(\mathbf{u}_2)

(ii)T(c\mathbf{u}_1)=cT(\mathbf{u}_1)

Now let E=\{\mathbf{e}_1,\mathbf{e}_2,\ldots,\mathbf{e}_m\} and G=\{\mathbf{f}_1,\mathbf{f}_2,\ldots,\mathbf{f}_n\} be bases for U,V respectively. (We call the second basis G to avoid confusion with the field F.)

Let \mathbf{e'}_i=T(\mathbf{e}_i). As G is a basis for V, we can write \mathbf{e'}_i=\sum_j a_{ji}\mathbf{f}_j.

Thus, by linearity, if T(\mathbf{u})=\mathbf{v}, we can write the components v_j of \mathbf{v} in terms of those of \mathbf{u} as v_j=\sum_i a_{ji}u_i.

The collection of coefficients a_{ji} is called a matrix, written as

\mathbf{A}=\begin{pmatrix}
a_{11} & a_{12} & a_{13} & \ldots & a_{1m} \\
a_{21} & a_{22} & a_{23} & \ldots & a_{2m} \\
a_{31} & a_{32} & a_{33} & \ldots & a_{3m} \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
a_{n1} & a_{n2} & a_{n3} & \ldots & a_{nm} \\
\end{pmatrix} and we can say that T is represented by the matrix \mathbf{A} with respect to the bases E,G.
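
As a sketch of this construction (the map T below is an arbitrary example of ours, not one from the text), the matrix representing a linear map with respect to the standard bases can be assembled column by column: column i holds the components of T(\mathbf{e}_i).

```python
import numpy as np

# An arbitrary linear map T: R^2 -> R^3, given here by an explicit formula.
def T(u):
    x, y = u
    return np.array([x + y, 2.0 * x, x - 3.0 * y])

# Column i of A contains the components a_{ji} of T(e_i).
e = [np.array([1.0, 0.0]), np.array([0.0, 1.0])]
A = np.column_stack([T(ei) for ei in e])  # a 3 x 2 matrix

u = np.array([2.0, -1.0])
print(np.allclose(A @ u, T(u)))           # True: v_j = sum_i a_{ji} u_i
```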

Eigenvalue Problems

Let V be a vector space over the reals and let T:V\to V be a linear transformation.

Equations of the type T\mathbf{u}=\lambda\mathbf{u}, to be solved for nonzero \mathbf{u}\in V and \lambda\in\mathbb{R}, are called eigenvalue problems. The solutions \lambda are called eigenvalues of T, while the corresponding \mathbf{u} are called eigenvectors or eigenfunctions. (Here we write T\mathbf{u} for T(\mathbf{u}).)
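
For a finite-dimensional illustration (the matrix below is an arbitrary symmetric example), NumPy's np.linalg.eig solves the eigenvalue problem directly:

```python
import numpy as np

# T represented by an arbitrary symmetric 2 x 2 matrix.
T = np.array([[2.0, 1.0],
              [1.0, 2.0]])

eigvals, eigvecs = np.linalg.eig(T)
print(eigvals)  # the eigenvalues 3.0 and 1.0, in some order

# Verify T u = lambda u for each eigenpair (eigenvectors are the columns).
for lam, u in zip(eigvals, eigvecs.T):
    print(np.allclose(T @ u, lam * u))  # True
```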