Matrix Algebra - Review
A matrix is a rectangular array of elements arranged in rows and columns. We usually use uppercase boldface letters (e.g. A, B, …) to denote matrices. The element in the ith row and jth column of the matrix A is denoted by aij.
We sometimes use the square bracket notation [aij] to denote a matrix. That is, the matrix A with r rows and c columns may be equivalently represented thus:
A ≡ [aij]; i=1,…,r; j=1,…,c.
A matrix with r rows and c columns is said to have dimension r x c.
A matrix of dimension r x c is said to be square if r=c.
A matrix of dimension r x c is referred to as a column vector (or simply a vector) if c=1.
A matrix of dimension r x c is referred to as a row vector if r=1.
The transpose of a matrix A is another matrix, denoted AT, that is obtained by interchanging the rows and columns of A.
Note that the transpose of a column vector is therefore a row vector and vice versa.
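To make dimension and transposition concrete, here is a minimal NumPy sketch; the matrix values are arbitrary examples, not taken from the notes:

```python
import numpy as np

# A 2 x 3 matrix (2 rows, 3 columns); the values are arbitrary
A = np.array([[1, 2, 3],
              [4, 5, 6]])

print(A.shape)    # (2, 3), i.e. dimension r x c
print(A.T)        # the transpose AT, a 3 x 2 matrix
print(A.T.shape)  # (3, 2)

# The transpose of a column vector is a row vector, and vice versa
v = np.array([[1], [2], [3]])   # 3 x 1 column vector
print(v.T)                      # 1 x 3 row vector
```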
Let A and B be matrices of the same dimension, then A = B ⇒ aij = bij ∀ i, j.
Let A and B be matrices of the same dimension. If C is the sum or difference of A and B, then C is another matrix of the same dimension as A and B:
C = A + B = [aij + bij]; i=1,…,r; j=1,…,c.
C = A - B = [aij - bij]; i=1,…,r; j=1,…,c.
Let A = [aij]; i=1,…,r; j=1,…,c, and let λ denote some scalar; then λA = [λaij] ∀ i, j.
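A short NumPy sketch of element-wise addition, subtraction, and scalar multiplication; the matrices below are arbitrary examples:

```python
import numpy as np

# Arbitrary example matrices of the same dimension (2 x 2)
A = np.array([[1, 2],
              [3, 4]])
B = np.array([[5, 6],
              [7, 8]])

print(A + B)   # element-wise sum [aij + bij]
print(A - B)   # element-wise difference [aij - bij]
print(3 * A)   # scalar multiple [3 * aij]
```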
Let A be a matrix of dimension r x c and B be a matrix of dimension c x s. The product AB is a matrix of dimension r x s. Let C be the matrix that represents this product; then the element in the ith row and jth column of C is given by Σ aikbkj; k=1,…,c.
In general, the product AB is only defined when the number of columns of A equals the number of rows of B.
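The following NumPy sketch checks the dimension requirement and the row-by-column rule; the example matrices are made up for illustration:

```python
import numpy as np

A = np.array([[1, 2, 3],
              [4, 5, 6]])   # dimension 2 x 3
B = np.array([[1, 0],
              [0, 1],
              [2, 2]])      # dimension 3 x 2

# AB is defined because the number of columns of A (3) equals the number of rows of B (3)
C = A @ B
print(C.shape)   # (2, 2)

# The (1,1) element of C is the sum over k of a1k * bk1
print(sum(A[0, k] * B[k, 0] for k in range(3)))   # equals C[0, 0]
print(C[0, 0])
```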
If A = AT then A is said to be symmetric. Symmetric matrices are square.
A diagonal matrix is a square matrix whose off-diagonal elements are all zeros.
An identity matrix, denoted I, is a diagonal matrix whose diagonal elements are all ones.
Note: Let A be a square matrix of dimension r x r then AI = IA = A.
A scalar matrix is a diagonal matrix whose diagonal elements are all the same. A scalar matrix can be expressed as λI where λ is the scalar.
A unity matrix is a matrix where aij=1 ∀ i, j. If the matrix is a column vector then it is denoted by 1 and if it is a square matrix then it is denoted by J.
Note that for the n x 1 unity vector, 1T1 = n and 11T=J.
A zero vector is a vector where aij=0 ∀ i, j.
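This NumPy sketch illustrates the identity, scalar, and unity matrices and checks the note about the n x 1 unity vector, here with n = 3 chosen arbitrarily:

```python
import numpy as np

n = 3
I = np.eye(n)              # identity matrix: ones on the diagonal
S = 5 * np.eye(n)          # scalar matrix 5I
J = np.ones((n, n))        # n x n unity matrix of all ones
one = np.ones((n, 1))      # n x 1 unity vector
print(S)

A = np.array([[1., 2., 3.],
              [4., 5., 6.],
              [7., 8., 9.]])
print(np.allclose(A @ I, A), np.allclose(I @ A, A))   # AI = IA = A

print(one.T @ one)                   # 1T1 = n (a 1 x 1 matrix holding n)
print(np.allclose(one @ one.T, J))   # 11T = J
```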
To illustrate the idea of linear independence, consider some matrix A of dimension 3 x 4. Let us think of the columns of this matrix (i.e. C1, C2, C3, C4) as vectors. Now let us say that C1T= [1 2 3], C2T= [2 2 4], C3T= [5 10 15], C4T= [1 6 1]. Notice that C3T= 5*C1T.
We say that the columns of A are linearly dependent since one of the columns can be obtained as a linear combination of the others.
In general, let C1, …, Cc be the c column vectors of a matrix of dimension r x c. If there exist c scalars λ1, …, λc, not all zero, such that λ1C1 + λ2C2 + … + λcCc = 0, where 0 denotes the zero vector, then the c column vectors are linearly dependent. If the only set of scalars for which the equality holds is λ1 = λ2 = … = λc = 0, then the set of c columns is linearly independent. In the above example, taking λ1 = 5, λ2 = 0, λ3 = -1, λ4 = 0 gives:
λ1C1 + λ2C2 + λ3C3 + λ4C4 = 0
Notice that λj = 0 for j = 2, 4. For linear dependence, it is only required that not all λj = 0.
The rank of a matrix is defined as the maximum number of linearly independent columns in the matrix. Note that for the example above, the rank of A is clearly not 4 since we have shown that one column may be obtained from the others. As it turns out, the rank of A is 3 since columns C1, C2, and C4 are linearly independent.
Note: The rank of a matrix is unique and can equivalently be defined as the maximum number of linearly independent rows. It follows that the rank of an r x c matrix cannot exceed min(r, c).
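A quick NumPy check of the rank claim for the example matrix above, whose third column is 5 times its first:

```python
import numpy as np

# The 3 x 4 matrix from the example, with columns C1, C2, C3, C4
A = np.array([[1, 2, 5, 1],
              [2, 2, 10, 6],
              [3, 4, 15, 1]])

print(np.linalg.matrix_rank(A))   # 3, not 4

# Verify the dependence relation 5*C1 + 0*C2 - 1*C3 + 0*C4 = 0
lam = np.array([5, 0, -1, 0])
print(A @ lam)                    # the zero vector
```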
The inverse of a matrix A is another matrix, denoted A-1, such that:
A-1A = A A-1 = I
Note: The inverse of a matrix is defined for square matrices only. However, many square matrices do not have an inverse. The inverse of a square matrix of dimension r x r exists iff the rank of the matrix is r. Such a matrix is said to be nonsingular, whereas an r x r matrix with rank < r is said to be singular. If a matrix has an inverse, then that inverse is unique.
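A hedged NumPy sketch of inverting a nonsingular matrix and detecting a singular one; the matrices are made-up examples:

```python
import numpy as np

A = np.array([[2., 1.],
              [1., 1.]])               # 2 x 2 with rank 2, so nonsingular

A_inv = np.linalg.inv(A)
print(np.allclose(A_inv @ A, np.eye(2)))   # A-1 A = I
print(np.allclose(A @ A_inv, np.eye(2)))   # A A-1 = I

B = np.array([[1., 2.],
              [2., 4.]])               # rank 1 (second row is twice the first)
try:
    np.linalg.inv(B)
except np.linalg.LinAlgError:
    print("B is singular; it has no inverse")
```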
The determinant of the kxk matrix A, denoted by |A|, is the scalar defined by cofactor expansion along the first row:
|A| = a11 if k = 1, and
|A| = Σ (-1)^(1+j) a1j|A1j|; j=1,…,k, if k > 1,
where A1j is the (k-1)x(k-1) matrix obtained by deleting the 1st row and jth column of A.
We may now provide an expression for A-1. In general, A-1 has (j,i)th entry (-1)^(i+j) |Aij| / |A|, where Aij is the (k-1)x(k-1) matrix obtained by deleting the ith row and jth column of the kxk matrix A.
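To connect the cofactor expansion to code, here is a small recursive sketch in NumPy; det is a hypothetical helper written for illustration, not a library routine:

```python
import numpy as np

def det(A):
    # Determinant of a k x k matrix by cofactor expansion along the first row
    k = A.shape[0]
    if k == 1:
        return A[0, 0]
    total = 0.0
    for j in range(k):
        # A1j: the (k-1) x (k-1) matrix obtained by deleting row 1 and column j+1 of A
        minor = np.delete(np.delete(A, 0, axis=0), j, axis=1)
        total += (-1) ** j * A[0, j] * det(minor)
    return total

A = np.array([[1., 2., 1.],
              [2., 2., 6.],
              [3., 4., 1.]])
print(det(A))             # 12.0
print(np.linalg.det(A))   # agrees, up to floating point
```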
Let A be a kxk matrix. The trace of A, denoted tr(A), is the sum of the diagonal elements of A. That is, tr(A) = Σ aii; i=1,…,k.
If A and B are kxk matrices, then tr(A + B) = tr(A) + tr(B) and tr(AB) = tr(BA). Note also that tr(AAT) = Σ Σ aij²; i=1,…,k; j=1,…,k (the sum of the squared elements of A).
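A brief NumPy check of these trace facts, using arbitrary example matrices:

```python
import numpy as np

A = np.array([[1., 2.],
              [3., 4.]])
B = np.array([[0., 1.],
              [5., 2.]])

print(np.trace(A))                                      # 1 + 4 = 5
print(np.isclose(np.trace(A + B), np.trace(A) + np.trace(B)))
print(np.isclose(np.trace(A @ B), np.trace(B @ A)))
print(np.isclose(np.trace(A @ A.T), np.sum(A ** 2)))    # sum of squared entries
```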
A square matrix A is said to be orthogonal if its rows, considered as vectors, are mutually perpendicular and have unit lengths.
Note that this means AAT = I (the rows of A are orthonormal).
Also, A is orthogonal iff AT = A-1, or equivalently ATA = AAT = I.
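A rotation matrix is a standard example of an orthogonal matrix; the following NumPy sketch checks the properties above for an arbitrarily chosen angle:

```python
import numpy as np

theta = np.pi / 6
A = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])   # 2 x 2 rotation matrix

print(np.allclose(A @ A.T, np.eye(2)))        # A AT = I
print(np.allclose(A.T, np.linalg.inv(A)))     # AT = A-1
print(np.linalg.norm(A, axis=1))              # each row has unit length
```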
Let A, B, C be matrices of appropriate dimension. Then the following basic rules of matrix algebra hold: A + B = B + A; (A + B) + C = A + (B + C); (AB)C = A(BC); C(A + B) = CA + CB; λ(A + B) = λA + λB; (AT)T = A; (A + B)T = AT + BT; (AB)T = BTAT; and, when the inverses exist, (AB)-1 = B-1A-1 and (AT)-1 = (A-1)T.
Recall that the Simple Linear Regression model states that, given a set of n linearly related pairs (xi, yi), we may express the yi’s in terms of the xi’s thus:
yi = β0 + β1xi + εi; i=1,…,n
where, for a particular value of xi, the corresponding error term εi is assumed to have mean 0 and constant variance σ2, and the εi are assumed to be uncorrelated.
Let Y be the vector of the n yi values. Let X be an n x 2 matrix, where the first column of X is a column of n 1's and the second column of X is a column of the n xi values. Let β be the vector of coefficients (i.e. β0, β1). Let ε be the vector of the n error terms (i.e. εi). The SLR model may be expressed in matrix terms thus:
Y = Xβ + ε
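The NumPy sketch below shows how the matrix form reproduces the scalar model; the x values, coefficients, and error terms are made up purely for illustration:

```python
import numpy as np

# Hypothetical data and parameters with n = 4
x = np.array([1., 2., 3., 4.])
n = len(x)

beta = np.array([[0.5],                            # beta_0
                 [2.0]])                           # beta_1
eps = np.array([[0.1], [-0.2], [0.05], [0.0]])     # made-up error terms

# Design matrix X: a column of n 1's and a column of the n xi values
X = np.column_stack([np.ones(n), x])
Y = X @ beta + eps                                 # the matrix form Y = X beta + eps

# Each row of Y reproduces the scalar model yi = beta0 + beta1*xi + eps_i
print(Y.ravel())
print(0.5 + 2.0 * x + eps.ravel())
```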