linear algebra ch 1 and 2 notes

1

Lectures. MTH141: Linear Algebra Mathematics Department. Ryerson University

Instructor: Marcos Escobar ([email protected], Eng213) Textbook: Contemporary Linear Algebra by Howard Anton and Robert C. Busby (Wiley, 2003) Course Website: Blackboard

2

What is linear algebra? Like calculus, it is an area of mathematics. Algebra – sets with binary operations and rules. Linear – additional conditions that make the structure “act like” straight lines. [hence the term “linear”] Recall that calculus started with two main problems or ideas.

1. Tangent line to a curve 2. Area under a curve.

These lead to the study of differential and integral calculus. We begin with the problem of solving linear equations.

3

CHAPTER 1 Vectors in the plane (R2) and three-space (R3). objects with length and

These vectors are the same since they have the same length and direction (equivalent).

direction

Vectors in the plane: Naming them: • (v1,v2 v = (v

) 1,v2)

Each vector in the plane corresponds in a one-to-one way with the points in the plane. Combining vectors: Geometrically and Algebraically Addition u = (u1,u2) v = (v1,v2

u u+v u + v = (u

)

1+v1 , u2+v2) v

4

Subtraction u u-v u - v = (u1-v1 , u2-v2

2u

) v Scalar Multiplication

u ku = k (u1,u2) = (ku1,ku2

) •

-u Vectors in 3-space. (u1,u2,u3

z

) Analogous geometric and algebraic rules.

(u1,u2,u3

y x Vectors from one point to another: See what vector you get when going from the point P=(1,5,-2) to Q=(3,4,5): Answer: QP=(2,-1,7). This vector is the same as performing u-v with u the vector (3,4,5) and v the vector (1,5,-2). Same in two dimensions.

) = u

5

Norm of a Vector and Vector Arithmetic Rules that hold for addition and scalar multiplication: 1. u+v =v+u 2. (u+v)+w = u+(v+w) 3. u+0=0+u=u 4. u+(-u)=0 5. a(bu)=ab(u) 6. a(u+v)=au+av 7. (a+b)u = au+bu 8. 1u = u These are the same as the rules for matrices. (recall we called nx1 and 1xn matrices column and row vectors.) Length of a vector v = (v1,v2) ||v|| = (v1

2 + v22 )1/2 for vectors in the plane

[ ||v|| = (v1

2 + v22 + v3

2 )1/2 for vectors in the 3-space ] Do problems 6 and 9 in class. 6. Find the length of (1/||v||) v [use the definition of length] Discuss the idea that normalizing v. 9. Use knowledge about triangles to show ||u+v|| ≤ ||u|| + ||v|| [look at the drawing we had of u+v earlier.

6

Dot Product (scalar product) (1.2) Geometrically u θ v u • v = ||u|| ||v|| cos θ if u ≠ 0 and v ≠ 0 0 u = 0 or v = 0 Algebraically: u • v = u1v1 + u2v2 in the plane u • v = u1v1 + u2v2 + u3v3

d) v • v > 0 if u ≠ 0, and v • v = 0 if v = 0.

in 3-space Observations If u ≠ 0 and v ≠ 0 then u ⊥ v if and only if u • v = 0. cos θ = u • v / ||u|| ||v|| OTHER PROPERTIES OF THE DOT PRODUCT a) u • v = v • u b) u • (v + w) = u • v + u • w c) k(u • v) = (ku) • v = u •(kv)

7

Orthogonal Projection u θ

proj

a

a u = u•a a ||a||

why? ||proj

2

( if θ>0 )

a u|| = cos θ = u•a ( solve for ||proja ||u|| ||u||||a|| What is the direction of ||proj

u|| )

a

proj

u|| ? Conclude

a u = u•a a ||a|| ||a||

8

EUCLIDEAN n-SPACE In previous slides we talked about vectors in 2 and 3 dimensional space. (a1,a2,a3) is point or vector in 3-space. This idea generalizes to higher dimensions. This is important since most applications in areas such as physics, economics, and statistics involve problems with more than three variables. Definition: If n is an positive integer and ai's are real numbers, then (a1,a2, . . . ,an) is an ordered n-tuple. The set of all possible n-tuples is called n-space (n dimensional space) or Rn

Suppose u = (u

. An n-tuple is called a point or vector in n-space. ALGEBRAIC DEFINITIONS AND RULES:

1,u2, . . . ,un) v = (v1,v2, . . . ,vn). We define u + v = u = (u1+v1, u2+v2, . . . , un+vn) ku = u = (ku1,ku2, . . . ,kun) 0 = (0,0, . . . ,0) It follows that -u = (-u1,-u2, . . . ,-un) u - v = u = (u1-v1, u2-v2, . . . , un-vn

9. u+v =v+u

) Theorem:

10.(u+v)+w = u+(v+w) 11.u+0=0+u=u 12.u+(-u)=0 13.a(bu)=ab(u) 14.a(u+v)=au+av

9

15.(a+b)u = au+bu 16.1u = u Generalized Dot Product - Euclidean Inner Product n

u • v = u1v1 + u2v2 + . . . + unvn = ∑ uivi i = 1 (5,4,1,2,3) • (-2,3,1,4,1) = -10 + 12 + 1 + 8 + 3 = 14 PROPERTIES OF DOT PRODUCT a) u • v = v • u b) (u +v) • w = u • w + v • w c) k(u • v) = (ku) • v = u •(kv) d) v • v > 0 if u ≠ 0, and v • v = 0 if v = 0. Proof of b) and d). (u+v)• w = ∑ (ui +vi ) wi = ∑ (ui wi + vi wi) = ∑ ui wi + ∑vi wi = u • w + v • w v • v = ∑ vivi = ∑ vi

2 > 0 v • v = 0 iff ∑ vi

2 = 0 iff vi = 0 for each i. Euclidean Norm (length) ||u|| = (u • u)1/2 = (u1

2 +u22 +. . . + un

2 )1/2 Distance d(u,v) = ||u-v|| = ((u1

-v1)2 +(u2 -v2)2. . . + (un

-vn)2 )1/2 Cauchy Schwarz Inequality (for Rn) If u and v are in Rn then |u • v| ≤ ||u|| ||v||

10

Proof: (Special Cases: 2 and 3 dimension.) |u • v| = | ||u|| ||v|| cos θ | ≤ ||u|| ||v|| (since |cos θ| ≤ 1) FACTS ABOUT LENGTH AND DISTANCE Length a) ||u|| ≥ 0 b) ||u|| = 0 iff u = 0 c) ||ku|| = |k| ||u|| d) ||u +v|| ≤ ||u|| + ||v|| Proof of d. (using the Cauchy-Schwarz inequality ||u +v||2 = (u +v)•(u +v) = u • u + 2u•v + v • v = ||u ||2 + 2u•v + ||v ||2 ≤ ||u ||2 + 2 ||u || || v|| + ||v ||2 = (||u|| + ||v||)2 Distance a) d(u,v) ≥ 0 b) d(u,v) = 0 iff u=v c) d(u,v) = d(v,u) d) d(u,v) ≤ d(u,w) + d(w,v) Proof of d. d(u,v) = ||u -v|| = ||u-w + w-v|| ≤ ||u -w|| + ||w -v|| = d(u,w) + d(w,v) Fact: u•v = 1/4 ||u +v||2 - 1/4 ||u -v||2 Proof: (see work done earlier in class to get the first equation) ||u +v||2 = ||u ||2 + 2u•v + ||v ||2

||u - v||2 = ||u ||2 - 2u•v + ||v ||2 Subtract second from first and solve for u•v.

11

ORTHOGONAL (generalization of perpendicular to n-space) The vectors u and v are called orthogonal iff u•v = 0. Theorem: u and v are orthogonal iff ||u +v||2 = ||u||2 + ||v||2 Proof: ||u +v||2 = (u +v)•(u +v) = u • u + 2u•v + v • v = ||u ||2 + 2u•v + ||v ||2 = ||u ||2 + 0 + ||v ||2 = ||u ||2 + ||v ||2

u

NOTATION FOR VECTORS IN n-SPACE

1 u = u2 (column vector) or u = (u1, u2, . . . , un) (row vector) . un Using column vectors then one gets the following u•v = vTu (do an example)

12

Lines and Planes (1.3) Lines in 2-space. Standard Equation of a line in R2: Ax+By+C=0. Where –C/B is the interception with the y-axis and A/B is the slope. Alternative expression (parametric equation): Assume a point on the line (x0,y0) and a vector v=(a,b) with the direction of the line are given then the equation of the line in vector notation is: P0P=tv or (x,y)= (x0,y0)+tv where P is any point in the line and t is a scalar called the parameter. Note the parametric equation implies (x- x0)/a=(y- y0)/b <=> (1/a) x + (-1/b) y + (y0/b- x0/a)=0 so we have a relationship between A,B,C and a,b,x0,y0. The parametric approach (vector notation) is beneficial because it is intuitive and it can be extended to n-dimensions. Lines and Planes in 3-space Straight lines in 3-space Given a point in the line (x0, y0, z0) and a vector v=(a,b,c) in the direction of the line then the parametric equation of the line in vector notation is: (x,y,z)=(x0,y0,z0)+t v.

13

x = x0 + at y = y0 + bt Line parallel to vector v=(a,b,c) through (x0 , y0 , z0 ) z = z0 + ct or (x - x0)/a = (y - y0)/b =(z- z0)/c Planes in 3-space. Equation of a plane in vector notation: n•P0P=0 where P0 is a point in the plane. P is any other point in the plane and n is a normal vector. (a,b,c)• (x-x0,y-y0,z-z0)=0 <=> a(x-x0)+b(y-y0)+c(z-z0

n

)=0 <=> ax+by+cz+d=0

P P0 . normal vector (a,b,c)=n If d=0 then it goes through the origin (0,0,0) Recall n•P0P=0, therefore nP0=0 if (0,0,0) in on the plane. Parametric equation of a Plane in vector notation: Assume a point P0 and two vectors v1 , v2 (both in the direction of the plane) are given, then the parametric equation is:

14

P0P=t1v1+t2v2. Where P is any point in the plane and t1 and t2 are the parameters. Examples: 1 – Equation of a Plane passing through the point P1=(1,2,-1) with normal n=(3,2,-5). 2 – Equation of a Plane passing through the point P1=(1,2,-1), P2=(2,3,1), P3=(3,-1,2). 3 - Equation of a line passing through the points P1=(2,4,-1) and P2=(5,0,7).

15

Chapter 2: Linear Equations (2.1) Variables x1 , x2 , x3 , ... , xn Constants a1 , a2 , a3 , ... , an , b Linerar equation a1 x1 + a2x2 + a3x3 + ... + anxn = b n ∑ aixi = b i=1 Remark: xi has to appear linear, i.e. xi

2 or sin(xi) are not allowed Definition: s1 , s2 , s3 , ... , sn is called a solution of the linear equation if a1 s1 + a2s2 + a3s3 + ... + ansn = b Example: 1, 1 is a solution to 3x+2y = 5 (or 3x1 + 2x2 = 5) Solving: x = 5/3 - 2/3 y Parametric form of the solution (general solution) x = 5/3 - 2/3 t y = t specific solutions (t=1) x =1 y = 1 (t=2) x = 1/3 y = 2

16

Example 2 x1 - 5x2 + x3 - 4x4 = 6 (or x1 = 3 + 5/2 x2 - x3/2 + 2x4 ) Solution x1 = 3 + 5/2 s - t/2 + 2u x2 = s x3 = t x4

= u System of Linear equations - finite set of linear equations A solution to the system must satisfy all the equations. A solution is the intersection of the solution sets of each equation. Example: 2 equations in 2 unknowns. Solution is the intersection of 2 straight lines in the plane

Consistent (one solution) Consistent (infinite solutions) Inconsistent (no solutions)

17

General System of Linear Equations a11 x1 + a12x2 + a13x3 + ... + a1nxn = b1 a21 x1 + a22x2 + a23x3 + ... + a2nxn = b2 . . am1 x1 + am2x2 + am3x3 + ... + amnxn = bm aij ith row jth

a

column Augmented Matrix (shorthand)

11 a12 a13 ... a1n b1 a21 a22 a23 ... a2n b2 am1 am2 am3 ... amn b

What is the augmented matrix for the following? x

m

1 - 2x2 + 3x3 = 5 2 x1 + x2 - 7x3 = 6 To solve systems of equations we manipulate these matrices.

18

Elementary Row Operations Three operations can be used to change equations (augmented matrices) and not change the solution set. 1) Multiply a equation (row) by a non-zero constant 2) Interchange 2 equations (rows). 3) Add a multiple of an equation (row) to another. Example: (Gauss Jordan Reduction) x1 + 2x2 = 3 3 x1 - x2

1 2 3

= -5

3 -1 -5 -3I+II 1 2 3 0 -7 -14 -1/7 II 1 2 3 0 1 2 -2II + I 1 0 -1 0 1 2 Solution x1 = -1 x2 = 2

19

Gaussian Elimination (2.2) A matrix is in reduced row-echelon form (RREF) if it has the following properties: 1. If any row is not all zeros then the first non-zero number is a 1.

2. All rows that are entirely zero are at the bottom of the matrix.

3. The leading 1 in any row is farther to the right than the leading

one in any row above it.

4. Each column with a leading 1 has zero everywhere else.

0 1 2 0 0 5 0 0 0 1 0 6 0 0 0 0 1 8 0 0 0 0 0 0

A matrix is in row-echelon form (REF) if it satisfies 1-3 above.

0 1 2 6 0 5 0 0 0 1 2 6 0 0 0 0 1 8 0 0 0 0 0 0

Remark: Gauss elimination takes a matrix to its REF while a Gauss-Jordan takes a matrix to its RREF.

20

Solving RREF matrices

1 2 0 0 3 0 4 0 0 1 0 1 0 5 0 0 0 1 2 0 6 0 0 0 0 0 1 2

x1 = 4 - 2x2 - 3x5 x3 = 5 - x5 x4 = 6 - 2x5 x6 = 2 x1 = 4 - 2s - 3t x2 = s x3 = 5 - t x4 = 6 - 2t x5 = t x6 = 2 Solving a matrix in REF. 1 2 4 5 -3 6 0 0 1 -1 2 5 0 0 0 0 1 3 x1 = 6 - 2x2 - 4x3 - 5x4 + 3x5 x3 = 5 + x4 - 2x5 x5 = 3 x1 = 6 - 2s - 4(t-1) + 5t + 3(3) = 19 - 2s + t x2 = s x3 = 5 + t - 2(3) = t-1 x4 = t

21

x5 = 3

22

Example of Gauss-Jordan Reduction (Process that leads to a RREF)

0 0 -2 0 7 6 3 6 0 9 18 12 2 4 5 6 -5 -2

switch I and II 3 6 0 9 18 12 0 0 -2 0 7 6 2 4 5 6 -5 -2

1/3 I 1 2 0 3 6 4 0 0 -2 0 7 6 2 4 5 6 -5 -2

-2I + III 1 2 0 3 6 4 0 0 -2 0 7 6 0 0 5 0 -17 -10

-1/2 II 1 2 0 3 6 4 0 0 1 0 -7/2 -3 0 0 5 0 -17 -10

-5II + III 1 2 0 3 6 4 0 0 1 0 -7/2 -3 0 0 0 0 1/2 5

2 III 1 2 0 3 6 4 0 0 1 0 -7/2 -3 0 0 0 0 1 10

-6 III + I 1 2 0 3 0 -56 0 0 1 0 -7/2 -3 0 0 0 0 1 10

7/2 III + II 1 2 0 3 0 -56 0 0 1 0 0 32 0 0 0 0 1 10

23

x1 = -56 - 2s - 3t x2 = s x3 = 32 x4 = t x5 = 10

24

Homogeneous Systems of Linear Equations a11 x1 + a12x2 + a13x3 + ... + a1nxn = 0 a21 x1 + a22x2 + a23x3 + ... + a2nxn = 0 . . am1 x1 + am2x2 + am3x3 + ... + amnxn

• A homogeneous systems stays homogeneous after applying an elementary row operation.

= 0

• A homogeneous system is always consistent (it has one or more solutions). All xi

This is called the = 0 works. trivial solution

• Either there is only the trivial solution or there are infinitely many solutions.

.

• Is there a way to tell if there are infinitely many solutions without solving.

Theorem: A homogeneous system of linear equations with more unknowns than equations has infinitely many solutions. [n>m] idea of proof: After applying Gauss-Jordan method, since there are more variables than equations, not all the variables are leading 1's so some will be free to vary (will be a parameter). 0 1 0 2 0 -2 0 x2 = -2x4 + 2x6 0 0 1 3 0 1 0 x3 = -3x4 - 2x6 0 0 0 0 1 0 0 x5 = 0 x1 = s x2 = -2t + 2u x3 = -3t - u x4 = t

25

x5 = 0 x6 = u

linear algebra ch 1 and 2 notes

Documents