
Linear Algebra

Jesse Perla, Thomas J. Sargent and John Stachurski

May 20, 2020

1 Contents

• Overview 2
• Vectors 3
• Matrices 4
• Solving Systems of Equations 5
• Eigenvalues and Eigenvectors 6
• Further Topics 7
• Exercises 8
• Solutions 9

2 Overview

Linear algebra is one of the most useful branches of applied mathematics for economists to invest in.

For example, many applied problems in economics and finance require the solution of a linear system of equations, such as

y₁ = a x₁ + b x₂
y₂ = c x₁ + d x₂

or, more generally,

y₁ = a₁₁x₁ + a₁₂x₂ + ⋯ + a₁ₖxₖ
⋮
yₙ = aₙ₁x₁ + aₙ₂x₂ + ⋯ + aₙₖxₖ    (1)

The objective here is to solve for the “unknowns” x₁, …, xₖ given a₁₁, …, aₙₖ and y₁, …, yₙ.

When considering such problems, it is essential that we first consider at least some of the following questions

• Does a solution actually exist?
• Are there in fact many solutions, and if so how should we interpret them?
• If no solution exists, is there a best “approximate” solution?


• If a solution exists, how should we compute it?

These are the kinds of topics addressed by linear algebra.

In this lecture we will cover the basics of linear and matrix algebra, treating both theory and computation.

We admit some overlap with this lecture, where operations on Julia arrays were first explained.

Note that this lecture is more theoretical than most, and contains background material that will be used in applications as we go along.

3 Vectors

A vector is an element of a vector space.

Vectors can be added together and scaled (multiplied) by scalars.

Vectors can be written as x = [x₁, …, xₙ].

The set of all n-vectors is denoted by ℝⁿ.

For example, ℝ² is the plane, and a vector in ℝ² is just a point in the plane.

Traditionally, vectors are represented visually as arrows from the origin to the point.

The following figure represents three vectors in this manner.

3.1 Setup

In [1]: using InstantiateFromURL
        # optionally add arguments to force installation:
        # instantiate = true, precompile = true
        github_project("QuantEcon/quantecon-notebooks-julia", version = "0.7.0")

In [2]: using LinearAlgebra, Statistics, Plots
        gr(fmt = :png);

In [3]: x_vals = [0 0 0 ; 2 -3 -4]
        y_vals = [0 0 0 ; 4 3 -3.5]

        plot(x_vals, y_vals, arrow = true, color = :blue,
             legend = :none, xlims = (-5, 5), ylims = (-5, 5),
             annotations = [(2.2, 4.4, "[2, 4]"),
                            (-3.3, 3.3, "[-3, 3]"),
                            (-4.4, -3.85, "[-4, -3.5]")],
             xticks = -5:1:5, yticks = -5:1:5,
             framestyle = :origin)

Out[3]:


3.2 Vector Operations

The two most common operators for vectors are addition and scalar multiplication, which we now describe.

As a matter of definition, when we add two vectors, we add them element by element

๐‘ฅ + ๐‘ฆ =โŽกโŽขโŽขโŽฃ

๐‘ฅ1๐‘ฅ2โ‹ฎ

๐‘ฅ๐‘›

โŽคโŽฅโŽฅโŽฆ

+โŽกโŽขโŽขโŽฃ

๐‘ฆ1๐‘ฆ2โ‹ฎ

๐‘ฆ๐‘›

โŽคโŽฅโŽฅโŽฆ

โˆถ=โŽกโŽขโŽขโŽฃ

๐‘ฅ1 + ๐‘ฆ1๐‘ฅ2 + ๐‘ฆ2

โ‹ฎ๐‘ฅ๐‘› + ๐‘ฆ๐‘›

โŽคโŽฅโŽฅโŽฆ

Scalar multiplication is an operation that takes a number γ and a vector x and produces

γx := ⎡ γx₁ ⎤
      ⎢ γx₂ ⎥
      ⎢  ⋮  ⎥
      ⎣ γxₙ ⎦

Scalar multiplication is illustrated in the next figure

In [4]: # illustrate scalar multiplication
        x = [2]
        scalars = [-2 1 2]
        vals = [0 0 0; x * scalars]
        labels = [(-3.6, -4.2, "-2x"), (2.4, 1.8, "x"), (4.4, 3.8, "2x")]

        plot(vals, vals, arrow = true, color = [:red :red :blue],
             legend = :none, xlims = (-5, 5), ylims = (-5, 5),
             annotations = labels, xticks = -5:1:5, yticks = -5:1:5,
             framestyle = :origin)

Out[4]:

In Julia, a vector can be represented as a one dimensional Array.

Julia Arrays allow us to express scalar multiplication and addition with a very natural syntax

In [5]: x = ones(3)

Out[5]: 3-element Array{Float64,1}:
         1.0
         1.0
         1.0

In [6]: y = [2, 4, 6]

Out[6]: 3-element Array{Int64,1}:
         2
         4
         6


In [7]: x + y

Out[7]: 3-element Array{Float64,1}:
         3.0
         5.0
         7.0

In [8]: 4x # equivalent to 4 * x and 4 .* x

Out[8]: 3-element Array{Float64,1}:
         4.0
         4.0
         4.0

3.3 Inner Product and Norm

The inner product of vectors x, y ∈ ℝⁿ is defined as

x′y := ∑ᵢ₌₁ⁿ xᵢyᵢ

Two vectors are called orthogonal if their inner product is zero.

The norm of a vector x represents its “length” (i.e., its distance from the zero vector) and is defined as

‖x‖ := √(x′x) = ( ∑ᵢ₌₁ⁿ xᵢ² )^(1/2)

The expression ‖x − y‖ is thought of as the distance between x and y.

Continuing on from the previous example, the inner product and norm can be computed as follows

In [9]: using LinearAlgebra

In [10]: dot(x, y) # Inner product of x and y

Out[10]: 12.0

In [11]: sum(prod, zip(x, y)) # Gives the same result

Out[11]: 12.0

In [12]: norm(x) # Norm of x

Out[12]: 1.7320508075688772

In [13]: sqrt(sum(abs2, x)) # Gives the same result

Out[13]: 1.7320508075688772
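The distance ‖x − y‖ is then just the norm of the difference. A one-line sketch continuing the example above:

```julia
using LinearAlgebra

x = ones(3)
y = [2, 4, 6]

d = norm(x - y)   # x - y = [-1, -3, -5], so d = sqrt(1 + 9 + 25) = sqrt(35)
```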


3.4 Span

Given a set of vectors A := {a₁, …, aₖ} in ℝⁿ, it's natural to think about the new vectors we can create by performing linear operations.

New vectors created in this manner are called linear combinations of A.

In particular, y ∈ ℝⁿ is a linear combination of A := {a₁, …, aₖ} if

y = β₁a₁ + ⋯ + βₖaₖ for some scalars β₁, …, βₖ

In this context, the values β₁, …, βₖ are called the coefficients of the linear combination.

The set of linear combinations of A is called the span of A.

The next figure shows the span of A = {a₁, a₂} in ℝ³.

The span is a two-dimensional plane passing through these two points and the origin.

In [14]: # fixed linear function, to generate a plane
         f(x, y) = 0.2x + 0.1y

         # lines to vectors
         x_vec = [0 0; 3 3]
         y_vec = [0 0; 4 -4]
         z_vec = [0 0; f(3, 4) f(3, -4)]

         # draw the plane
         n = 20
         grid = range(-5, 5, length = n)
         z2 = [f(grid[row], grid[col]) for row in 1:n, col in 1:n]
         wireframe(grid, grid, z2, fill = :blues, gridalpha = 1)
         plot!(x_vec, y_vec, z_vec, color = [:blue :green], linewidth = 3,
               labels = "", colorbar = false)

Out[14]:
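Although the lecture's cells don't do this, one convenient way to test numerically whether a given vector lies in the span of the two columns is to compare matrix ranks. In the sketch below, `in_span` is our own hypothetical helper and the vectors are the ones drawn in the figure:

```julia
using LinearAlgebra

f(x, y) = 0.2x + 0.1y          # the plane from the figure
a1 = [3.0, 4.0, f(3, 4)]
a2 = [3.0, -4.0, f(3, -4)]
A = [a1 a2]

# y lies in the span of {a1, a2} iff appending it leaves the rank unchanged
in_span(y) = rank([A y]) == rank(A)

in_span(2a1 - 0.5a2)           # a linear combination, hence true
in_span([1.0, 1.0, 1.0])       # off the plane, hence false
```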

3.4.1 Examples

If ๐ด contains only one vector ๐‘Ž1 โˆˆ โ„2, then its span is just the scalar multiples of ๐‘Ž1, which isthe unique line passing through both ๐‘Ž1 and the origin.If ๐ด = {๐‘’1, ๐‘’2, ๐‘’3} consists of the canonical basis vectors of โ„3, that is

๐‘’1 โˆถ= โŽกโŽขโŽฃ

100

โŽคโŽฅโŽฆ

, ๐‘’2 โˆถ= โŽกโŽขโŽฃ

010

โŽคโŽฅโŽฆ

, ๐‘’3 โˆถ= โŽกโŽขโŽฃ

001

โŽคโŽฅโŽฆ

then the span of ๐ด is all of โ„3, because, for any ๐‘ฅ = (๐‘ฅ1, ๐‘ฅ2, ๐‘ฅ3) โˆˆ โ„3, we can write

๐‘ฅ = ๐‘ฅ1๐‘’1 + ๐‘ฅ2๐‘’2 + ๐‘ฅ3๐‘’3

Now consider ๐ด0 = {๐‘’1, ๐‘’2, ๐‘’1 + ๐‘’2}.If ๐‘ฆ = (๐‘ฆ1, ๐‘ฆ2, ๐‘ฆ3) is any linear combination of these vectors, then ๐‘ฆ3 = 0 (check it).Hence ๐ด0 fails to span all of โ„3.
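Both claims are easy to confirm numerically; a minimal check (using the rank of the matrix whose columns are the vectors in A₀):

```julia
using LinearAlgebra

e1, e2 = [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]
A0 = [e1 e2 (e1 + e2)]      # columns are e1, e2 and e1 + e2

r = rank(A0)                # 2, not 3: the span is a proper subspace of ℝ³
y = A0 * [2.0, -1.0, 5.0]   # an arbitrary linear combination
y[3]                        # third coordinate is zero, as claimed
```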


3.5 Linear Independence

As we'll see, it's often desirable to find families of vectors with relatively large span, so that many vectors can be described by linear operators on a few vectors.

The condition we need for a set of vectors to have a large span is what's called linear independence.

In particular, a collection of vectors A := {a₁, …, aₖ} in ℝⁿ is said to be

• linearly dependent if some strict subset of A has the same span as A
• linearly independent if it is not linearly dependent

Put differently, a set of vectors is linearly independent if no vector is redundant to the span, and linearly dependent otherwise.

To illustrate the idea, recall the figure that showed the span of vectors {a₁, a₂} in ℝ³ as a plane through the origin.

If we take a third vector a₃ and form the set {a₁, a₂, a₃}, this set will be

• linearly dependent if a₃ lies in the plane
• linearly independent otherwise

As another illustration of the concept, since ℝⁿ can be spanned by n vectors (see the discussion of canonical basis vectors above), any collection of m > n vectors in ℝⁿ must be linearly dependent.

The following statements are equivalent to linear independence of A := {a₁, …, aₖ} ⊂ ℝⁿ.

1. No vector in A can be formed as a linear combination of the other elements.

2. If β₁a₁ + ⋯ + βₖaₖ = 0 for scalars β₁, …, βₖ, then β₁ = ⋯ = βₖ = 0.

(The zero in the first expression is the origin of ℝⁿ.)
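Condition 2 translates directly into a numerical test: the columns of a matrix are linearly independent exactly when its rank equals the number of columns. A small sketch (the helper name `is_independent` is ours, not the lecture's):

```julia
using LinearAlgebra

# columns are linearly independent iff rank equals the number of columns
is_independent(A) = rank(A) == size(A, 2)

is_independent([1.0 0.0; 0.0 1.0; 0.0 0.0])   # true
is_independent([1.0 2.0; 2.0 4.0; 3.0 6.0])   # false: column 2 = 2 * column 1
```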

3.6 Unique Representations

Another nice thing about sets of linearly independent vectors is that each element in the span has a unique representation as a linear combination of these vectors.

In other words, if A := {a₁, …, aₖ} ⊂ ℝⁿ is linearly independent and

y = β₁a₁ + ⋯ + βₖaₖ

then no other coefficient sequence γ₁, …, γₖ will produce the same vector y.

Indeed, if we also have y = γ₁a₁ + ⋯ + γₖaₖ, then

(β₁ − γ₁)a₁ + ⋯ + (βₖ − γₖ)aₖ = 0

Linear independence now implies γᵢ = βᵢ for all i.


4 Matrices

Matrices are a neat way of organizing data for use in linear operations.

An ๐‘› ร— ๐‘˜ matrix is a rectangular array ๐ด of numbers with ๐‘› rows and ๐‘˜ columns:

๐ด =โŽกโŽขโŽขโŽฃ

๐‘Ž11 ๐‘Ž12 โ‹ฏ ๐‘Ž1๐‘˜๐‘Ž21 ๐‘Ž22 โ‹ฏ ๐‘Ž2๐‘˜

โ‹ฎ โ‹ฎ โ‹ฎ๐‘Ž๐‘›1 ๐‘Ž๐‘›2 โ‹ฏ ๐‘Ž๐‘›๐‘˜

โŽคโŽฅโŽฅโŽฆ

Often, the numbers in the matrix represent coefficients in a system of linear equations, as discussed at the start of this lecture.

For obvious reasons, the matrix A is also called a vector if either n = 1 or k = 1.

In the former case, A is called a row vector, while in the latter it is called a column vector.

If n = k, then A is called square.

The matrix formed by replacing aᵢⱼ by aⱼᵢ for every i and j is called the transpose of A, and denoted A′ or A⊤.

If A = A′, then A is called symmetric.

For a square matrix A, the n elements of the form aᵢᵢ for i = 1, …, n are called the principal diagonal.

A is called diagonal if the only nonzero entries are on the principal diagonal.

If, in addition to being diagonal, each element along the principal diagonal is equal to 1, then A is called the identity matrix, and denoted by I.

4.1 Matrix Operations

Just as was the case for vectors, a number of algebraic operations are defined for matrices.

Scalar multiplication and addition are immediate generalizations of the vector case:

       ⎡ a₁₁ ⋯ a₁ₖ ⎤    ⎡ γa₁₁ ⋯ γa₁ₖ ⎤
γA = γ ⎢  ⋮      ⋮  ⎥ := ⎢  ⋮        ⋮  ⎥
       ⎣ aₙ₁ ⋯ aₙₖ ⎦    ⎣ γaₙ₁ ⋯ γaₙₖ ⎦

and

        ⎡ a₁₁ ⋯ a₁ₖ ⎤   ⎡ b₁₁ ⋯ b₁ₖ ⎤    ⎡ a₁₁ + b₁₁ ⋯ a₁ₖ + b₁ₖ ⎤
A + B = ⎢  ⋮      ⋮  ⎥ + ⎢  ⋮      ⋮  ⎥ := ⎢      ⋮            ⋮     ⎥
        ⎣ aₙ₁ ⋯ aₙₖ ⎦   ⎣ bₙ₁ ⋯ bₙₖ ⎦    ⎣ aₙ₁ + bₙ₁ ⋯ aₙₖ + bₙₖ ⎦

In the latter case, the matrices must have the same shape in order for the definition to make sense.

We also have a convention for multiplying two matrices.

The rule for matrix multiplication generalizes the idea of inner products discussed above, and is designed to make multiplication play well with basic linear operations.


If ๐ด and ๐ต are two matrices, then their product ๐ด๐ต is formed by taking as its ๐‘–, ๐‘—-th elementthe inner product of the ๐‘–-th row of ๐ด and the ๐‘—-th column of ๐ต.

There are many tutorials to help you visualize this operation, such as this one, or the discus-sion on the Wikipedia page.

If ๐ด is ๐‘› ร— ๐‘˜ and ๐ต is ๐‘— ร— ๐‘š, then to multiply ๐ด and ๐ต we require ๐‘˜ = ๐‘—, and the resultingmatrix ๐ด๐ต is ๐‘› ร— ๐‘š.

As perhaps the most important special case, consider multiplying ๐‘› ร— ๐‘˜ matrix ๐ด and ๐‘˜ ร— 1column vector ๐‘ฅ.

According to the preceding rule, this gives us an ๐‘› ร— 1 column vector

๐ด๐‘ฅ = โŽกโŽขโŽฃ

๐‘Ž11 โ‹ฏ ๐‘Ž1๐‘˜โ‹ฎ โ‹ฎ โ‹ฎ

๐‘Ž๐‘›1 โ‹ฏ ๐‘Ž๐‘›๐‘˜

โŽคโŽฅโŽฆ

โŽกโŽขโŽฃ

๐‘ฅ1โ‹ฎ

๐‘ฅ๐‘˜

โŽคโŽฅโŽฆ

โˆถ= โŽกโŽขโŽฃ

๐‘Ž11๐‘ฅ1 + โ‹ฏ + ๐‘Ž1๐‘˜๐‘ฅ๐‘˜โ‹ฎ

๐‘Ž๐‘›1๐‘ฅ1 + โ‹ฏ + ๐‘Ž๐‘›๐‘˜๐‘ฅ๐‘˜

โŽคโŽฅโŽฆ

(2)

Note๐ด๐ต and ๐ต๐ด are not generally the same thing.

Another important special case is the identity matrix.

You should check that if A is n × k and I is the k × k identity matrix, then AI = A.

If I is the n × n identity matrix, then IA = A.

4.2 Matrices in Julia

Julia arrays are also used as matrices, and have fast, efficient functions and methods for all the standard matrix operations.

You can create them as follows

In [15]: A = [1 2
              3 4]

Out[15]: 2×2 Array{Int64,2}:
          1  2
          3  4

In [16]: typeof(A)

Out[16]: Array{Int64,2}

In [17]: size(A)

Out[17]: (2, 2)


The size function returns a tuple giving the number of rows and columns.

To get the transpose of A, use transpose(A) or, more simply, A'.

There are many convenient functions for creating common matrices (matrices of zeros, ones, etc.) — see here.

Since operations are performed elementwise by default, scalar multiplication and addition have very natural syntax

In [18]: A = ones(3, 3)

Out[18]: 3×3 Array{Float64,2}:
          1.0  1.0  1.0
          1.0  1.0  1.0
          1.0  1.0  1.0

In [19]: 2I

Out[19]: UniformScaling{Int64}
         2*I

In [20]: A + I

Out[20]: 3×3 Array{Float64,2}:
          2.0  1.0  1.0
          1.0  2.0  1.0
          1.0  1.0  2.0

To multiply matrices we use the * operator.

In particular, A * B is matrix multiplication, whereas A .* B is element-by-element multiplication.

4.3 Matrices as Maps

Each ๐‘› ร— ๐‘˜ matrix ๐ด can be identified with a function ๐‘“(๐‘ฅ) = ๐ด๐‘ฅ that maps ๐‘ฅ โˆˆ โ„๐‘˜ into๐‘ฆ = ๐ด๐‘ฅ โˆˆ โ„๐‘›.

These kinds of functions have a special property: they are linear.

A function ๐‘“ โˆถ โ„๐‘˜ โ†’ โ„๐‘› is called linear if, for all ๐‘ฅ, ๐‘ฆ โˆˆ โ„๐‘˜ and all scalars ๐›ผ, ๐›ฝ, we have

๐‘“(๐›ผ๐‘ฅ + ๐›ฝ๐‘ฆ) = ๐›ผ๐‘“(๐‘ฅ) + ๐›ฝ๐‘“(๐‘ฆ)

You can check that this holds for the function ๐‘“(๐‘ฅ) = ๐ด๐‘ฅ + ๐‘ when ๐‘ is the zero vector, andfails when ๐‘ is nonzero.

In fact, itโ€™s known that ๐‘“ is linear if and only if there exists a matrix ๐ด such that ๐‘“(๐‘ฅ) = ๐ด๐‘ฅfor all ๐‘ฅ.
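A quick numerical illustration of both claims, with an arbitrary matrix and shift vector (the particular numbers are ours):

```julia
using LinearAlgebra

A = [2.0 1.0; 0.5 3.0]
b = [1.0, -1.0]

f(x) = A * x          # linear
g(x) = A * x + b      # affine; not linear when b is nonzero

x, y = [1.0, 2.0], [3.0, -1.0]
α, β = 2.0, -0.5

f(α*x + β*y) ≈ α*f(x) + β*f(y)     # holds
g(α*x + β*y) ≈ α*g(x) + β*g(y)     # fails: the two sides differ by (1 - α - β)b
```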


5 Solving Systems of Equations

Recall again the system of equations (1).

If we compare (1) and (2), we see that (1) can now be written more conveniently as

y = Ax     (3)

The problem we face is to determine a vector x ∈ ℝᵏ that solves (3), taking y and A as given.

This is a special case of a more general problem: find an x such that y = f(x).

Given an arbitrary function f and a y, is there always an x such that y = f(x)?

If so, is it always unique?

The answer to both these questions is negative, as the next figure shows

In [21]: f(x) = 0.6cos(4x) + 1.3
         grid = range(-2, 2, length = 100)
         y_min, y_max = extrema(f(x) for x in grid)
         plt1 = plot(f, xlim = (-2, 2), label = "f")
         hline!(plt1, [f(0.5)], linestyle = :dot, linewidth = 2, label = "")
         vline!(plt1, [-1.07, -0.5, 0.5, 1.07], linestyle = :dot, linewidth = 2,
                label = "")
         plot!(plt1, fill(0, 2), [y_min y_min; y_max y_max], lw = 3, color = :blue,
               label = ["range of f" ""])
         plt2 = plot(f, xlim = (-2, 2), label = "f")
         hline!(plt2, [2], linestyle = :dot, linewidth = 2, label = "")
         plot!(plt2, fill(0, 2), [y_min y_min; y_max y_max], lw = 3, color = :blue,
               label = ["range of f" ""])
         plot(plt1, plt2, layout = (2, 1), ylim = (0, 3.5))

Out[21]:

In the first plot there are multiple solutions, as the function is not one-to-one, while in the second there are no solutions, since y lies outside the range of f.

Can we impose conditions on A in (3) that rule out these problems?

In this context, the most important thing to recognize about the expression Ax is that it corresponds to a linear combination of the columns of A.

In particular, if a₁, …, aₖ are the columns of A, then

Ax = x₁a₁ + ⋯ + xₖaₖ

Hence the range of f(x) = Ax is exactly the span of the columns of A.

We want the range to be large, so that it contains arbitrary y.

As you might recall, the condition that we want for the span to be large is linear independence.

A happy fact is that linear independence of the columns of A also gives us uniqueness.

Indeed, it follows from our earlier discussion that if {a₁, …, aₖ} are linearly independent and y = Ax = x₁a₁ + ⋯ + xₖaₖ, then no z ≠ x satisfies y = Az.


5.1 The n × n Case

Let's discuss some more details, starting with the case where A is n × n.

This is the familiar case where the number of unknowns equals the number of equations.

For arbitrary y ∈ ℝⁿ, we hope to find a unique x ∈ ℝⁿ such that y = Ax.

In view of the observations immediately above, if the columns of A are linearly independent, then their span, and hence the range of f(x) = Ax, is all of ℝⁿ.

Hence there always exists an x such that y = Ax.

Moreover, the solution is unique.

In particular, the following are equivalent

1. The columns of A are linearly independent

2. For any y ∈ ℝⁿ, the equation y = Ax has a unique solution

The property of having linearly independent columns is sometimes expressed as having full column rank.

5.1.1 Inverse Matrices

Can we give some sort of expression for the solution?

If ๐‘ฆ and ๐ด are scalar with ๐ด โ‰  0, then the solution is ๐‘ฅ = ๐ดโˆ’1๐‘ฆ.

A similar expression is available in the matrix case.

In particular, if square matrix ๐ด has full column rank, then it possesses a multiplicative in-verse matrix ๐ดโˆ’1, with the property that ๐ด๐ดโˆ’1 = ๐ดโˆ’1๐ด = ๐ผ .

As a consequence, if we pre-multiply both sides of ๐‘ฆ = ๐ด๐‘ฅ by ๐ดโˆ’1, we get ๐‘ฅ = ๐ดโˆ’1๐‘ฆ.

This is the solution that weโ€™re looking for.

5.1.2 Determinants

Another quick comment about square matrices is that to every such matrix we assign a unique number called the determinant of the matrix — you can find the expression for it here.

If the determinant of A is not zero, then we say that A is nonsingular.

Perhaps the most important fact about determinants is that A is nonsingular if and only if A is of full column rank.

This gives us a useful one-number summary of whether or not a square matrix can be inverted.


5.2 More Rows than Columns

This is the ๐‘› ร— ๐‘˜ case with ๐‘› > ๐‘˜.

This case is very important in many settings, not least in the setting of linear regression(where ๐‘› is the number of observations, and ๐‘˜ is the number of explanatory variables).

Given arbitrary ๐‘ฆ โˆˆ โ„๐‘›, we seek an ๐‘ฅ โˆˆ โ„๐‘˜ such that ๐‘ฆ = ๐ด๐‘ฅ.

In this setting, existence of a solution is highly unlikely.

Without much loss of generality, letโ€™s go over the intuition focusing on the case where thecolumns of ๐ด are linearly independent.

It follows that the span of the columns of ๐ด is a ๐‘˜-dimensional subspace of โ„๐‘›.

This span is very โ€œunlikelyโ€ to contain arbitrary ๐‘ฆ โˆˆ โ„๐‘›.

To see why, recall the figure above, where ๐‘˜ = 2 and ๐‘› = 3.

Imagine an arbitrarily chosen ๐‘ฆ โˆˆ โ„3, located somewhere in that three dimensional space.

Whatโ€™s the likelihood that ๐‘ฆ lies in the span of {๐‘Ž1, ๐‘Ž2} (i.e., the two dimensional planethrough these points)?

In a sense it must be very small, since this plane has zero โ€œthicknessโ€.

As a result, in the ๐‘› > ๐‘˜ case we usually give up on existence.

However, we can still seek a best approximation, for example an ๐‘ฅ that makes the distanceโ€–๐‘ฆ โˆ’ ๐ด๐‘ฅโ€– as small as possible.

To solve this problem, one can use either calculus or the theory of orthogonal projections.

The solution is known to be ฬ‚๐‘ฅ = (๐ดโ€ฒ๐ด)โˆ’1๐ดโ€ฒ๐‘ฆ โ€” see for example chapter 3 of these notes
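As a sketch of this formula at work (the data here are made up for illustration), the normal-equations expression agrees with Julia's built-in least-squares solve:

```julia
using LinearAlgebra

A = [1.0 1.0;
     1.0 2.0;
     1.0 3.0]                    # n = 3 equations, k = 2 unknowns
y = [1.0, 2.0, 2.0]

x_hat = inv(A' * A) * (A' * y)   # the (A'A)⁻¹ A'y formula
x_hat ≈ A \ y                    # A \ y computes the same least-squares solution
```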

5.3 More Columns than Rows

This is the ๐‘› ร— ๐‘˜ case with ๐‘› < ๐‘˜, so there are fewer equations than unknowns.

In this case there are either no solutions or infinitely many โ€” in other words, uniquenessnever holds.

For example, consider the case where ๐‘˜ = 3 and ๐‘› = 2.

Thus, the columns of ๐ด consists of 3 vectors in โ„2.

This set can never be linearly independent, since it is possible to find two vectors that spanโ„2.

(For example, use the canonical basis vectors)

It follows that one column is a linear combination of the other two.

For example, letโ€™s say that ๐‘Ž1 = ๐›ผ๐‘Ž2 + ๐›ฝ๐‘Ž3.

Then if ๐‘ฆ = ๐ด๐‘ฅ = ๐‘ฅ1๐‘Ž1 + ๐‘ฅ2๐‘Ž2 + ๐‘ฅ3๐‘Ž3, we can also write

๐‘ฆ = ๐‘ฅ1(๐›ผ๐‘Ž2 + ๐›ฝ๐‘Ž3) + ๐‘ฅ2๐‘Ž2 + ๐‘ฅ3๐‘Ž3 = (๐‘ฅ1๐›ผ + ๐‘ฅ2)๐‘Ž2 + (๐‘ฅ1๐›ฝ + ๐‘ฅ3)๐‘Ž3

In other words, uniqueness fails.
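A tiny numerical instance of this failure, with the third column equal to the sum of the first two (our own example):

```julia
A = [1.0 0.0 1.0;
     0.0 1.0 1.0]       # n = 2, k = 3; third column = sum of the first two

x1 = [1.0, 1.0, 0.0]
x2 = [0.0, 0.0, 1.0]    # a different coefficient vector

A * x1 == A * x2        # both give y = [1, 1], so the solution is not unique
```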


5.4 Linear Equations with Julia

Here's an illustration of how to solve linear equations with Julia's built-in linear algebra facilities

In [22]: A = [1.0 2.0; 3.0 4.0];

In [23]: y = ones(2, 1); # A column vector

In [24]: det(A)

Out[24]: -2.0

In [25]: A_inv = inv(A)

Out[25]: 2×2 Array{Float64,2}:
          -2.0   1.0
           1.5  -0.5

In [26]: x = A_inv * y # solution

Out[26]: 2×1 Array{Float64,2}:
          -0.9999999999999998
           0.9999999999999999

In [27]: A * x # should equal y (a vector of ones)

Out[27]: 2×1 Array{Float64,2}:
          1.0
          1.0000000000000004

In [28]: A \ y # produces the same solution

Out[28]: 2×1 Array{Float64,2}:
          -1.0
           1.0

Observe how we can solve for x = A⁻¹y either via inv(A) * y or via A \ y.

The latter method is preferred because it automatically selects the best algorithm for the problem based on the types of A and y.

If A is not square then A \ y returns the least squares solution x̂ = (A′A)⁻¹A′y.


6 Eigenvalues and Eigenvectors

Let ๐ด be an ๐‘› ร— ๐‘› square matrix.

If ๐œ† is scalar and ๐‘ฃ is a non-zero vector in โ„๐‘› such that

๐ด๐‘ฃ = ๐œ†๐‘ฃ

then we say that ๐œ† is an eigenvalue of ๐ด, and ๐‘ฃ is an eigenvector.

Thus, an eigenvector of ๐ด is a vector such that when the map ๐‘“(๐‘ฅ) = ๐ด๐‘ฅ is applied, ๐‘ฃ ismerely scaled.

The next figure shows two eigenvectors (blue arrows) and their images under ๐ด (red arrows).

As expected, the image ๐ด๐‘ฃ of each ๐‘ฃ is just a scaled version of the original

In [29]: A = [1 2
              2 1]

         evals, evecs = eigen(A)

         a1, a2 = evals
         eig_1 = [0 0; evecs[:,1]']
         eig_2 = [0 0; evecs[:,2]']
         x = range(-5, 5, length = 10)
         y = -x

         plot(eig_1[:, 2], a1 * eig_2[:, 2], arrow = true, color = :red,
              legend = :none, xlims = (-3, 3), ylims = (-3, 3),
              xticks = -3:3, yticks = -3:3, framestyle = :origin)
         plot!(a2 * eig_1[:, 2], a2 * eig_2, arrow = true, color = :red)
         plot!(eig_1, eig_2, arrow = true, color = :blue)
         plot!(x, y, color = :blue, lw = 0.4, alpha = 0.6)
         plot!(x, x, color = :blue, lw = 0.4, alpha = 0.6)

Out[29]:

The eigenvalue equation is equivalent to (A − λI)v = 0, and this has a nonzero solution v only when the columns of A − λI are linearly dependent.

This in turn is equivalent to stating that the determinant is zero.

Hence to find all eigenvalues, we can look for λ such that the determinant of A − λI is zero.

This problem can be expressed as one of solving for the roots of a polynomial in λ of degree n.

This in turn implies the existence of n solutions in the complex plane, although some might be repeated.

Some nice facts about the eigenvalues of a square matrix A are as follows

1. The determinant of A equals the product of the eigenvalues.

2. The trace of A (the sum of the elements on the principal diagonal) equals the sum of the eigenvalues.

3. If A is symmetric, then all of its eigenvalues are real.

4. If A is invertible and λ₁, …, λₙ are its eigenvalues, then the eigenvalues of A⁻¹ are 1/λ₁, …, 1/λₙ.

A corollary of the first statement is that a matrix is invertible if and only if all its eigenvalues are nonzero.

Using Julia, we can solve for the eigenvalues and eigenvectors of a matrix as follows

In [30]: A = [1.0 2.0; 2.0 1.0];

In [31]: evals, evecs = eigen(A);

In [32]: evals

Out[32]: 2-element Array{Float64,1}:
          -1.0
           3.0

In [33]: evecs

Out[33]: 2×2 Array{Float64,2}:
          -0.707107  0.707107
           0.707107  0.707107

Note that the columns of evecs are the eigenvectors.

Since any scalar multiple of an eigenvector is an eigenvector with the same eigenvalue (check it), the eigen routine normalizes the length of each eigenvector to one.
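The first two facts in the list above are easy to verify for this matrix:

```julia
using LinearAlgebra

A = [1.0 2.0; 2.0 1.0]
evals = eigvals(A)        # -1.0 and 3.0

det(A) ≈ prod(evals)      # determinant equals the product of the eigenvalues
tr(A) ≈ sum(evals)        # trace equals the sum of the eigenvalues
```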

6.1 Generalized Eigenvalues

It is sometimes useful to consider the generalized eigenvalue problem, which, for given matrices A and B, seeks generalized eigenvalues λ and eigenvectors v such that

Av = λBv

This can be solved in Julia via eigen(A, B).

Of course, if B is square and invertible, then we can treat the generalized eigenvalue problem as an ordinary eigenvalue problem B⁻¹Av = λv, but this is not always the case.
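For an invertible B the two formulations can be compared directly; a sketch with matrices chosen for illustration (the eigenvalues happen to be real here, so we drop any negligible imaginary parts before comparing):

```julia
using LinearAlgebra

A = [1.0 2.0; 2.0 1.0]
B = [2.0 0.0; 0.0 1.0]    # square and invertible

# generalized eigenvalues of (A, B) versus ordinary eigenvalues of B⁻¹A
lam_gen = sort(real.(eigvals(A, B)))
lam_ord = sort(real.(eigvals(inv(B) * A)))

lam_gen ≈ lam_ord         # the two problems agree
```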

7 Further Topics

We round out our discussion by briefly mentioning several other important topics.


7.1 Series Expansions

Recall the usual summation formula for a geometric progression, which states that if |a| < 1, then ∑ₖ₌₀^∞ aᵏ = (1 − a)⁻¹.

A generalization of this idea exists in the matrix setting.

7.1.1 Matrix Norms

Let ๐ด be a square matrix, and let

โ€–๐ดโ€– โˆถ= maxโ€–๐‘ฅโ€–=1

โ€–๐ด๐‘ฅโ€–

The norms on the right-hand side are ordinary vector norms, while the norm on the left-handside is a matrix norm โ€” in this case, the so-called spectral norm.

For example, for a square matrix ๐‘†, the condition โ€–๐‘†โ€– < 1 means that ๐‘† is contractive, in thesense that it pulls all vectors towards the origin Section ??.

7.1.2 Neumannโ€™s Theorem

Let ๐ด be a square matrix and let ๐ด๐‘˜ โˆถ= ๐ด๐ด๐‘˜โˆ’1 with ๐ด1 โˆถ= ๐ด.

In other words, ๐ด๐‘˜ is the ๐‘˜-th power of ๐ด.

Neumannโ€™s theorem states the following: If โ€–๐ด๐‘˜โ€– < 1 for some ๐‘˜ โˆˆ โ„•, then ๐ผ โˆ’ ๐ด is invertible,and

(๐ผ โˆ’ ๐ด)โˆ’1 =โˆž

โˆ‘๐‘˜=0

๐ด๐‘˜ (4)

7.1.3 Spectral Radius

A result known as Gelfand's formula tells us that, for any square matrix A,

ρ(A) = lim_{k→∞} ‖Aᵏ‖^(1/k)

Here ρ(A) is the spectral radius, defined as maxᵢ |λᵢ|, where {λᵢ} is the set of eigenvalues of A.

As a consequence of Gelfand's formula, if all eigenvalues are strictly less than one in modulus, there exists a k with ‖Aᵏ‖ < 1, in which case (4) is valid.
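The convergence in Gelfand's formula can be seen numerically; a rough sketch (the matrix and tolerance are arbitrary choices of ours):

```julia
using LinearAlgebra

A = [0.4 0.1;
     0.2 0.3]

rho = maximum(abs, eigvals(A))    # spectral radius; equals 0.5 here

# ‖A^k‖^(1/k) approaches rho as k grows
approx = opnorm(A^100)^(1/100)
isapprox(approx, rho; atol = 1e-2)
```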


7.2 Positive Definite Matrices

Let ๐ด be a symmetric ๐‘› ร— ๐‘› matrix.

We say that ๐ด is

1. positive definite if ๐‘ฅโ€ฒ๐ด๐‘ฅ > 0 for every ๐‘ฅ โˆˆ โ„๐‘› {0}

2. positive semi-definite or nonnegative definite if ๐‘ฅโ€ฒ๐ด๐‘ฅ โ‰ฅ 0 for every ๐‘ฅ โˆˆ โ„๐‘›

Analogous definitions exist for negative definite and negative semi-definite matrices.

It is notable that if ๐ด is positive definite, then all of its eigenvalues are strictly positive, andhence ๐ด is invertible (with positive definite inverse).
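These properties can be checked directly in Julia (`isposdef` attempts a Cholesky factorization); the matrix below is an arbitrary symmetric example:

```julia
using LinearAlgebra

A = [2.0 1.0;
     1.0 2.0]               # symmetric

isposdef(A)                 # true
all(eigvals(A) .> 0)        # eigenvalues are 1 and 3, strictly positive
isposdef(inv(A))            # the inverse is positive definite as well
```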

7.3 Differentiating Linear and Quadratic forms

The following formulas are useful in many economic contexts. Let

โ€ข ๐‘ง, ๐‘ฅ and ๐‘Ž all be ๐‘› ร— 1 vectorsโ€ข ๐ด be an ๐‘› ร— ๐‘› matrixโ€ข ๐ต be an ๐‘š ร— ๐‘› matrix and ๐‘ฆ be an ๐‘š ร— 1 vector

Then

1. 𝜕𝑎′𝑥/𝜕𝑥 = 𝑎

2. 𝜕𝐴𝑥/𝜕𝑥 = 𝐴′

3. 𝜕𝑥′𝐴𝑥/𝜕𝑥 = (𝐴 + 𝐴′)𝑥

4. 𝜕𝑦′𝐵𝑧/𝜕𝑦 = 𝐵𝑧

5. 𝜕𝑦′𝐵𝑧/𝜕𝐵 = 𝑦𝑧′

Exercise 1 below asks you to apply these formulas.
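As a sketch of how such formulas can be verified, the gradient in formula 3 can be compared with a central finite-difference approximation (all values below are hypothetical):

```julia
using LinearAlgebra

A = [1.0 2.0; 0.5 3.0]
x = [0.7, -1.2]
f(v) = v' * A * v  # the quadratic form x'Ax

# Central finite-difference gradient of f at x
h = 1e-6
grad_fd = [(f(x + h * e) - f(x - h * e)) / (2h)
           for e in eachcol(Matrix(1.0I, 2, 2))]

# Formula 3: the gradient of x'Ax is (A + A')x
grad_formula = (A + A') * x
println(maximum(abs.(grad_fd - grad_formula)))  # ≈ 0
```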

7.4 Further Reading

The documentation of the linear algebra features built into Julia can be found here.

Chapters 2 and 3 of The Econometric Theory contain a discussion of linear algebra along the same lines as above, with solved exercises.

If you don't mind a slightly abstract approach, a nice intermediate-level text on linear algebra is [1].



8 Exercises

8.1 Exercise 1

Let ๐‘ฅ be a given ๐‘› ร— 1 vector and consider the problem

๐‘ฃ(๐‘ฅ) = max๐‘ฆ,๐‘ข

{โˆ’๐‘ฆโ€ฒ๐‘ƒ๐‘ฆ โˆ’ ๐‘ขโ€ฒ๐‘„๐‘ข}

subject to the linear constraint

๐‘ฆ = ๐ด๐‘ฅ + ๐ต๐‘ข

Here

• 𝑃 is an 𝑛 × 𝑛 matrix and 𝑄 is an 𝑚 × 𝑚 matrix
• 𝐴 is an 𝑛 × 𝑛 matrix and 𝐵 is an 𝑛 × 𝑚 matrix
• both 𝑃 and 𝑄 are symmetric and positive semidefinite

(What must the dimensions of ๐‘ฆ and ๐‘ข be to make this a well-posed problem?)

One way to solve the problem is to form the Lagrangian

โ„’ = โˆ’๐‘ฆโ€ฒ๐‘ƒ๐‘ฆ โˆ’ ๐‘ขโ€ฒ๐‘„๐‘ข + ๐œ†โ€ฒ [๐ด๐‘ฅ + ๐ต๐‘ข โˆ’ ๐‘ฆ]

where ๐œ† is an ๐‘› ร— 1 vector of Lagrange multipliers.

Try applying the formulas given above for differentiating quadratic and linear forms to obtain the first-order conditions for maximizing ℒ with respect to 𝑦, 𝑢 and minimizing it with respect to 𝜆.

Show that these conditions imply that

1. ๐œ† = โˆ’2๐‘ƒ ๐‘ฆ

2. The optimizing choice of ๐‘ข satisfies ๐‘ข = โˆ’(๐‘„ + ๐ตโ€ฒ๐‘ƒ๐ต)โˆ’1๐ตโ€ฒ๐‘ƒ๐ด๐‘ฅ

3. The function 𝑣 satisfies 𝑣(𝑥) = −𝑥′𝑃̃𝑥 where 𝑃̃ = 𝐴′𝑃𝐴 − 𝐴′𝑃𝐵(𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴

As we will see, in economic contexts Lagrange multipliers are often shadow prices.

Note: If we don't care about the Lagrange multipliers, we can substitute the constraint into the objective function, and then just maximize −(𝐴𝑥 + 𝐵𝑢)′𝑃(𝐴𝑥 + 𝐵𝑢) − 𝑢′𝑄𝑢 with respect to 𝑢. You can verify that this leads to the same maximizer.

9 Solutions

Thanks to Willem Hekman and Guanlong Ren for providing this solution.



9.1 Exercise 1

We have an optimization problem:

𝑣(𝑥) = max_{𝑦,𝑢} {−𝑦′𝑃𝑦 − 𝑢′𝑄𝑢}

s.t.

๐‘ฆ = ๐ด๐‘ฅ + ๐ต๐‘ข

with primitives

• 𝑃 a symmetric and positive semidefinite 𝑛 × 𝑛 matrix
• 𝑄 a symmetric and positive semidefinite 𝑚 × 𝑚 matrix
• 𝐴 an 𝑛 × 𝑛 matrix
• 𝐵 an 𝑛 × 𝑚 matrix

The associated Lagrangian is:

๐ฟ = โˆ’๐‘ฆโ€ฒ๐‘ƒ๐‘ฆ โˆ’ ๐‘ขโ€ฒ๐‘„๐‘ข + ๐œ†โ€ฒ[๐ด๐‘ฅ + ๐ต๐‘ข โˆ’ ๐‘ฆ]

9.1.1 1.

Differentiating the Lagrangian with respect to 𝑦 and setting the derivative equal to zero yields

𝜕𝐿/𝜕𝑦 = −(𝑃 + 𝑃′)𝑦 − 𝜆 = −2𝑃𝑦 − 𝜆 = 0,

since P is symmetric.

Accordingly, the first-order condition for maximizing L w.r.t. y implies

๐œ† = โˆ’2๐‘ƒ๐‘ฆ .

9.1.2 2.

Differentiating the Lagrangian with respect to 𝑢 and setting the derivative equal to zero yields

𝜕𝐿/𝜕𝑢 = −(𝑄 + 𝑄′)𝑢 + 𝐵′𝜆 = −2𝑄𝑢 + 𝐵′𝜆 = 0.

Substituting ๐œ† = โˆ’2๐‘ƒ๐‘ฆ gives

๐‘„๐‘ข + ๐ตโ€ฒ๐‘ƒ๐‘ฆ = 0 .

Substituting the linear constraint 𝑦 = 𝐴𝑥 + 𝐵𝑢 into the above equation gives

๐‘„๐‘ข + ๐ตโ€ฒ๐‘ƒ(๐ด๐‘ฅ + ๐ต๐‘ข) = 0



(๐‘„ + ๐ตโ€ฒ๐‘ƒ๐ต)๐‘ข + ๐ตโ€ฒ๐‘ƒ๐ด๐‘ฅ = 0

which is the first-order condition for maximizing L w.r.t. u.

Thus, the optimal choice of u must satisfy

๐‘ข = โˆ’(๐‘„ + ๐ตโ€ฒ๐‘ƒ๐ต)โˆ’1๐ตโ€ฒ๐‘ƒ๐ด๐‘ฅ ,

which follows from rearranging the first-order condition (assuming 𝑄 + 𝐵′𝑃𝐵 is invertible).

9.1.3 3.

Rewriting our problem by substituting the constraint into the objective function, we get

𝑣(𝑥) = max_𝑢 {−(𝐴𝑥 + 𝐵𝑢)′𝑃(𝐴𝑥 + 𝐵𝑢) − 𝑢′𝑄𝑢}.

Since we know the optimal choice of 𝑢 satisfies 𝑢 = −(𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴𝑥, then

𝑣(𝑥) = −(𝐴𝑥 + 𝐵𝑢)′𝑃(𝐴𝑥 + 𝐵𝑢) − 𝑢′𝑄𝑢  with  𝑢 = −(𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴𝑥

To evaluate the function

𝑣(𝑥) = −(𝐴𝑥 + 𝐵𝑢)′𝑃(𝐴𝑥 + 𝐵𝑢) − 𝑢′𝑄𝑢
     = −(𝑥′𝐴′ + 𝑢′𝐵′)𝑃(𝐴𝑥 + 𝐵𝑢) − 𝑢′𝑄𝑢
     = −𝑥′𝐴′𝑃𝐴𝑥 − 𝑢′𝐵′𝑃𝐴𝑥 − 𝑥′𝐴′𝑃𝐵𝑢 − 𝑢′𝐵′𝑃𝐵𝑢 − 𝑢′𝑄𝑢
     = −𝑥′𝐴′𝑃𝐴𝑥 − 2𝑢′𝐵′𝑃𝐴𝑥 − 𝑢′(𝑄 + 𝐵′𝑃𝐵)𝑢

For simplicity, denote 𝑆 ∶= (𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴, so that 𝑢 = −𝑆𝑥.

Regarding the second term −2𝑢′𝐵′𝑃𝐴𝑥, substituting 𝑢 = −𝑆𝑥 gives

−2𝑢′𝐵′𝑃𝐴𝑥 = 2𝑥′𝑆′𝐵′𝑃𝐴𝑥 = 2𝑥′𝐴′𝑃𝐵(𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴𝑥

Notice that the term (𝑄 + 𝐵′𝑃𝐵)⁻¹ is symmetric, as both 𝑃 and 𝑄 are symmetric.

Regarding the third term −𝑢′(𝑄 + 𝐵′𝑃𝐵)𝑢,

−𝑢′(𝑄 + 𝐵′𝑃𝐵)𝑢 = −𝑥′𝑆′(𝑄 + 𝐵′𝑃𝐵)𝑆𝑥 = −𝑥′𝐴′𝑃𝐵(𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴𝑥

Hence, the sum of the second and third terms is 𝑥′𝐴′𝑃𝐵(𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴𝑥.

This implies that

𝑣(𝑥) = −𝑥′𝐴′𝑃𝐴𝑥 − 2𝑢′𝐵′𝑃𝐴𝑥 − 𝑢′(𝑄 + 𝐵′𝑃𝐵)𝑢
     = −𝑥′𝐴′𝑃𝐴𝑥 + 𝑥′𝐴′𝑃𝐵(𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴𝑥
     = −𝑥′[𝐴′𝑃𝐴 − 𝐴′𝑃𝐵(𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴]𝑥



Therefore, the solution to the optimization problem is 𝑣(𝑥) = −𝑥′𝑃̃𝑥, where 𝑃̃ ∶= 𝐴′𝑃𝐴 − 𝐴′𝑃𝐵(𝑄 + 𝐵′𝑃𝐵)⁻¹𝐵′𝑃𝐴.
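This closed form can be sanity-checked numerically. The primitives below are purely illustrative (built so that 𝑃 and 𝑄 are symmetric positive semidefinite), and `\` solves the linear systems rather than forming inverses explicitly:

```julia
using LinearAlgebra

# Illustrative primitives: P = G'G and Q are symmetric positive (semi)definite
G = [1.0 0.2 0.0; 0.2 1.5 0.3; 0.0 0.3 0.8]
P = G' * G
Q = [1.0 0.1; 0.1 0.6]
A = [0.5 0.1 0.0; 0.2 0.4 0.1; 0.0 0.1 0.3]
B = [1.0 0.0; 0.0 1.0; 0.5 0.2]
x = [1.0, -0.5, 2.0]

u = -((Q + B' * P * B) \ (B' * P * A * x))  # optimal u from part 2
y = A * x + B * u
obj = -y' * P * y - u' * Q * u              # objective value at the optimum

Ptilde = A' * P * A - A' * P * B * ((Q + B' * P * B) \ (B' * P * A))
println(abs(obj - (-(x' * Ptilde * x))))    # ≈ 0, matching part 3
```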

Footnotes

[1] Suppose that ‖𝑆‖ < 1. Take any nonzero vector 𝑥, and let 𝑟 ∶= ‖𝑥‖. We have ‖𝑆𝑥‖ = 𝑟‖𝑆(𝑥/𝑟)‖ ≤ 𝑟‖𝑆‖ < 𝑟 = ‖𝑥‖. Hence every point is pulled towards the origin.

References

[1] K. Jänich. Linear Algebra. Springer Undergraduate Texts in Mathematics and Technology. Springer, 1994.
