MATH 242 Lecture 23: Regression & Constrained Optimization with Lagrange Multipliers | Study notes Mathematics

MATH 242, LECTURE 23

1. Regression analysis

Definition 1. The vertical deviation of a function f(x)from some collection of data points {(xi, yi)}is

the sum

(y1−f(x1))2+ (y2−f(x2))2+· · · + (yi−f(xi))2+· · · .

Example 2. Find the line whose vertical deviation from the points (1,1),(2,3) and (3,4) is minimal.

Some important features of this example:

•Though it looks like xand yshould be our variables, the slope mand y-intercept bof the line are

the “real” variables. Much as the coefficients of a quadratic polynomial are variables when we fit

a parabola to data.

•Ultimately, the critical point is found as a solution of a system of linear equations.

Definition 3. The linear regression line for a collection of data is the linear function whose vertical

deviation from that collection is minimal.

The ability to find such lines is programmed into statistical and data analysis software, as well as your

calculators. The book gives explicit formulae which you may use for the homework, but for the exam

it will be more important that you understand the way in which minimization techniques are employed.

(This is another case where we are learning exactly what our calculators are doing behind the scenes).

Theorem 4. The linear regression line for the collection of data (x1, y1),(x2, y2),...,(xk, yk)is the

function f(x) = mx +bsuch that the sum

(x1m+b−y1)2+· · · + (xkm+b−yk)2

is minimized. Taking the partial derivatives and setting them to zero leads to a system of two linear

equations in the variables mand b.

One of the main applications of linear regression is to fill in/ predict/ extrapolate values for a function

from known values.

Example 5. Because of a computer error, some of the sales figures for a real estate company were

lost. The sales figures (measured in millions of dollars) which are available for Gary Gladhand are:

1998 1999 2001 2003

0.9 1.5 1.9 2.4 Graph these, and then use the regression line to estimate/predict what he

sold in 2000 and what he will sell in 2004.

2. Constrained optimization and Lagrange multipliers

2.1. Motivation: the need for additional tools in constrained optimization. In multivariable

optimization, it is often the case that there is some equation which imposes relations among the variables

under consideration. Such constraint equations arise naturally in at least two distinct ways:

•The equation represents a relation intrinsic to the problem, as for example when the variables

represent money spent and there is one fixed limited source for the funds. We saw such problems

at the end of last term.

MATH 242 Lecture 23: Regression & Constrained Optimization with Lagrange Multipliers, Study notes of Mathematics