












Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An introduction to the least squares method, specifically linear regression, which is used to fit data to models. The difference between fitting and interpolation, tasks that arise in science, and the concept of model parameters. It also discusses the relationships between variables and the use of scatter plots. The document concludes with the computational formula for least-squares regression.
Typology: Study notes
1 / 20
This page cannot be seen from the preview
Don't miss anything!













Ramani Duraiswami, Dept. of Computer Science
Fitting data to a model
Practical science involves lots of fitting of data to models
-^
Difference between fitting and interpolation?ā Interpolation, the fit function passes through the pointā Fitting, the fit function satisfies some norm based criterion
-^
Tasks arise commonly in scienceā Fit straight lines and curves to dataā More generally fit data to a model
-^
Model contains parametersā Job of fitting is to estimate the parameters that ābestā make the
model fit the data
Ć
define best
Simplest example of model fitting problemā Linear regression
In much science we seek relations between variables
One variable is used to āexplainā another variable
X
Variable
Independent VariableExplaining VariableExogenous Variable
Predictor Variable
Y
Variable
Dependent VariableResponse VariableEndogenous VariableCriterion Variable
Scatter Plots
X
Y
X
Y
We will end up being reasonably confidentthat the true regression line is somewherein the indicated region.
X
Y
Estimated Regression Line
errors/residuals
X
Y
Estimated Regression Line
i
i
i^
y
y
e^
Ė ā
=
x^ i
bX a Y^
= Ė
:
Line
Regression the of
Equation
X
Y
i
i
i^
y
y
e^
Ė ā
=
Ė y
x^ i
(^
)^
1
1
ā
ā
=
=
N i
i
i
N i
i^
(^
)^
0
:
Remember
1
=
ā
N ā= i
i^
y
y
X b Y a X X
b
N i
i
N i
i
i
= =^
1
2
1
=^
=
=^
=
=
ā ā ā
ā ā ā ā
ā ā ā
āā āā āā
ā ā ā ā
=
N i
N i
i
i
N i
N i
i
N i
i
i i
X
X
N
Y X Y X N b
1
2
1
2
1
1
1
ComputationalFormula
Models
Have a certain model structureā E.g., ā straight lineā āparabolicā ātrigonometricā āGaussianā
-^
Models have specifiable parameters
-^
e.g.
-^
Model
Structure
Data
Parameters
Straight line:
a x + by +c =
(x
,yi
) i
(a,b,c)
Polynomial:
y=c
+c 0
1
x+ ā¦+c
n^
n x
(x
,yi
) i
(^ c
,c 0
1
, ā¦,c
n^
)
Trig.:
y=c
+c 0
1
sin x+ ā¦+c
n^
sin nx
(x
,yi
) i
(^ c
,c 0
1
, ā¦,c
n^
)
Gaussian
y=c
0
exp(-(x-
(x
,yi
) i
(c
āKernel/RBFā
y=
ai
k(x,xi^
)i
(x
,yi
) i
(a
)i
and
(x
)i
These models are all separable
Linear Systems
A
x
b
= =
Square system:
Rectangular system ??
infinity of solutions Minimize |Ax-b|
2
no solution
A
x
b
Least Squares for more complex models
-^
Number of equations and unknowns may not match
-^
Data may have noise
-^
Look for solution by minimizing some cost function
-^
Simplest and most intuitive cost function: ||
Ax - b
Define for each data point
x
a residual i
r
i
Minimize
ri
ri with respect to i^
x
l
r^ i
r
= i
(Aj
xij
-bj
).i
k^
ik
xk
-b
)i
The system A
t^ A x = A
tb
is called the Normal equations
-^
Can solve least squares problems using these
-^
For A size m
n and x of size n and b of size m what are
the dimensions of the normal equations?ā n
Ć
n
Solve via LU decomposition
-^
Is this a good idea?ā Somewhat expensive as we have to form A
t^ A which involves
matrix multiplication and then solution
t A) = (cond(A))
2
Example