




































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The method of solving polynomial equations using circulant matrices. The document focuses on the quadratic equation x2 = x + 1 and explains how to find its roots by applying the theory of circulant matrices. The document also mentions the use of eigenvalues and eigenvectors to find the roots of a polynomial. The document also touches upon the concept of solving polynomial equations of higher degrees using circulant matrices.
Typology: Assignments
1 / 76
This page cannot be seen from the preview
Don't miss anything!





































































5 December 2006
What do we mean by solving an equation?
Example 1. Solve the equation x^2 = 1.
x^2 = 1 x^2 − 1 = 0 (x − 1)(x + 1) = 0 x = 1 or = − 1
Exercise. Solve the equation
√ x +
x − a = 2
where a is a positive real number.
Example 2. Solve the equation x^2 = 5.
x^2 = 5 x^2 − 5 = 0 (x −
5)(x +
x =
5 or −
5? Well,
5 is the positive real number that square to 5.
What are “solved” when we solve these equations?
What do we mean by solving a polynomial
equation?
Suppose we can solve the equation xn^ = c, i.e. taking roots, try to express the the roots of a degree n polynomial using only the usual algebraic operations (addition, subtraction, multiplication, division) and application of taking roots.
The eigenvalues and eigenvectors of circulant matrices are very easy to compute using the nth roots of unity.
1 , ω = (−1 + i
3)/ 2 and ω^2 = ω.
To begin with, we define a distinguished circulant matrix W with first row (0, 1 , 0 ,... , 0). W is just the identity matrix with its top row moved to the bottom, e.g. for n = 4,
Direct checking shows that
i) Note that W T^ = W −^1 (i.e. W is an orthogonal matrix).
ii) The characteristic polynomial for W is p(t) = det(tI − W ) = tn^ − 1 , and hence the eigenvalues of W are the nth roots of unity.
iii) For each nth root of unity λ, vλ = (1, λ, λ^2 ,... , λn−^1 ) is an associated eigenvector.
Therefore, q(W ) = a 0 I + a 1 W + a 2 W 2 + a 3 W 3 is equal to
a 0 a 1 a 2 a 3 a 3 a 0 a 1 a 2 a 2 a 3 a 0 a 1 a 1 a 2 a 3 a 0
ii) For any nth root of unity λ, q(λ) is an eigenvalue of C = q(W ).
[ Indeed, if W v = λv, then W kv = λkv and hence q(W )v = q(λ)v.]
Example 3. Consider the circulant matrix
Read the polynomial q from the first row of C:
q(t) = 1 + 2t + t^2 + 3t^3.
Summary
Start with any circulant matrix C, one can generate both the roots and coefficients of a polynomial p.
Here, the polynomial p is the characteristic polynomial of C; the coefficients can be obtained from the identity p(t) = det (tI − C); the roots, i.e., the eigenvalues of C, can be found by applying q to the nth roots of unity.
This perspective leads to a unified method for solving general quadratic, cubic, and quartic equations.
In fact, given a polynomial p, we try to find a corresponding circulant C having p as its characteristic polynomial. The first row of C then defines a different polynomial q, and the roots of p are obtained by applying q to the nth roots of unity.
Quadratics. Let’s consider a general quadratic polynomial,
p(t) = t^2 + αt + β.
We also consider a general 2 × 2 circulant
a b b a
The characteristic polynomial of C is
det
t − a −b −b t − a
= t^2 − 2 at + a^2 − b^2.
The roots of the original quadratic are now found by applying q to the two square roots of unity:
q(1) =
−α 2
α^2 4 −^ β q(−1) =
−α 2
α^2 4 −^ β.
Cubics. A parallel analysis works for cubic polynomials.
We first notice that by simple algebra, for p(x) = xn^ + αn− 1 xn−^1 + · · · + α 1 x + α 0 , the substitution y = x − αn− 1 /n eliminates the term of degree n − 1.
Therefore, we only need to consider cubic polynomials of the form
p(t) = t^3 + βt + γ.
For a general 3 × 3 circulant matrix
a b c c a b b c a
we want to find a, b, and c so that p is the characteristic polynomial of C.