Introduction to Coding Theory: Problem Set 3 - Spring 2010, Summaries of Number Theory

15-859V: Introduction to coding theory. Spring 2010. Carnegie Mellon University. Venkatesan Guruswami. PROBLEM SET 3. Due by Friday, April 23. INSTRUCTIONS.

Typology: Summaries

2022/2023

Uploaded on 05/11/2023

newfound
newfound 🇨🇦

4.5

(13)

362 documents

1 / 4

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
15-859V: Introduction to coding theory Spring 2010
Carnegie Mellon University Venkatesan Guruswami
PROB LE M SET 3
Due by Friday, April 23
INSTRUCTIONS
This problem sets can be turned in groups of two people; i.e., a single write-up for each two person
team suffices. If you prefer, you can also work alone (see the last bullet item for some “credit” for
doing so). Solutions typeset in L
A
T
E
X are preferred.
You are strongly urged to try and solve the problems without consulting any reference material other
than the course notes and what we cover in class. If for some reason you feel the need to consult
some source (such as a textbook, research paper, or sources on the web), you must acknowledge the
source and also articulate the difficulty you couldn’t overcome before consulting the source and how
it helped you overcome that difficulty. Alternatively, before consulting any such material, I encourage
you to ask me for a hint, preferably by posting a comment on the blog post dedicated to this problem
set, so all students can take advantage of any hints.
Please use the comments section of the blog for any questions or clarifications about the problems.
Please start work on the problem set early. The problem set has seven problems and is worth a total of
100 points. As a rough estimate, a score of 70 (or 60 if you work and turn in solutions solo), suffices
for an A on this problem set.
1. (12 points) In this exercise, you are asked to show that good codes can be used to construct expanders
of logarithmic degree.
Let GFn×k
2be the generator matrix of an [n, k]2binary linear code with the property that every
nonzero codeword of Chas Hamming weight in the range [(1/2)n, (1/2 + )n](such a code is
usually called -biased). Define a graph H= (Fk
2, E)where (x, y )Eif x+yequals one of the
nrows of G(if some row occurs multiple times, place an edge of appropriate multiplicity between
x, y).
Prove that the second largest eigenvalue of the adjacency matrix of Hin absolute value is at most
2. Use this to deduce the existence of spectral expanders (with second largest eigenvalue in absolute
value bounded away from the degree) of logarithmic degree.
2. (10 points) Consider the binary expander code based on an unbalanced bipartite (n, m, D, γ, D (1
))-expander as defined in lecture (i.e., the code whose parity check matrix is the bipartite adjacency
matrix of the expander) for some < 1/20. Recall that in an (n, m, D, γ , D(1 ))-expander, every
subset Sof up to γn nodes on the left has at least D(1 )|S|neighbors on the right. In this exercise
you are asked to analyze the following parallel iterative decoder.
For clog nrounds (for a constant cchosen large enough), do the following in parallel for
each variable node: If the variable is in at least 2D/3unsatisfied checks, flip its value.
Prove that the above algorithm corrects any pattern of γ(1 3)nerrors.
pf3
pf4

Partial preview of the text

Download Introduction to Coding Theory: Problem Set 3 - Spring 2010 and more Summaries Number Theory in PDF only on Docsity!

15-859V: Introduction to coding theory Spring 2010 Carnegie Mellon University Venkatesan Guruswami

PROBLEM SET 3 Due by Friday, April 23

INSTRUCTIONS

  • This problem sets can be turned in groups of two people; i.e., a single write-up for each two person team suffices. If you prefer, you can also work alone (see the last bullet item for some “credit” for doing so). Solutions typeset in LATEX are preferred.
  • You are strongly urged to try and solve the problems without consulting any reference material other than the course notes and what we cover in class. If for some reason you feel the need to consult some source (such as a textbook, research paper, or sources on the web), you must acknowledge the source and also articulate the difficulty you couldn’t overcome before consulting the source and how it helped you overcome that difficulty. Alternatively, before consulting any such material, I encourage you to ask me for a hint, preferably by posting a comment on the blog post dedicated to this problem set, so all students can take advantage of any hints.
  • Please use the comments section of the blog for any questions or clarifications about the problems.
  • Please start work on the problem set early. The problem set has seven problems and is worth a total of 100 points. As a rough estimate, a score of 70 (or 60 if you work and turn in solutions solo), suffices for an A on this problem set.
  1. ( 12 points) In this exercise, you are asked to show that good codes can be used to construct expanders of logarithmic degree. Let G ∈ Fn 2 ×kbe the generator matrix of an [n, k] 2 binary linear code with the property that every nonzero codeword of C has Hamming weight in the range [(1/ 2 − )n, (1/2 + )n] (such a code is usually called -biased). Define a graph H = (Fk 2 , E) where (x, y) ∈ E if x + y equals one of the n rows of G (if some row occurs multiple times, place an edge of appropriate multiplicity between x, y). Prove that the second largest eigenvalue of the adjacency matrix of H in absolute value is at most 2 . Use this to deduce the existence of spectral expanders (with second largest eigenvalue in absolute value bounded away from the degree) of logarithmic degree.
  2. ( 10 points) Consider the binary expander code based on an unbalanced bipartite (n, m, D, γ, D(1 − ))-expander as defined in lecture (i.e., the code whose parity check matrix is the bipartite adjacency matrix of the expander) for some  < 1 / 20. Recall that in an (n, m, D, γ, D(1 − ))-expander, every subset S of up to γn nodes on the left has at least D(1 − )|S| neighbors on the right. In this exercise you are asked to analyze the following parallel iterative decoder.

For c log n rounds (for a constant c chosen large enough), do the following in parallel for each variable node: If the variable is in at least 2 D/ 3 unsatisfied checks, flip its value.

Prove that the above algorithm corrects any pattern of γ(1 − 3 )n errors.

  1. ( 15 points) Consider the Tanner code T (H, C 0 ) considered in lecture, where H = (L, R, E) is a d-regular n × n bipartite expander with second largest eigenvalue λ, and C 0 ∈ Fd 2 is a linear code of distance δ 0 d. We proved that T (H, C 0 ) has relative distance at least δ 0 (δ 0 − λ/d), and gave an iterative algorithm to correct a fraction (1 − ) δ 40 (δ 0 − λ/d) errors. The goal of this exercise is to improve the number of corrected errors to half the (designed) distance. Consider the following modification to the iterative algorithm based on alternate rounds of left and right side decoding discussed in lecture. For each threshold t ∈ { 0 , 1 , 2 ,... , δ 0 d/ 2 , run the following algorithm. In the first round of left side decoding with received word y, for any node u ∈ L such that y|E(u) is not within distance t of some codeword of C 0 , declare erasures on all edges in E(u), and decode the rest to the closest codeword in C 0. This gives a string z ∈ { 0 , 1 , ?}E^. Then on the right side, run an errors-an-erasure decoder at each node v ∈ R replacing z|E(v) with the codeword with smallest Hamming distance on the unerased positions (breaking ties arbitrarily). This gives a string w ∈ { 0 , 1 }E^. Now run the iterative decoding algorithm we discussed in lecture on the string w for c log n rounds for a suitable constant c. Prove that, for any desired  > 0 , the above algorithm, for a large enough choice of the constant c, corrects up to a fraction (1 − ) δ 20

δ 0 − (^2) dλ

of errors.

  1. ( 10 points) In this exercise, the goal is to show that a variant of the expander-based Tanner codes T (H, C 0 ) can be be used to achieve the capacity on the binary symmetric channel. For a n × n d-regular expander H = (L, R, E) as above, and binary linear codes C 0 , C 1 of block length d, define the code T (H, C 0 , C 1 ) to consist of those strings z ∈ { 0 , 1 }E^ such that z|E(u) ∈ C 0 for each u ∈ L and z|E(v) ∈ C 1 for each v ∈ R. Show that for any desired p ∈ (0, 1 /2) and  > 0 , with suitable choice of d and the codes C 0 , C 1 , one can construct a family of codes T (H, C 0 , C 1 ) with rate 1 − h(p) −  together with an O(N log N ) time decoding algorithm for communicating on BSCp with error probability at most 2 −cp,N^ for some constant cp, > 0.
  2. ( 10 points)

(a) (Due to Salil Vadhan) Let  > 0 be a sufficiently small real. Suppose that C ⊆ { 0 , 1 }n^ is a code of relative distance at least 1 / 3 and rate at most a^2 for some a > 0. Suppose a codeword c ∈ C is transmitted on BSCp for p = 1/ 2 − , and we receive r ∈ { 0 , 1 }n. Prove that if a is a small enough constant (independent of n, ), then with all but exponentially small probability over the errors, c will be the unique codeword within Hamming distance (1 − )n/ 2 from r. (b) Extra credit question: (For your fun only; No need to turn anything in, and I think the question might still be open.) Can one deduce the same conclusion without assuming the upper bound on rate, but instead based on the hypothesis C is list-decodable up to a fraction (1/ 2 − /3) of errors (with lists of size poly(1/), say), and has relative distance at least (1/ 2 − /3)?

  1. (4 + 8 + 8 = 20 points) In this problem, we will consider the number-theoretic counterpart of Reed- Solomon codes. Let 1 6 k < n be integers and let p 1 < p 2 < · · · < pn be n distinct primes. Denote K =

∏k i=1 pi^ and^ N^ =^

∏n i=1 pi. The notation^ ZM^ stands for integers modulo^ M^ , i.e., the set { 0 , 1 ,... , M − 1 }. Consider the Chinese Remainder code defined by the encoding map E : ZK → Zp 1 × Zp 2 × · · · × Zpn defined by: E(m) = (m mod p 1 , m mod p 2 , · · · , m mod pn). (Note that this is not a code in the usual sense we have been studying since the symbols at different positions belong to different alphabets. Still notions such as distance of this code make sense and are studied in the questions below.)

(c) Let n = p^3. Consider the evaluation map ev : Fq[X, Y ] → Fnq defined by

ev(f ) = (f (α, β) : (α, β) ∈ S).

Argue that if f 6 = 0 and is not divisible by Y p^ + Y − Xp+1, then ev(f ) has Hamming weight at least n − deg(f )(p + 1), where deg(f ) denotes the total degree of f. (Hint: You are allowed to make use of B´ezout’s theorem, which states that if f, g ∈ Fq[X, Y ] are nonzero polynomials with no common factors, then they have at most deg(f )deg(g) common zeroes.)

(d) For an integer parameter > 1 , consider the set F of bivariate polynomials

F= {f ∈ Fq[X, Y ] | deg(f ) 6, degX (f ) 6 p}

where degX (f ) denotes the degree of f in X. Argue that Fis an Fq-linear space of dimension ( + 1)(p + 1) − p(p 2 +1). (e) Consider the code C ⊆ Fnq for n = p^3 defined by

C = {ev(f ) | f ∈ F`}.

Prove that C is a linear code with minimum distance at least n − `(p + 1). (f) Deduce a construction of an [n, k]q code with distance d > n − k + 1 − p(p − 1)/ 2. (Remark: Reed-Solomon codes have d = n − k + 1, whereas these codes are off by p(p − 1)/ 2 from the Singleton bound. However they are much longer than RS codes, with a block length of n = q^3 /^2 , and the deficiency from the Singleton bound is only o(n).)