

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An assignment for a computer science course focusing on dna sequence alignment using dynamic programming. The assignment includes implementing a local alignment algorithm with affine gap penalty for two input dna sequences, experimenting with the program, and performing word-based alignment. Students are required to write a program that takes input fasta files and a substitution matrix, and outputs the optimal local alignment with a statement of the score and alignment positions. For word-based alignment, students must write down the recurrences and initialization for a dynamic programming algorithm, and implement the algorithm to find the optimal alignment of two given sequences.
Typology: Assignments
1 / 3
This page cannot be seen from the preview
Don't miss anything!


Assignment 2 : Due on Tuesday, October 7, by 11:59 PM Note : Solution to the programming questions should be emailed to the instructor. See below for how to create a subdirectory for each problem in the assignment. Your email should contain the location of these sub-directories on the csil systems. Note : See the course web page for the late turn-in policy. Also note that you may request a 3-day extension at most once in the semester, by informing the instructor by e-mail. Note : Collaboration policy for this assignment – you may discuss the assignment with at most one other student. See “Basic Information” link on the course web page for more details. (In short, you may discuss, but must do the assignment on your own.)
In other words, if the alignment length is greater than the width of your output (say 60 characters, for easy readability), the alignment should be split over multiple lines with a clear spacing between consecutive fragments of the alignment. The program should have a reasonable level of commenting, for the instructor to understand the flow at a high level. In the directory that you submit for this problem (call it “problem1”), create a text file named “README” that has clear instructions on how to run (and if needed compile) the code. Example of a nucleotide substitution matrix: A C G T A 91 - 114 - 31 - 123 C - 114 100 - 125 - 31 G - 31 - 125 100 - 114 T - 123 - 31 - 114 91