

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
A project for computer science students where they are required to write software that constructs a suffix tree from a given dna string and identifies the longest substring match between the given dna string and a search pattern. The project includes input and output formats, grading criteria, and additional notes.
Typology: Study Guides, Projects, Research
1 / 3
This page cannot be seen from the preview
Don't miss anything!


Handed out: 2/26/ Due: 3/25/ For this project you will have to write software that builds a suffix tree from a DNA string (T) provided in the input, outputs the suffix tree you constructed, and identifies the longest substring of a second DNA string (P) that matches a substring of T. Deliverables: Source code Requirements:
Input formats The inputs will be provided in FASTA format (see http://en.wikipedia.org/wiki/Fasta_format) with the following assumptions:
Text ACAGGTAGCAGGGAC CATGACCAGGGCTGC GAC
Output format The output of your program should consist of a line like this: