

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Assignment; Professor: Provan; Class: FYS UNLOCKING THE CODE; Subject: STATISTICS AND OPERATIONS RESEARCH; University: University of North Carolina - Chapel Hill; Term: Unknown 1989;
Typology: Assignments
1 / 2
This page cannot be seen from the preview
Don't miss anything!


In Michael Crichton's Jurassic Park (p. 103), a putative dinosaur DNA sequence is given. What is the nearest match in the database to this sequence? Is Crichton pulling one over on us? In the output screen, scroll down to the first diagram of the first match (the one with the letter pairings separated by | ). Do you see anything unusual about the pattern of mismatches? Extra credit for the correct interpretation of this odd match. ( Hint : The sequence is formatted exactly the way it appears in the book. Further, the mismatches have nothing to do with biology or BLAST .) GCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGC GGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCG TGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGC TGCTCACGCTGTACCTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTG CCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAA AGTAGGACAGGTGCCGGCAGCGCTCTGGGTCATTTTCGGCGAGGACCGCTTTCGCTGGAG ATCGGCCTGTCGCTTGCGGTATTCGGAATCTTGCACGCCCTCGCTCAAGCCTTCGTCACT CCAAACGTTTCGGCGAGAAGCAGGCCATTATCGCCGGCATGGCGGCCGACGCGCTGGGCT GGCGTTCGCGACGCGAGGCTGGATGGCCTTCCCCATTATGATTCTTCTCGCTTCCGGCGG CCCGCGTTGCAGGCCATGCTGTCCAGGCAGGTAGATGACGACCATCAGGGACAGCTTCAA CGGCTCTTACCAGCCTAACTTCGATCACTGGACCGCTGATCGTCACGGCGATTTATGCCG CACATGGACGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAA CAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAA GCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGG CTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTG ACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCA ACACGACTTAACGGGTTGGCATGGATTGTAGGCGCCGCCCTATACCTTGTCTGCCTCCCC GCGGTGCATGGAGCCGGGCCACCTCGACCTGAATGGAAGCCGGCGGCACCTCGCTAACGG CCAAGAATTGGAGCCAATCAATTCTTGCGGAGAACTGTGAATGCGCAAACCAACCCTTGG CCATCGCGTCCGCCATCTCCAGCAGCCGCACGCGGCGCATCTCGGGCAGCGTTGGGTCCT mystery sequence#3: this is a protein sequence: MAHETSFNDA LDYIYIANSM NDRAFLIAEP HPEQPNVDGQ DQDDAELEEL DDMAVTDDGQ LEDTNNNNNS KRYYSSGKRR ADFIGSLALK PPPTDVNTTT TTAGSPLATA ALAAAAASAS VAAAAARITA KAAHRALTTK QDATSSPASS PALQLIDMDN NYTNVAVGLG AMLLNDTLLL EGNDSSLFGE MLANRSGQLD LINGTGGLNV TTSKVAEDDF TQLLRMAVTS VLLGLMILVT IIGNVFVIAA IILERNLQNV ANYLVASLAV ADLFVACLVM PLGAVYEISQ GWILGPELCD IWTSCDVLCC TASILHLVAI AVDRYWAVTN IDYIHSRTSN RVFMMIFCVW TAAVIVSLAP QFGWKDPDYL QRIEQQKCMV SQDVSYQVFA TCCTFYVPLL VILALYWKIY QTARKRIHRR RPRPVDAAVN NNQPDGGAAT DTKLHRLRLR LGRFSTAKSK TGSAVGVSGP ASGGRALGLV DGNSTNTVNT VEDTEFSSSN VDSKSRAGVE APSTSGNQIA TVSHLVALAK QQGKSTAKSS AAVNGMAPSG RQEDDGQRPE HGEQEDREEL EDQDEQVGPQ PTTATSAMTA AGTNESEDQC KANGVEVLED PQLQQQLEQV QQLQKSVKSG GGGGASTSNA TTITSISALS PQTPTSQGVG IAAAAAGPMT AKTSTLTSCN QSHPLCGTAN ESPSTPEPRS RQPTTPQQQP HQQAHQQQQQ QQQLSSIANP MQKVNKRKET LEAKRERKAA KTLAIITGAF VVCWLPFFVM ALTMPLCAAC QISDSVASLF LWLGYFNSTL NPVIYTIFSP EFRQAFKRIL FGGHRPVHYR SGKL Also find the nearest match to sequence#3 among humans and among rats. In each case, give the E value of the match.