

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Material Type: Quiz; Professor: Olsen; Class: Computing in Molecular Biology; Subject: Molecular and Cell Biology; University: University of Illinois - Urbana-Champaign; Term: Spring 2010;
Typology: Quizzes
1 / 2
This page cannot be seen from the preview
Don't miss anything!


Data for the Assignment This assignment explores some basic BLAST analyses using the NCBI WWW Server. A summary of some elements can be found at: http://www.ncbi.nlm.nih.gov/blast/about/. Pay attention to the details. In particular, read the entire BLAST Forms section of the page. How you use BLAST (or any other tool) matters. Data for the Assignment The data for this assignment are available in electronic form from the class WWW server. The data are in the Assignment 04 directory:http://www.life.uiuc.edu/mcb/432/Assignments/04/, or you can find links to the file on the Schedule page: http://www.life.uiuc.edu/mcb/432/Schedule.html The file pep1.txt is a plain text files with a peptide sequence in FASTA format.^1 Your goal is to identify the function of the protein, and what organism it is from. Using BLAST to Find Sequences Similar to a Protein Go to the NCBI BLAST Home page: http://blast.ncbi.nlm.nih.gov/Blast.cgi. Our first step in our quest to identify pep1 and its source organism is to use the blastp program to search for similar sequences in the nr (non-redundant) protein database. Follow the link from the BLAST Home page to "protein blast".
Page 2 Assignment 4 Name __key________________ When this is all verified, click any of the BLAST buttons. Eventually you should get the "Formatting Results" page with the BLAST results. It is a good idea to take a look at the page to see that it makes sense for what you thought you were trying to do. 2a. What is the RID (Request ID) for your search?^3 The RID allows you to access old search results. 1 pt S4_________ – SX_________ 2b. What was the length of the query sequence? 1 pt 132 2c. What is the number of sequences in the database searched?^4 1 pt 10,464,191 – 10,530, 2d. What is the number of amino acids (letters) in the database searched? 1 pt 3,569,800,998 – 3,591,264, 2e. What was the average length of the sequences in the database searched? (Calculate from the above values.) 1 pt 341 2f. For the highest-scoring (best) database sequence match, what is the score (in bits)? 1 pt 262 2g. The significance of a BLAST match to a database entry is summarized by the E -value (Expect). In words, the E -value is the expected number of times that a random query sequence would align with database sequences with a score greater than for the given match. For the highest-scoring database sequence match, what is the E -value? 1 pt 8e-69 or 8 x 10–69^ [but not 8e–69^ or 8 x e–69] 2h. How many database sequences have similarity to the query sequence with an E -value less than or equal to 10–5? 1 pt 500 or >500 or 1440– 2i. Based on the descriptions of the 10–20 highest-scoring sequences, how would you describe the function of the query protein? (That is, what would you call it if you wanted someone to understand the cellular role of the protein?) 3 pt 1 pt for [optional mitochondrial] 50S or 54S or mitochondrial 60S or mitochondrial large subunit 1 pt for ribosomal protein 1 pt for L14 or L 2j. What is the percent sequence identity of the query sequence to the highest scoring sequence that has the description you used to answer 2i? 1 pt varies over 50–60% 2k. From what organism does the genome with the highest-scoring sequence come? 1 pt Ustilago maydis 521 2l. To which domain of life (NCBI calls it a superkingdom) does this organism belong? 1 pt Eukaryota or Eucarya [not with Fungi included, as in Eukaryota; Fungi] (^3) Near the top for the results page, to the right of "NCBI/ BLAST/ blastp suite/ Formatting Results -". (^4) Look at "Other reports: Search Summary", just above "Graphic Summary".