




















Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Bio Informatics NCBI Gene Alignment
Typology: Exams
1 / 28
This page cannot be seen from the preview
Don't miss anything!





















Dot matrix method (Dot plot method)
This is a graphical way of comparing two sequences in a two dimensional matrix.
The two sequences to be compared are written in the horizontal and vertical axes
of the matrix.
Comparison is then done by scanning each residue of one sequence for similarity
with all residues in the other sequence.
If a residue match is found, a dot is placed within the graph.
When the two sequences have substantial regions of similarity, many dots line up
to form contiguous diagonal lines, which reveal the sequence alignment.
Interruptions in the middle of a diagonal line indicate insertions or deletions.
Parallel diagonal lines within the matrix represent repetitive regions of the
sequences
>BAG32276.
MANLGCWMLVLFVATWSDLGLCKKRPKPGGWNTGGSRYPGQGSPGGNRYPPQGGGGWGQPHGGGWGQPHG
GGWGQPHGGGWGQPHGGGWGQGGGTHSQWNKPSKPKTNMKHMAGAAAAGAVVGGLGGYMLGSAMSRPIIH
FGSDYEDRYYRENMHRYPNQVYYRPMDEYSNQNNFVHDCVNITIKQHTVTTTTKGENFTETDVKMMERVV
EQMCITQYERESQAYYQRGSSMVLFSSPPVILLISFLIFLIVG
>ABP65297.
MVKSHIGSWLLVLFVATWSDIGFCKKWPKPGGGWNTGGSRYPGQGSPGGNRYPPQGGGGWGQPHGGGWGQ
PHGGGWGQPHGGGWGQPHGGGGWGQGGGSHGQWGKPSKPKTNMKHVAGAAAAGAVVGGLGGYMLGSAMSR
PLIHFGNDYEDRYYRENMYRYPNQVYYKPVDQYSNQNNFVHDCVNITVKQHTVTTTTKGENFTETDMKIM
ERVVEQMCVTQYQRESEAYYQKGASAILFSPPPVILLISLLILLIVG
5
Smith–Waterman algorithm
This is is used for local alignment and it is considered a database search
algorithm.
This is because, the divergence level between the sequences to be aligned is not
easily known and sequence lengths are usually unequal.
Thus, identification of regional sequence similarity is of greater significance
than finding a match that includes all residues.
Positive scores are assigned for matching residues, zero for mismatches and
negative scores are not used.
Gaps are inserted where necessary.
The local alignment path may begin and end internally and gets the highest
score at the expense of the overall score for a full-length alignment.
Thus approach is used for aligning divergent sequences or sequences with
multiple domains of different origins.
Variants of BLAST
Nucleotide BLAST (BLASTN) queries nucleotide sequences with a nucleotide
sequence database.
Protein BLAST (BLASTP) uses protein sequences as queries to search against
a protein sequence database.
BLASTX uses nucleotide sequences as queries and translates them in all six
reading frames to produce translated protein sequences, which are used to query a
protein sequence database