Distance Matrix Methods-Phylogentic Analysis-Lecture Slides, Slides of Phylogenetics

This lecture was delivered by Neela Velhu at Acharya Nagarjuna University for Phylogentic Analysis course. It includes: Distance, Matrix, Methods, UPGMA, Fitch, Neighbor, Margoliash, Iterative, Pair, Alignment

Typology: Slides

2011/2012

Uploaded on 07/11/2012

devaki
devaki 🇮🇳

4.3

(23)

122 documents

1 / 18

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Distance Matrix Methods
1 4
3 2 5
1 4 2 3 5
Docsity.com
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12

Partial preview of the text

Download Distance Matrix Methods-Phylogentic Analysis-Lecture Slides and more Slides Phylogenetics in PDF only on Docsity!

Distance Matrix Methods

(^1 )

(^3 2 )

1 4 2 3 5

Distance matrix

  • A distance matrix is calculated from the sequence dataset
  • Algorithms: Fitch-Margoliash, Neighbor-Joining or UPGMA in tree building
  • Simple, finds only one tree
  • Somewhat old-fashioned (OK if your alignment is good and evolutionary distances are short)

Iterative Pair-Wise Alignment

 Compute score for each pair of alignment

 Merge two string with the minimum distance

between them.

 Successively merge in a string with the smallest

distance from any of the strings already in MSA.

 Use spanning tree algorithm

4

Iterative Pair-Wise Alignment

 Implementation:

 Represent strings as vertices of graph.

 Represent distance between vertices as edges

with weight corresponding to the distance

between them.

 Spanning Tree:

 A tree developed from a graph which connects

all the vertices and has no loop.

5

S 1 = ATGC-T-C S 4 = ATGCATGC D ( S 1 , S 4 ) = 2

S 2 = ATG-A-GC S 4 = ATGCATGC D ( S 2 , S 4 ) = 2

S 2 = ATGAGC S 3 = TTCTG- D ( S 2 , S 3 ) = 4

S 3 = -TTC-TG- S 4 = ATGCATGC D ( S 3 , S 4 ) = 4

 Distance matrix D

4

3

2

1

1 2 3 4

4

4 2

2 3 2

S

S

S

S

S S S S

  • Step 2: Find the minimal spanning tree based on matrix D From the Graph.

4

3

2

1

1 2 3 4

4

4 2

2 3 2

S

S

S

S

S S S S

UPGMA

 Abbreviation of “Unweighted Pair Group Method

with Arithmetic Mean”

 Originally developed for numeric taxonomy in

1958 by Sokal and Michener

 Simplest algorithm for tree construction, so it's fast!

10

How to construct a tree with UPGMA?

 Prepare a distance matrix

 Repeat step 1 and step 2 until there are only two

clusters

 Step 1:

Cluster a pair of leaves (taxa) by shortest distance

 Step 2:

Recalculate a new average distance with the new

cluster and other taxa, and make a new distance

matrix

11

  1. Phylogenetic Tree Construction

example (UPGMA algorithm)

  1. Pick smallest entry Dij
  2. Join the two intersecting species and assign branch lengths Dij/2 to each of the nodes

Dij Bear^ Raccoon^ Weasel^ Seal Bear - 0.26 0.34 0. Raccoon - 0.42 0. Weasel - 0. Seal -

Bear Raccoon 0.13 0.

UPMGA (Michener & Sokal 1957)

  1. Phylogenetic Tree Construction

example (UPGMA algorithm)

Dij Bear^ Raccoon^ Weasel^ Seal Bear - 0.26 0.34 0. Raccoon - 0.42 0. Weasel - 0. Seal -

  1. Compute new distances to the other species using arithmetic means

  2. 365 2

  3. 29 0. 44 2

2 0.^3420.^420.^38 ( )

( )

    

    

S BR SB^ SR

W BR WB WR

D D^ D

D D D

Bear Raccoon 0.13 0.

  1. Phylogenetic Tree Construction

example (UPGMA algorithm)

Dij BR^ Weasel^ Seal

BR - 0.38 0. Weasel - 0. Seal -

  1. Compute new distances to the other species using arithmetic means

  2. 4 3

  3. 34 0. 42 0. 44 ( ) 3 DW BRSDWBDWR ^ DWS    

Bear Raccoon Seal

0.1825 0.

  1. Phylogenetic Tree Construction

example (UPGMA algorithm)

Dij BRS^ Weasel

BRS - 0. Weasel -

  1. Pick smallest entry Dij.
  2. Join the two intersecting species and assign branch lengths Dij/2 to each of the nodes.
  3. Done!

Bear Raccoon Seal Weasel 0.13 0. 0.2 0.