Molecular Phylogeny Reconstruction


Molecular phylogenetics deals with the inference of evolutionary relationships among individuals, populations, species and higher taxonomic entities using molecular data. By modelling patterns of molecular change in protein and deoxyribonucleic acid (DNA) sequences over time, scientists now routinely reconstruct evolutionary histories of species and evaluate confidence levels of the inferences. Molecular phylogenetic inferences have been not only supportive of traditional phylogenies, but also instrumental in resolving some difficult questions regarding branching orders within many evolutionary lineages. Because of the vast and growing databases of molecular sequence information, this area promises to be an important key to understanding the history and relationships of all life forms on this planet.

Keywords: phylogeny; molecular evolution; sequence analysis; bioinformatics

Figure 1.

An alignment of a portion of the γ‐fibrinogen gene sequence from five mammals. Insertion–deletion mutations predicted by sequence alignment are shown with hyphens (‐) and the missing data is shown with question marks (?).

Figure 2.

Rooted (a) and unrooted (b) tree of five sequences. Branch lengths are drawn proportional to evolutionary distance, which can be expressed in the units of time or the number of substitutions.



Alfaro ME and Holder MT (2006) The posterior and the prior in Bayesian phylogenetics. Annual Review of Ecology, Evolution, and Systematics 37: 19–42.

Baldauf SL and Palmer JD (1993) Animals and fungi are each other's closest relatives: congruent evidence from multiple proteins. Proceedings of the National Academy of Sciences of the USA 90: 11558–11562.

Bergsten J (2005) A review of long‐branch attraction. Cladistics 21: 163–193.

Dopazo H and Dopazo J (2005) Genome‐scale evidence of the nematode‐arthropod clade. Genome Biology 6: R41.

Douady CJ, Delsuc F, Boucher Y, Doolittle WF and Douzery EJP (2003) Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability. Molecular Biology and Evolution 20: 248–254.

Drummond AJ and Rambaut A (2006) BEAST v1.4, available from

Felsenstein J (1981) Evolutionary trees from DNA sequences: a maximum likelihood approach. Journal of Molecular Evolution 17: 368–376.

Felsenstein J (1985) Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39: 783–791.

Felsenstein J (1988) Phylogenies from molecular sequences: inference and reliability. Annual Reviews in Genetics 22: 521–565.

Felsenstein J (1993) PHYLIP (phylogeny inference package). Version 3.6a. Distributed by the author, Department of Genetics, University of Washington, Seattle.

Felsenstein J (2004) Inferring Phylogenies. Sunderland, MA: Sinauer Associates.

Fitch W (1971) Toward defining the course of evolution: minimum change for a specific tree topology. Systematic Zoology 20: 406–416.

Gatesy JC (1997) More DNA support for the Cetacea/Hippopotamidae clade: the blood‐clotting protein gene gamma‐fibrinogen. Molecular Biology and Evolution 14: 537–543.

Guindon S and Gascuel O (2003) A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Systematic Biology 52: 696–704.

Hall BG (2005) Comparison of the accuracies of several phylogenetic methods using protein and DNA sequences. Molecular Biology and Evolution 22: 792–802.

Hall B (2007) Phylogenetic Trees Made Easy, 3rd edn. Sunderland, MA: Sinauer Associates.

Higgins DG, Thompson JD and Gibson TJ (1996) Using CLUSTAL for multiple sequence alignments. Methods in Enzymology 266: 383–402.

Hordijk W and Gascuel O (2005) Improving the efficiency of SPR moves in phylogenetic tree search methods based on maximum likelihood. Bioinformatics 21: 4338–4347.

Huelsenbeck JP, Ronquist F, Nielsen R and Bollback J (2001) Bayesian inference of phylogeny and its impact on evolutionary biology. Science 294: 2310–2314.

Kosakovsky Pond SL, Frost SD and Muse SV (2005) HyPhy: hypothesis testing using phylogenies. Bioinformatics 21: 676–679.

Kumar S and Dudley J (2007) Bioinformatics software for biologists in the genomics era. Bioinformatics 23: 1713–1717.

Kumar S and Filipski A (2007) Multiple sequence alignment: in pursuit of homologous DNA positions. Genome Research 17: 127–135.

Kumar S, Tamura K and Nei M (2004) MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Briefings in Bioinformatics 5: 150–163. [].

Liolios K, Tavernarakis N, Hugenholtz P and Kyrpides NC (2006) The genomes on line database (GOLD) v.2: a monitor of genome projects worldwide. Nucleic Acids Research 34: D332–D334.

Miyamoto MM and Cracraft J (1991) Phylogenetic analysis of DNA sequences. New York: Oxford University Press.

Murphy WJ, Pringle TH, Crider TA, Springer MS and Miller W (2007) Using genomic data to unravel the root of the placental mammal phylogeny. Genome Research 17: 413–421.

Nei M (1987) Molecular Evolutionary Genetics. New York: Columbia University Press.

Nei M and Kumar S (2000) Molecular Evolution and Phylogenetics. New York: Oxford University Press.

Paraskevis DP, Lemey P, Salemi M et al. (2003) Analysis of the evolutionary relationships of HIV‐1 and SIVcpz sequences using Bayesian inference: implications for the origin of HIV‐1. Molecular Biology and Evolution 20: 1986–1996.

Putnam NH, Srivastava M, Hellsten U et al. (2007) Sea anemone genome reveals ancestral eumetazoan gene repertoire and genomic organization. Science 317: 86–94.

Ronquist F and Huelsenbeck JP (2003) MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19: 1572–1574.

Saitou N and Nei M (1987) The neighbour‐joining method: a new method for reconstructing phylogenetic trees. Molecular Biology and Evolution 6: 514–525.

Satta Y, Klein J and Takahata N (2000) DNA archives and our nearest relative: the trichotomy problem revisited. Molecular Phylogenetics and Evolution 14: 259–275.

Springer MS, Murphy WJ, Eizirik E and O'Brian SJ (2003) Placental mammal diversification and the cretaceous‐tertiary boundary. Proceedings of the National Academy of Sciences of the USA 100: 1056–1061.

Stamatakis A, Ludwig T and Meier H (2005) RAxML‐III: a fast program for maximum likelihood‐based inference of large phylogenetic trees. Bioinformatics 21: 456–463.

Suzuki Y, Glazko GV and Nei M (2002) Overcredibility of molecular phylogenies obtained by Bayesian phylogenetics. Proceedings of the National Academy of Sciences of the USA 99: 16138–16143.

Swofford DL (2001) PAUP*: Phylogenetic Analysis Using Parsimony (and Other Methods) 4.0 Beta. Sunderland, MA: Sinauer Associates.

Takahashi K and Nei M (2000) Efficiencies of fast algorithms of phylogenetic inference under the criteria of maximum parsimony, minimum evolution, and maximum likelihood when a large number of sequences are used. Molecular Biology and Evolution 17: 1251–1258.

Tamura K, Dudley J, Nei M and Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) Software Version 4.0. Molecular Biology and Evolution 24: 1596–1599.

Wolf YI, Rogozin IB and Koonin EV (2004) Coelomata and not Ecdysozoa: evidence from genome‐wide phylogenetic analysis. Genome Research 14: 29–36.

Yang Z (1996) Among‐site rate variation and its impact on phylogenetic analyses. Trends in Ecology and Evolution 11: 367–371.

Yang Z (1997) PAML: a program package for phylogenetic analysis by maximum likelihood. Computer Applications in the Biosciences 13: 555–556.

Yang Z (2006) Computational Molecular Evolution. Oxford: Oxford University Press.

Yang Z and Rannala B (2005) Branch‐length prior influences Bayesian posterior probability of phylogeny. Systematic Biology 54: 455–470.

Further Reading

Durbin R, Eddy S, Krogh A and Mitchison G (1998) Biological Sequence Analysis. Cambridge: Cambridge University Press.

Graur D and Li W‐H (1999) Fundamentals of Molecular Evolution, 2nd edn. Sunderland, MA: Sinauer Associates.

Li W‐H (1997) Molecular Evolution. Sunderland, MA: Sinauer Associates.

Page RDM and Holmes EC (1998) Molecular Evolution: A Phylogenetic Approach. Oxford: Blackwell Science.

Patthy L (1999) Protein Evolution. Oxford: Blackwell Science.

Contact Editor close
Submit a note to the editor about this article by filling in the form below.

* Required Field

How to Cite close
Kumar, Sudhir, and Filipski, Alan(Mar 2008) Molecular Phylogeny Reconstruction. In: eLS. John Wiley & Sons Ltd, Chichester. [doi: 10.1002/9780470015902.a0001523.pub2]