Sequence Assembly

Abstract

Any shotgun DNA sequencing effort involves a first phase in which short, contiguous samples of a long DNA molecule are generated. These samples must be assembled afterwards to yield the original sequence in a process called sequence assembly.

Keywords: contig; gap; scaffold; consensus; shotgun

Figure 1.

A semiglobal alignment, which is used to detect overlap between fragments.

close

References

Batzoglou S, Jaffe DB, Stanley K, et al. (2002) ARACHNE: a whole‐genome shotgun assembler. Genome Research 12: 177–189.

Ewing B and Green P (1998) Base‐calling of automated sequencer traces using phred: II. Error probabilities. Genome Research 8: 186–194.

Ewing B, Hillier L, Wendl MC and Green P (1998) Base‐calling of automated sequencer traces using phred: I. Accuracy assessment. Genome Research 8: 175–185.

Gordon D, Abajian C and Green P (1998) Consed: a graphical tool for sequence finishing. Genome Research 8: 195–202.

Huang X and Madan A (1999) CAP3: a DNA sequence assembly program. Genome Research 9: 868–877.

Lander E and Waterman MS (1988) Genomic mapping by fingerprinting random clones: a mathematical analysis. Genomics 2: 231–239.

Maxam AM and Gilbert W (1977) A new method for sequencing DNA. Proceedings of the National Academy of Sciences of the United States of America 74: 560–564.

Myers EW, Sutton GG, Delcher AL, et al. (2000) A whole‐genome assembly of Drosophila. Science 287: 2196–2204.

Pevzner PA, Tang H and Waterman MS (2001) An Eulerian path approach to DNA fragment assembly. Proceedings of the National Academy of Sciences of the United States of America 98: 9748–9753.

Roach JC (1995) Random subcloning. Genome Research 5: 464–473.

Sanger F, Nicklen S and Coulson AR (1977) DNA sequencing with chain terminating inhibitors. Proceedings of the National Academy of Sciences of the United States of America 74: 5463–5467.

Setubal JC and Meidanis J (1997) Introduction to Computational Molecular Biology. Boston, MA: PWS.

Staden R, Judge DP and Bonfield JK (2001) Sequence assembly and finishing methods. In: Baxevanis DA and Ouellette BFF (eds.) Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, 2nd edn. New York, NY: John Wiley & Sons.

Sutton G, White O, Adams M and Kerlavage A (1995) TIGR Assembler: a new tool for assembling large shotgun sequencing projects. Genome Science and Technology 1: 9–19.

Further Reading

Pevzner P (2001) Computational Molecular Biology: An Algorithmic Approach, chaps 4 and 5, pp. 93–132. Cambridge, MA: MIT Press.

Setubal JC and Meidanis J (1997) Introduction to Computational Molecular Biology, chap. 4, pp. 47–104. Boston, MA: PWS.

Waterman MS (1995) Introduction to Computational Biology: Maps, Sequences, and Genomes, chap. 7, pp. 183–252. Cambridge, UK: Chapman & Hall.

Contact Editor close
Submit a note to the editor about this article by filling in the form below.

* Required Field

How to Cite close
Meidanis, Joao(Sep 2005) Sequence Assembly. In: eLS. John Wiley & Sons Ltd, Chichester. http://www.els.net [doi: 10.1038/npg.els.0005261]