Next Generation Sequencing Technologies and Their Applications

Abstract

The advances in next generation sequencing (NGS) technologies have tremendous impacts on the studies of structural and functional genomics. Sequencing‐based approaches like ChIP‐Seq and RNA‐Seq have started taking the place of microarray experiments to study protein–DNA (deoxyribonucleic acid) interactions and transcriptomic profiling, respectively. The arrival of NGS technologies has also enabled several whole human genome resequencing studies to be completed efficiently at an affordable price. The major strengths of NGS technologies are their ultra high‐throughput production, characterized by their ability to generate several hundred megabases to tens of gigabases of sequencing data per instrument run, and more importantly, the steep reduction in cost compared to the traditional Sanger sequencing method. Hence, NGS technologies have rapidly become the primary choice for large scale as well as genome‐wide sequencing studies. The new sequencing‐based approaches to explore structural and functional genomics have produced important information and significantly expanded our knowledge in these areas.

Key concepts

  • The rapid developments in sequencing technologies have transformed the approaches in the studies of structural and functional genomics.

  • The arrival of next generation sequencing (NGS) technologies has started substituting traditional Sanger sequencing method in many large‐scale or genome‐wide sequencing studies.

  • Shortly after the first next generation sequencer was introduced by Roche® 454 Life Science, the Genome Sequencer 20 (GS 20) System (it was subsequently replaced by GS FLX System), another two biotechnology companies also marketed their sequencing platforms: Illumina® Genome Analyzer (GA) and Applied Biosystems® (ABI) Supported Oligonucleotide Ligation Detection System (SOLiD).

  • The major attractions of NGS technologies are their ultra high‐throughput production, characterized by their ability to produce gigabases of sequencing data per instrument run, and more importantly, the steep reduction in cost compared to the traditional sequencing method.

  • Previously, the molecular genomics studies mainly relied on microarray technologies such as gene expression microarrays and the ChIP‐chip method (i.e. chromatin immunoprecipitation coupled with microarray) for genome‐wide interrogation.

  • The NGS technologies have been used in various research areas besides the standard sequencing applications such as whole genome sequencing; they have also been increasingly applied in detecting structural variations (paired‐end mapping), studies of protein–DNA interactions and histone modifications (ChIP‐Seq) and transcriptomic profiling of messenger RNAs (mRNAs) and noncoding RNAs (RNA‐Seq).

  • Sequencing‐based approaches have already yielded numerous novel and important findings in research areas like genome‐wide mapping of histone modifications and protein–DNA interactions, discovery of genetic variations and transcriptomics studies even though the approaches are still new and maturing.

  • The NGS technologies have shown their potential of being dominant in future genomics studies. This is evident from several international projects using NGS technologies like the ENCODE Project, 1000 Genomes Project and cancers sequencing project by the International Cancer Genome Consortium.

  • It is only a matter of time before achieving the goal of $1000 per whole genome sequencing. This should not be too far from now given the progresses in the development of third generation sequencing technologies.

  • Although the $1000 genome will technically make sequencing of thousands of human genomes a reality, the substantial cost that will be incurred for data storage, powerful computational packages and analytical softwares has to be borne in mind. However, beyond affordability, what are left behind are the bioinformatics challenges in processing and analysing the huge amount of sequencing data.

Keywords: next generation sequencing technologies; massively parallel sequencing; structural and functional genomics; ChIP‐Seq; RNA‐Seq

Figure 1.

Application of next generation sequencing technologies in structural and functional genomics.

close

References

Ahn SM, Kim TH, Lee S et al. (2009) The first Korean genome sequence and analysis: full genome sequencing for a socio‐ethnic group. Genome Research 19: 1622–1629.

Ansorge WJ (2009) Next‐generation DNA sequencing techniques. New Biotechnology 25: 195–203.

Bar M, Wyman SK, Fritz BR et al. (2008) MicroRNA discovery and profiling in human embryonic stem cells by deep sequencing of small RNA libraries. Stem Cells 26: 2496–2505.

Barski A, Cuddapah S, Cui K et al. (2007) High‐resolution profiling of histone methylations in the human genome. Cell 129: 823–837.

Bentley DR, Balasubramanian S, Swerdlow HP et al. (2008) Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456: 53–59.

Campbell PJ, Stephens PJ, Pleasance ED et al. (2008) Identification of somatically acquired rearrangements in cancer using genome‐wide massively parallel paired‐end sequencing. Nature Genetics 40: 722–729.

Carninci P and Hayashizaki Y (2007) Noncoding RNA transcription beyond annotated genes. Current Opinion in Genetics and Development 17: 139–144.

Cooper GM, Zerr T, Kidd JM et al. (2008) Systematic assessment of copy number variant detection via genome‐wide SNP genotyping. Nature Genetics 40: 1199–1203.

Denoeud F, Aury JM, Da Silva C et al. (2008) Annotating genomes with massive‐scale RNA sequencing. Genome Biology 9: R175.

Drmanac R, Sparks AB, Callow MJ et al. (2009) Human genome sequencing using unchained base reads on self‐assembling DNA nanoarrays. Science 327: 78–81.

ENCODE Project Consortium (2007) Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447: 799–816.

Estivill X and Armengol L (2007) Copy number variants and common disorders: filling the gaps and exploring complexity in genome‐wide association studies. PLoS Genetics 3: 1787–1799.

Farnham PJ (2009) Insights from genomic profiling of transcription factors. Nature Reviews. Genetics 10: 605–616.

Flicek P and Birney E (2009) Sense from sequence reads: methods for alignment and assembly. Nature Methods 6: S6–S12.

Gupta PK (2008) Single‐molecule DNA sequencing technologies for future genomics research. Trends in Biotechnology 26: 602–611.

Heap GA, Yang JH, Downes K et al. (2009) Genome‐wide analysis of allelic expression imbalance in human primary cells by high throughput transcriptome resequencing. Human Molecular Genetics 19: 122–134.

Iafrate AJ, Feuk L, Rivera MN et al. (2004) Detection of large‐scale variation in the human genome. Nature Genetics 36: 949–951.

International Human Genome Sequencing Consortium (2004) Finishing the euchromatic sequence of the human genome. Nature 431: 931–945.

Johnson DS, Mortazavi A, Myers RM and Wold B (2007) Genome‐wide mapping of in vivo protein‐DNA interactions. Science 316: 1497–1502.

Kim JI, Ju YS, Park H et al. (2009) A highly annotated whole‐genome sequence of a Korean individual. Nature 460: 1011–1015.

Korbel JO, Urban AE, Affourtit JP et al. (2007) Paired‐end mapping reveals extensive structural variation in the human genome. Science 318: 420–426.

Levy S, Sutton G, Ng PC et al. (2007) The diploid genome sequence of an individual human. PLoS Biology 5: e254.

Ley TJ, Mardis ER, Ding L et al. (2008) DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature 456: 66–72.

Li Y and Wang J (2009) Faster human genome sequencing. Nature Biotechnology 27: 820–821.

Maher CA, Kumar‐Sinha C, Cao X et al. (2009) Transcriptome sequencing to detect gene fusions in cancer. Nature 458: 97–101.

Mardis ER (2008) Next‐generation DNA sequencing methods. Annual Review of Genomics and Human Genetics 9: 387–402.

McCarroll SA, Kuruvilla FG, Korn JM et al. (2008) Integrated detection and population‐genetic analysis of SNPs and copy number variation. Nature Genetics 40: 1166–1174.

McKernan KJ, Peckham HE, Costa GL et al. (2009) Sequence and structural variation in a human genome uncovered by short‐read, massively parallel ligation sequencing using two‐base encoding. Genome Research 19: 1527–1241.

Medvedev P, Stanciu M and Brudno M (2009) Computational methods for discovering structural variation with next‐generation sequencing. Nature Methods 6: S13–S20.

Morin RD, O'Connor MD, Griffith M et al. (2008) Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Research 18: 610–621.

Morozova O, Hirst M and Marra MA (2009) Applications of new sequencing technologies for transcriptome analysis. Annual Review of Genomics and Human Genetics 10: 135–151.

Pan Q, Shai O, Lee LJ et al. (2008) Deep surveying of alternative splicing complexity in the human transcriptome by high‐throughput sequencing. Nature Genetics 40: 1413–1415.

Park PJ (2009) ChIP‐seq: advantages and challenges of a maturing technology. Nature Reviews. Genetics 10: 669–680.

Pepke S, Wold B and Mortazavi A (2009) Computation for ChIP‐seq and RNA‐seq studies. Nature Methods 6: S22–S32.

Pop M and Salzberg SL (2008) Bioinformatics challenges of new sequencing technology. Trends in Genetics 24: 142–149.

Pushkarev D, Neff NF and Quake SR (2009) Single‐molecule sequencing of an individual human genome. Nature Biotechnology 27: 847–852.

Robertson G, Hirst M, Bainbridge M et al. (2007) Genome‐wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nature Methods 4: 651–657.

Scherer SW, Lee C, Birney E et al. (2007) Challenges and standards in integrating surveys of structural variation. Nature Genetics 39: S7–S15.

Sebat J, Lakshmi B, Troge J et al. (2004) Large‐scale copy number polymorphism in the human genome. Science 305: 525–528.

Shendure J and Ji H (2008) Next‐generation DNA sequencing. Nature Biotechnology 26: 1135–1145.

Strausberg RL, Levy S and Rogers YH (2008) Emerging DNA sequencing technologies for human genomic medicine. Drug Discovery Today 13: 569–577.

Tucker T, Marra M and Friedman JM (2009) Massively parallel sequencing: the next big thing in genetic medicine. American Journal of Human Genetics 85: 142–154.

Von Bubnoff A (2008) Next‐generation sequencing: the race is on. Cell 132: 721–723.

Wang J, Wang W, Li R et al. (2008) The diploid genome sequence of an Asian individual. Nature 456: 60–65.

Wang Z, Gerstein M and Snyder M (2009) RNA‐Seq: a revolutionary tool for transcriptomics. Nature Reviews. Genetics 10: 57–63.

Wheeler DA, Srinivasan M, Egholm M et al. (2008) The complete genome of an individual by massively parallel DNA sequencing. Nature 452: 872–876.

Wold B and Myers RM (2008) Sequence census methods for functional genomics. Nature Methods 5: 19–21.

Yoon S, Xuan Z, Makarov V et al. (2009) Sensitive and accurate detection of copy number variants using read depth of coverage. Genome Research 19: 1586–1592.

Further Reading

ABI® The SOLiD System: http://www3.appliedbiosystems.com/AB_Home/applicationstechnologies/SOLiDSystemSequencing/index.htm

Illumina® Sequencing Technology: http://www.illumina.com/technology/sequencing_technology.ilmn

Kahvejian A, Quackenbush J and Thompson JF (2008) What would you do if you could sequence everything? Nature Biotechnology 26: 1125–1133.

MacLean D, Jones JD and Studholme DJ (2009) Application of ‘next‐generation’ sequencing technologies to microbial genetics. Nature Reviews. Microbiology 7: 287–296.

Morozova O and Marra MA (2008) Applications of next‐generation sequencing technologies in functional genomics. Genomics 92: 255–264.

Roche® 454 Sequencing: http://www.454.com/

Contact Editor close
Submit a note to the editor about this article by filling in the form below.

* Required Field

How to Cite close
Chee‐Seng, Ku, En Yun, Loy, Yudi, Pawitan, and Kee‐Seng, Chia(Apr 2010) Next Generation Sequencing Technologies and Their Applications. In: eLS. John Wiley & Sons Ltd, Chichester. http://www.els.net [doi: 10.1002/9780470015902.a0022508]