Identifying Genes Underlying Human Inherited Disease

The field of genetic epidemiology seeks to identify the genetic risk factors underlying human disease. First, one must assess whether the trait has an underlying genetic influence; this is done by estimating familial relative risk and conducting segregation analysis. In order to discover genes influencing human disease, both family- and population-based studies are designed, and linkage and association methods are used to analyse the relationship between the trait(s) of interest and genetic markers. A variety of approaches to both linkage and association analysis exist, each having strengths and weaknesses for differing situations. In general, linkage analysis is more powerful for detecting rare genes, whereas association analysis is more powerful for detecting common allelic effects that might be weaker. In addition, when studying complex traits, other issues must be considered, including interactions with other genes and the environment, parent-of-origin effects, as well as the effect of structural variants.

Key Concepts:

  • Genetic epidemiologists design both family- and population-based studies to find genes influencing disease risk.
  • Before conducting a gene mapping study, one must first ascertain whether the trait is influenced by genetic effects.
  • Linkage and association analysis are two complementary methods used to analyse the relationship between trait(s) and genetic markers.
  • The approach used (linkage versus association) depends on the study design, and these approaches depend on the genetic parameters (rare versus common, large versus small effects).
  • Other factors – gene–gene interaction, gene–environment interaction, existence of structural variants and parent-of-origin effects – further complicate the search for disease genes.

Keywords: gene mapping; linkage analysis; genetic association analysis; complex traits; genetic epidemiology; multifactorial diseases

Figure 1. The most appropriate analysis strategy depends in part on the disease allele frequency (common versus rare) and strength of genetic effect due to that locus. Note that neither linkage nor association analysis has good power to detect rare loci with weak effects. Adapted by permission from Macmillan Publishers Ltd: Ardlie et al. ((2002) Figure 2). Copyright © 2002.
close
 References
    Abecasis GR, Cookson WOC and Cardon LR (2000) Pedigree tests of transmission disequilibrium. European Journal of Human Genetics 8: 545–551.
    Amos C, Elston R, Bonney G, Keats B and Berenson G (1990) A multivariate method for detecting genetic linkage, with application to a pedigree with an adverse lipoprotein phenotype. American Journal of Human Genetics 47: 247–254.
    Ardlie KG, Kruglyak L and Seielstad M (2002) Patterns of linkage disequilibrium in the human genome. Nature Reviews. Genetics 3: 299–309.
    Bartlett CW, Vieland VJ, Bartlett J et al. (2007) Discussing gene-gene interaction: warning – translating equations to English may result in jabberwocky. Genetic Epidemiology 31(suppl. 7): S61–S67.
    Bodmer W and Bonilla C (2008) Common and rare variants in multifactorial susceptibility to common diseases. Nature Genetics 40: 695–701.
    Burdick JT, Chen WM, Abecasis GR and Cheung VG (2006) In silico method for inferring genotypes in pedigrees. Nature Genetics 38: 1002–1004.
    Chanock SJ, Manolio T, Boehnke M et al. (2007) Replicating genotype-phenotype associations. Nature 447: 655–660.
    Clark A (2004) The role of haplotypes in candidate gene studies. Genetic Epidemiology 27: 321–333.
    Clerget-Darpoux F and Elston RC (2007) Are linkage analysis and the collection of family data dead? Prospects for family studies in the age of genome-wide association. Human Heredity 64: 91–96.
    Cordell H (2001) Sample size requirements to control for stochastic variation in magnitude and location of allele-sharing linkage statistics in affected sibling pairs. Annals of Human Genetics 65: 491–502.
    Davey Smith G and Ebrahim S (2003) ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? International Journal of Epidemiology 32: 1–22.
    Demenais F and Bonney G (1989) Equivalence of the mixed and regressive models for genetic analysis. I. Continuous traits. Genetic Epidemiology 6: 597–617.
    Devlin B and Roeder K (1999) Genomic control for association studies. Biometrics 55: 997–1004.
    Didelez V and Sheehan N (2007) Mendelian randomization as an instrumental variable approach to causal inference. Statistical Methods in Medical Research 16: 309–330.
    Eaves L, Neale M and Maes H (1996) Multivariate multipoint linkage analysis of quantitative trait loci. Behavioral Genetics 26: 519–525.
    Elston RC, Song D and Iyengar SK (2005) Mathematical assumptions versus biological reality: myths in affected sib pair linkage analysis. American Journal of Human Genetics 76: 152–156.
    book Falconer D and Mackay T (1996) Introduction to Quantitative Genetics, 4th edn. Harlow, England: Prentice Hall.
    Frazer KA, Murray SS, Schork NJ and Topol EJ (2009) Human genetic variation and its contribution to complex traits. Nature Reviews. Genetics 10: 241–251.
    Gabriel SB, Schaffner SF, Nguyen H et al. (2002) The structure of haplotype blocks in the human genome. Science 296: 2225–2229.
    George V and Elston RC (1987) Testing the association between polymorphic markers and quantitative traits in pedigrees. Genetic Epidemiology 4: 193–202.
    George V, Tiwari HK, Zhu X and Elston RC (1999) A test of transmission/disequilibrium for quantitative traits in pedigree data, by multiple regression. American Journal of Human Genetics 65: 236–245.
    Goddard K, Witte J, Suarez B, Catalona W and Olson J (2001) Model-free linkage analysis with covariates confirms linkage of prostate cancer to chromosomes 1 and 4. American Journal of Human Genetics 68: 1197–1206.
    Haseman J and Elston R (1972) The investigation of linkage between a quantitative trait and a marker locus. Behavioral Genetics 2: 3–19.
    Hodge S and Elston R (1994) Lods, wrods, and mods: the interpretation of lod scores calculated under different models. Genetic Epidemiology 11: 329–342.
    International HapMap Consortium (2005) A haplotype map of the human genome. Nature 437: 1299–1320.
    Jakobsson M, Scholz SW, Scheet P et al. (2008) Genotype, haplotype and copy-number variation in worldwide human populations. Nature 451: 998–1003.
    Janssens AC and van Duijn CM (2008) Genome-based prediction of common diseases: advances and prospects. Human Molecular Genetics 17: R166–R173.
    Jorde LB and Wooding SP (2004) Genetic variation, classification and ‘race’. Nature Genetics 36: S28–S33.
    Khoury MJ and Flanders WD (1996) Nontraditional epidemiologic approaches in the analysis of gene-environment interaction: case-control studies with no controls! American Journal of Epidemiology 144: 207–213.
    Lange C, Demeo DL and Laird NM (2002) Power and design considerations for a general class of family based association tests: quantitative traits. American Journal of Human Genetics 71: 1330–1341.
    Manolio TA, Collins FS, Cox NJ et al. (2009) Finding the missing heritability of complex diseases. Nature 461: 747–753.
    McCarroll SA and Altshuler DM (2007) Copy-number variation and association studies of human disease. Nature Genetics 39: S37–S42.
    McCarroll SA, Kuruvilla FG, Korn JM et al. (2008) Integrated detection and population-genetic analysis of SNPs and copy number variation. Nature Genetics 40: 1166–1174.
    McCarthy MI, Abecasis GR, Cardon LR et al. (2008) Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nature Reviews. Genetics 9: 356–369.
    McClearn GE (2004) Nature and nurture: interaction and coaction. American Journal of Medical Genetics. Part B, Neuropsychiatric Genetics 124B: 124–130.
    Olson J (1999) A general conditional-logistic model for affected-relative-pair linkage studies. American Journal of Human Genetics 65: 1760–1769.
    Olson JM and Cordell HJ (2000) Ascertainment bias in the estimation of sibling genetic risk parameters. Genetic Epidemiology 18: 217–235.
    book Ott J (1999) Analysis of Human Genetic Linkage, 3rd edn. Baltimore: Johns Hopkins Press.
    Page GP, George V, Go RC, Page PZ and Allison DB (2003) “Are we there yet?”: deciding when one has demonstrated specific genetic causation in complex diseases and quantitative traits. American Journal of Human Genetics 73: 711–719.
    Pritchard JK and Cox NJ (2002) The allelic architecture of human disease genes: common disease-common variantor not? Human Molecular Genetics 11: 2417–2423.
    Pritchard JK, Stephens M and Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155: 945–959.
    Reich DE and Lander ES (2001) On the allelic spectrum of human disease. Trends in Genetics 17: 502–510.
    Risch N (1990) Linkage strategies for genetically complex traits. II. The power of affected relative pairs. American Journal of Human Genetics 46: 229.
    Satten GA, Flanders WD and Yang Q (2001) Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. American Journal Human Genetics 68: 466–477.
    Schaid DJ (2004) Evaluating associations of haplotypes with traits. Genetic Epidemiology 27: 348–364.
    Schmidt S and Schaid DJ (1999) Potential misinterpretation of the case-only study to assess gene-environment interaction. American Journal of Epidemiology 150: 878–885.
    Sinha M, Larkin EK, Elston RC and Redline S (2006) Self-reported race and genetic admixture. New England Journal of Medicine 354: 421–422.
    Spielman RS, McGinnis RE and Ewens WJ (1993) Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). American Journal of Human Genetics 52: 506–516.
    Tang H, Quertermous T, Rodriguez B et al. (2005) Genetic structure, self-identified race/ethnicity, and confounding in case-control association studies. American Journal of Human Genetics 76: 268–275.
    The International HapMap Consortium (2003) The International HapMap Project. Nature 426: 789–796.
    The Wellcome Trust Case Control Consortium (2007) Genome-wide association study of 14 000 cases of seven common diseases and 3000 shared controls. Nature 447: 661–678.
    Thompson CL, Rybicki BA, Iannuzzi MC et al. (2006) Reduction of sample heterogeneity through use of population substructure: an example from a population of African American families with sarcoidosis. American Journal of Human Genetics 79: 606–613.
    Wang T and Elston RC (2007) Regression-based multivariate linkage analysis with an application to blood pressure and body mass index. Annals of Human Genetics 71: 96–106.
    Zhu X, Li S, Cooper RS and Elston RC (2008) A unified association analysis approach for family and unrelated samples correcting for stratification. American Journal of Human Genetics 82: 352–365.
 Further Reading
    Altshuler D, Daly MJ and Lander ES (2008) Genetic mapping in human disease. Science 322: 881–888.
    Elston RC and Spence MA (2006) Advances in statistical human genetics over the last 25 years. Statistical Medicine 25: 3049–3080.
    Gray-McGuire C, Bochud M, Goodloe R and Elston RC (2009) Genetic association tests: a method for the joint analysis of family and case-control data. Human Genomics 4: 2–20.
    Lander E and Schork N (1994) Genetic dissection of complex traits. Science 265: 2037–2048.
    Morris NJ, Gray-McGuire C and Stein CM (2009) Mendelian randomization in family data. BMC Proceedings 3(suppl. 7): S45.
    book Rao DC and Province MS (eds) (2001) Genetic Dissection of Complex Traits. San Diego, CA: Academic Press.
    book Thomas DC (2004) Statistical Methods in Genetic Epidemiology. New York: Oxford University Press.
    Wang WY, Barratt BJ, Clayton DG and Todd JA (2005) Genome-wide association studies: theoretical and practical concerns. Nature Reviews. Genetics 6: 109–118.
Contact Editor close
Submit a note to the editor about this article by filling in the form below.

* Required Field

How to Cite close
Stein, Catherine M(Apr 2010) Identifying Genes Underlying Human Inherited Disease. In: eLS. John Wiley & Sons Ltd, Chichester. http://www.els.net [doi: 10.1002/9780470015902.a0022395]