Molecular Basis of Complex Traits


Most genetically determined differences between individuals, including (but not limited to) susceptibility to specific diseases, depend on functional variation at multiple genes (loci). The frequency of these variants (alleles) in the general population varies from common to rare. The contribution of most common alleles, each in isolation, to disease risk is weak. Most uncommon and rare alleles have not been studied to date and some may have strong effects on disease risk even if they contribute to disease cause each in a small subset of cases. This mixture of strong and weak effects by common and rare alleles is referred to as allelic architecture. Defining the allelic architecture of each disease will be the first step towards using an individual's genetic profile to individualise the molecular diagnosis within a group of cases that all bear the same clinical diagnostic label.

Key Concepts:

  • Most diseases and other human traits have statistically significant familial clustering, indicating the involvement of genetic susceptibility.

  • Genetic susceptibility to most diseases is determined by a large number of gene variants (loci).

  • Disease‐associated variants may change the sequence of the protein product, or the control of its transcription, RNA stability or translational efficiency.

  • Most known loci have a weak genetic association with the disease or trait.

  • A genetic association may be weak because of weak biology, or because the associated allele is merely an imperfect marker for an untested rare allele with strong effect.

  • The discovery, among low‐frequency alleles, of those that have the strongest effects, is the next frontier in the genetics of complex traits and the greatest promise to personalised medicine.

Keywords: complex trait; missing heritability; allele frequency; relative risk; nonsynonymous; genetic marker; linkage disequilibrium; allelic architecture; genome‐wide

Figure 1.

A schematic annotation of certain interesting properties of complex‐trait loci, depending on their position on the two‐dimensional space defined by the two continuums of relative risk (RR) and minor allele frequency (MAF). Please note that the axis markings are arbitrary and not in scale, as the figure is meant to illustrate the concept rather than present specific data.

Figure 2.

Position of 145 known dichotomous complex‐trait loci in the OR‐MAF space, to illustrate that the majority cluster in the low‐OR/high‐MAF quadrant and that few high‐OR/high‐MAF loci exist. Data were from loci for which both values are listed in GWAS Central (, except for HLA in type 1 diabetes, which were obtained from Erlich et al. . To make ORs directly comparable, they were calculated using the lower‐risk allele as base regardless of frequency, so that all values are >1. Quantitative trait loci from the same database were not included, as the effect magnitude is harder to standardise.



Bouatia‐Naji N, Rocheleau G, Van Lommel L et al. (2008) A polymorphism within the G6PC2 gene is associated with fasting plasma glucose levels. Science 320(5879): 1085–1088.

Cheung VG, Spielman RS, Ewens KG et al. (2005) Mapping determinants of human gene expression by regional and genome‐wide association. Nature 437(7063): 1365–1369.

Cho YS, Go MJ, Kim YJ et al. (2009) A large‐scale genome‐wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits. Nature Genetics 41(5): 527–534.

Daxinger L and Whitelaw E (2010) Transgenerational epigenetic inheritance: more questions than answers. Genome Research 20(12): 1623–1628.

Dimas AS, Deutsch S, Stranger BE et al. (2009) Common regulatory variation impacts gene expression in a cell type‐dependent manner. Science 325(5945): 1246–1250.

Erlich H, Valdes AM, Noble J et al. (2008) Type 1 Diabetes Genetics Consortium. HLA DR‐DQ haplotypes and genotypes and type 1 diabetes risk: analysis of the type 1 diabetes genetics consortium families. Diabetes 57(4): 1084–1092.

Feuk L, Carson AR and Scherer SW (2006) Structural variation in the human genome. Nature Reviews Genetics 7(2): 85–97.

Flanagan SE, Patch AM and Ellard S (2010) Using SIFT and PolyPhen to predict loss‐of‐function and gain‐of‐function mutations. Genetic Testing and Molecular Biomarkers 14(4): 533–537.

Frédéric MY, Lalande M, Boileau C et al. (2009) UMD‐predictor, a new prediction tool for nucleotide substitution pathogenicity – application to four genes: FBN1, FBN2, TGFBR1, and TGFBR2. Human Mutation 30(6): 952–959.

Goldstein DB (2009) Common genetic variation and human traits. New England Journal of Medicine 360: 1696–1698.

Julier C, Hyer RN, Davies J et al. (1991) Insulin‐IGF2 region on chromosome 11p encodes a gene implicated in HLA‐DR4‐dependent diabetes susceptibility. Nature 354(6349): 155–159.

Kerkel K, Spadola A, Yuan E et al. (2008) Genomic surveys by methylation‐sensitive SNP analysis identify sequence‐dependent allele‐specific DNA methylation. Nature Genetics 40(2008): 904–908.

Köttgen A, Glazer NL, Dehghan A et al. (2009) Multiple loci associated with indices of renal function and chronic kidney disease. Nature Genetics 41(6): 712–717.

Kyewski B and Derbinski J (2004) Self‐representation in the thymus: an extended view. Nature Reviews Immunology 4(9): 688–698.

Lee SH, Wray NR, Goddard ME and Visscher PM (2011) Estimating missing heritability for disease from genome‐wide association studies. American Journal of Human Genetics 88(3): 294–305.

Li Y, Vinckenbosch N, Tian G et al. (2010) Resequencing of 200 human exomes identifies an excess of low‐frequency non‐synonymous coding variants. Nature Genetics 42(11): 969–972.

Manolio TA, Collins FS, Cox NJ et al. (2009) Finding the missing heritability of complex diseases. Nature 461(7265): 747–753.

Nackley AG, Shabalina SA, Tchivileva IE et al. (2006) Human catechol‐O‐methyltransferase haplotypes modulate protein expression by altering mRNA secondary structure. Science 314(5807): 1930–1933.

Nejentsev S, Walker N, Riches D, Egholm M and Todd JA (2009) Rare variants of IFIH1, a gene implicated in antiviral responses, protect against type 1 diabetes. Science 324(5925): 387–389.

Nica AC, Parts L, Glass D et al. (2011) The architecture of gene regulatory variation across multiple human tissues: the MuTHER study. PLoS Genetics 7(2): e1002003.

Onkamo P, Vaananen S, Karvonen M and Tuomileho J (1999) Worldwide increase in incidence of type 1 diabetes – the analysis of the data on published incidence trends. Diabetologia 42: 1395–1403.

Rafiq S, Anand S and Roberts R (2010) Genome‐wide association studies of hypertension: have they been fruitful? Journal of Cardiovascular Translational Research 3(3): 189–196.

Redondo MJ, Fain PR, Krischer JP et al. (2004) Expression of beta‐cell autoimmunity does not differ between potential dizygotic twins and siblings of patients with type 1 diabetes. Journal of Autoimmunity 23(3): 275–279.

Redondo MJ, Jeffrey J, Fain PR, Eisenbarth GS and Orban T (2008) Concordance for islet autoimmunity among monozygotic twins. New England Journal of Medicine 359(26): 2849–2850.

Richards JB, Yuan X, Geller F et al. (2008) Male‐pattern baldness susceptibility locus at 20p11. Nature Genetics 40(11): 1282–1284.

Risch N (1990) Linkage strategies for genetically complex traits. I. Multilocus models. American Journal of Human Genetics 46(2): 222–228.

Sankararaman S, Sridhar S, Kimmel G and Halperin E (2008) Estimating local ancestry in admixed populations. American Journal of Human Genetics 82(2): 290–303.

Saxena R, Voight BF, Lyssenko V et al. (2007) Genome‐wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science 316(5829): 1331–1336.

The Welcome Trust Case Control Consortium (2007) Genome‐wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447(7145): 661–678.

Vafiadis P, Bennett ST, Todd JA et al. (1997) Insulin expression in human thymus is modulated by INS VNTR alleles at the IDDM2 locus. Nature Genetics 15(3): 289–292.

Verlaan DJ, Berlivet S, Hunninghake GM et al. (2009) Allele‐specific chromatin remodeling in the ZPBP2/GSDMB/ORMDL3 locus associated with the risk of asthma and autoimmune disease. American Journal of Human Genetics 85(3): 377–393.

Wei Z, Wang K, Qu HQ et al. (2009) From disease association to risk assessment: an optimistic view from genome‐wide association studies on type 1 diabetes. PLoS Genetics 5(10): e1000678.

Yang J, Benyamin B, McEvoy BP et al. (2010) Common SNPs explain a large proportion of the heritability for human height. Nature Genetics 42(7): 565–569.

Yang Y, Chung EK, Wu YL et al. (2007) Gene copy‐number variation and associated polymorphisms of complement component C4 in human systemic lupus erythematosus (SLE): low copy number is a risk factor for and high copy number is a protective factor against SLE susceptibility in European Americans. American Journal of Human Genetics 80(6): 1037–1054.

Further Reading

Cookson W, Liang L, Abecasis G, Moffatt M and Lathrop M (2009) Mapping complex disease traits with global gene expression. Nature Reviews Genetics 10(3): 184–194.

Frazer KA, Murray SS, Schork NJ and Topol EJ (2009) Human genetic variation and its contribution to complex traits. Nature Reviews Genetics 10(4): 241–251.

Majewski J and Pastinen T (2011) The study of eQTL variations by RNA‐seq: from SNPs to phenotypes. Trends in Genetics 27(2): 72–79.

Nica AC and Dermitzakis ET (2008) Using gene expression to investigate the genetic basis of complex disorders. Human Molecular Genetics 17(R2): R129–R134.

Plomin R, Haworth CM and Davis OS (2009) Common disorders are quantitative traits. Nature Reviews Genetics 10(12): 872–878.

Qu HQ, Tien M and Polychronakos C (2010) Statistical significance in genetic association studies. Clinical and Investigative Medicine 33(5): E266–E270.

Contact Editor close
Submit a note to the editor about this article by filling in the form below.

* Required Field

How to Cite close
Polychronakos, Constantin(Sep 2011) Molecular Basis of Complex Traits. In: eLS. John Wiley & Sons Ltd, Chichester. [doi: 10.1002/9780470015902.a0021448]