Population Stratification, Adjustment for


Population stratification is a major concern in genetic association studies. Failure to control for it effectively can lead to excess false‐positive results and to failure to detect true associations. Many methods have been designed to adjust for population stratification; most fall into the following categories: (1) genomic control, (2) structured association, (3) principal component or multidimensional scaling adjustment, (4) the stratification score method and (5) other approaches. No method is likely to be superior in all situations. Care needs to be taken to ensure that the assumptions of the chosen method are met and that the method is used for its intended purpose.

Keywords: population stratification; genomic control; structured association; principal components; multidimensional scaling; stratification score
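
As an illustration of the first category named in the abstract, genomic control in its simplest commonly described form estimates an inflation factor from the median of the association test statistics and rescales every statistic by it. The sketch below assumes 1-degree-of-freedom chi-square statistics; the function name and inputs are hypothetical and are not taken from the article.

```python
import numpy as np

# Median of the chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364

def genomic_control(chi2_stats):
    """Return (lambda, adjusted statistics) for 1-df chi-square test statistics.

    Assumption: stratification inflates all statistics by a roughly
    constant factor, which is estimated from their median.
    """
    stats = np.asarray(chi2_stats, dtype=float)
    lam = np.median(stats) / CHI2_1DF_MEDIAN
    # By convention the factor is not allowed to fall below 1,
    # so the correction never makes the tests more liberal.
    lam = max(lam, 1.0)
    return lam, stats / lam
```

A call such as `genomic_control(stats)` on genome-wide statistics returns the estimated inflation factor together with the deflated statistics. A single constant factor is a strong assumption, which is one reason the abstract cautions that no method is superior in all situations.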

Figure 1.

Multidimensional scaling versus principal component approach. These figures show the clustering results using MDS and PCA with 5000 genome‐wide random autosomal SNPs from the HapMap project Phase I data. In the top panel, (a)–(c) are generated using the PCA approach as implemented in Eigenanalysis. In the bottom panel, (d)–(f) are generated using the MDS approach. Pairwise plots of the first three dimensions are presented. There is no apparent difference between the two approaches in their ability to visualise the ancestral differences among these populations. Multiple runs gave similar results. The signs of dimensions 1 and 2 in the MDS plots have been reversed to match the positions of the PCA clusters; this does not change the relative location of each cluster. CEU, CEPH Utah residents with ancestry from northern and western Europe; CHB, Han Chinese from Beijing, China; JPT, Japanese from Tokyo, Japan; YRI, Yoruba in Ibadan, Nigeria; MDS, multidimensional scaling; PCA, principal component analysis.
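
The close agreement between the two panels, and the harmless sign reversal, can be reproduced on synthetic data. The sketch below is a minimal illustration, not the analysis behind the figure: it assumes classical (metric) MDS on Euclidean distances, whereas the distance measure used for the figure is not stated here, and it uses simulated genotypes rather than HapMap data. All function names are hypothetical.

```python
import numpy as np

def pca_scores(X, k):
    """Scores on the first k principal components of column-centred X."""
    Xc = X - X.mean(axis=0)
    U, S, _ = np.linalg.svd(Xc, full_matrices=False)
    return U[:, :k] * S[:k]

def classical_mds(X, k):
    """Classical MDS coordinates from squared Euclidean distances between rows."""
    sq = np.sum(X ** 2, axis=1)
    D2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    n = X.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n      # centring matrix
    B = -0.5 * J @ D2 @ J                    # double-centred Gram matrix
    vals, vecs = np.linalg.eigh(B)
    order = np.argsort(vals)[::-1][:k]       # largest eigenvalues first
    return vecs[:, order] * np.sqrt(np.maximum(vals[order], 0.0))

def align_signs(A, ref):
    """Flip the sign of each column of A so that it correlates positively with ref."""
    signs = np.sign(np.sum(A * ref, axis=0))
    signs[signs == 0] = 1.0
    return A * signs

# Synthetic genotypes (0/1/2 allele counts) for two groups of 20 individuals
# with different allele frequencies at 200 markers.
rng = np.random.default_rng(0)
freqs_a = rng.uniform(0.1, 0.9, size=200)
freqs_b = np.clip(freqs_a + rng.normal(0.0, 0.15, size=200), 0.05, 0.95)
X = np.vstack([
    rng.binomial(2, freqs_a, size=(20, 200)),
    rng.binomial(2, freqs_b, size=(20, 200)),
]).astype(float)

pca = pca_scores(X, 3)
mds = classical_mds(X, 3)
same_up_to_sign = np.allclose(align_signs(mds, pca), pca, atol=1e-6)
```

Under these assumptions the two sets of coordinates coincide once each axis is allowed to flip sign, which is why reversing the sign of a dimension leaves the relative location of each cluster unchanged. With other distance measures the two approaches need not agree exactly.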



Gao, Xiaoyi, and Edwards, Todd L (Oct 2010) Population Stratification, Adjustment for.