An evaluation of the genetic-matched pair study design using genome-wide SNP data from the European population

An evaluation of the genetic-matched pair study design using genome-wide SNP data from the European population


Lu,T.T.; Lao,O.; Nothnagel,M.; Junge,O.; Freitag-Wolf,S.; Caliebe,A.; Balascakova,M.; Bertranpetit,J.; Bindoff,L.A.; Comas,D.; Holmlund,G.; Kouvatsi,A.; Macek,M.; Mollet,I.; Nielsen,F.; Parson,W.; Palo,J.; Ploski,R.; Sajantila,A.; Tagliabracci,A.; Gether,U.; Werge,T.; Rivadeneira,F.; Hofman,A.; Uitterlinden,A.G.; Gieger,C.; Wichmann,H.E.; Ruether,A.; Schreiber,S.; Becker,C.; Nurnberg,P.; Nelson,M.R.; Kayser,M.; Krawczak,M.;

Genetic matching potentially provides a means to alleviate the effects of incomplete Mendelian randomization in population-based gene-disease association studies. We therefore evaluated the genetic-matched pair study design on the basis of genome-wide SNP data (309 790 markers; Affymetrix GeneChip Human Mapping 500K Array) from 2457 individuals, sampled at 23 different recruitment sites across Europe. Using pair-wise identity-by-state (IBS) as a matching criterion, we tried to derive a subset of markers that would allow identification of the best overall matching (BOM) partner for a given individual, based on the IBS status for the subset alone. However, our results suggest that, by following this approach, the prediction accuracy is only notably improved by the first 20 markers selected, and increases proportionally to the marker number thereafter. Furthermore, in a considerable proportion of cases (76.0%), the BOM of a given individual, based on the complete marker set, came from a different recruitment site than the individual itself. A second marker set, specifically selected for ancestry sensitivity using singular value decomposition, performed even more poorly and was no more capable of predicting the BOM than randomly chosen subsets. This leads us to conclude that, at least in Europe, the utility of the genetic-matched pair study design depends critically on the availability of comprehensive genotype information for both cases and controls.European Journal of Human Genetics advance online publication, 21 January 2009; doi:10.1038/ejhg.2008.266

Eur.J.Hum.Genet. 2009 17(7):967-975
PubMed: 19156175