Pospiech, E.; Chen, Y.; Kukla-Bartoszek, M.; Breslin, K.; Aliferi, A.; Andersen, J.D.; Ballard, D.; Chaitanya, L.; Freire-Aradas, A.; van der Gaag, K.J.; Giron-Santamaria, L.; Gross, T.E.; Gysi, M.; Huber, G.; Mosquera-Miguel, A.; Muralidharan, C.; Skowron, M.; Carracedo, A.; Haas, C.; Morling, N.; Parson, W.; Phillips, C.; Schneider, P.M.; Sijen, T.; Syndercombe-Court, D.; Vennemann, M.; Wu, S.; Xu, S.; Jin, L.; Wang, S.; Zhu, G.; Martin, N.G.; Medland, S.E.; Branicki, W.; Walsh, S.; Liu, F.; Kayser, M.; EUROFORGEN-NoE Consortium
Human head hair shape, commonly classified as straight, wavy, curly or frizzy, is an attractive target for Forensic DNA Phenotyping and other applications of human appearance prediction from DNA, such as in paleogenetics. The genetic knowledge underlying head hair shape variation was recently improved by the outcome of a series of genome-wide association and replication studies in a total of 26,964 subjects, which highlighted 12 loci, 8 of them novel, and introduced a prediction model for Europeans based on 14 SNPs. In the present study, we evaluated the capacity of DNA-based head hair shape prediction by investigating an extended set of candidate SNP predictors and by using an independent set of samples for model validation. Prediction model building was carried out in 9674 previously studied subjects (6068 from Europe, 2899 from Asia and 707 of admixed European and Asian ancestries), considering a novel list of 90 candidate SNPs. For model validation, genotype and phenotype data were newly collected in 2415 independent subjects (2138 Europeans and 277 non-Europeans) by applying two targeted massively parallel sequencing platforms, Ion Torrent PGM and MiSeq, or the MassARRAY platform. A binomial model was developed to predict straight vs. non-straight hair based on 32 SNPs from 26 genetic loci that we identified as significantly contributing to the model. This model achieved prediction accuracies, expressed as AUC, of 0.664 in Europeans and 0.789 in non-Europeans; the statistically significant difference was explained mostly by the effect of one EDAR SNP in non-Europeans. Considering sex and age in addition to the SNPs increased the prediction accuracies slightly but not significantly (AUC of 0.680 and 0.800, respectively).
Based on the sample size and candidate DNA markers investigated, this study provides the most robust, validated, and accurate statistical prediction models and SNP predictor marker sets currently available for predicting head hair shape from DNA, providing the next step towards broadening Forensic DNA Phenotyping beyond pigmentation traits.
Forensic Sci Int Genet 2018 37:241-251
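
The abstract describes a binomial model that predicts straight vs. non-straight hair from SNP genotypes and reports its accuracy as AUC. The following is a minimal sketch of that general idea, not the authors' model: it assumes a logistic form, and the SNP names (`snp_a`, `snp_b`), coefficients, intercept, and toy samples are all hypothetical placeholders rather than values from the study.

```python
import math

# Hypothetical per-allele effects and intercept; NOT taken from the study.
BETAS = {"snp_a": 0.8, "snp_b": -0.5}
INTERCEPT = -0.3


def predict_non_straight(genotypes, betas=BETAS, intercept=INTERCEPT):
    """Probability of non-straight hair under an assumed logistic model.

    `genotypes` maps each SNP name to its minor-allele count (0, 1 or 2).
    """
    z = intercept + sum(beta * genotypes[snp] for snp, beta in betas.items())
    return 1.0 / (1.0 + math.exp(-z))


def auc(scores, labels):
    """Area under the ROC curve, computed as the probability that a random
    positive (label 1) receives a higher score than a random negative
    (label 0); ties count as one half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Toy validation set: (genotypes, observed label), 1 = non-straight hair.
    samples = [
        ({"snp_a": 2, "snp_b": 0}, 1),
        ({"snp_a": 1, "snp_b": 0}, 1),
        ({"snp_a": 0, "snp_b": 1}, 0),
        ({"snp_a": 0, "snp_b": 2}, 0),
    ]
    scores = [predict_non_straight(g) for g, _ in samples]
    labels = [y for _, y in samples]
    print("AUC on toy data:", auc(scores, labels))
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which is the scale on which the reported values of 0.664 and 0.789 should be read.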