Evaluation of the predictive capacity of DNA variants associated with straight hair in Europeans

Evaluation of the predictive capacity of DNA variants associated with straight hair in Europeans


Pospiech,E.; Karlowska-Pik,J.; Marcinska,M.; Abidi,S.; Andersen,J.D.; vandenBerge,M.; Carracedo,A.; Eduardoff,M.; Freire-Aradas,A.; Morling,N.; Sijen,T.; Skowron,M.; Sochtig,J.; Syndercombe-Court,D.; Weiler,N.; Schneider,P.M.; Ballard,D.; Borsting,C.; Parson,W.; Phillips,C.; Branicki,W.; E.UROFORGEN-NoEConsortium;

DNA-based prediction of hair morphology, defined as straight, curly or wavy hair, could contribute to an improved description of an unknown offender and allow more accurate forensic reconstructions of physical appearance in the field of forensic DNA phenotyping. Differences in scalp hair morphology are significant at the worldwide scale and within Europe. The only genome-wide association study made to date revealed the Trichohyalin gene (TCHH) to be significantly associated with hair morphology in Europeans and reported weaker associations for WNT10A and FRAS1 genes. We conducted a study that centered on six SNPs located in these three genes with a sample of 528 individuals from Poland. The predictive capacity of the candidate DNA variants was evaluated using logistic regression; classification and regression trees; and neural networks, by applying a 10-fold cross validation procedure. Additionally, an independent test set of 142 males from six European populations was used to verify performance of the developed prediction models. Our study confirmed association of rs11803731 (TCHH), rs7349332 (WNT10A) and rs1268789 (FRAS1) SNPs with hair morphology. The combined genotype risk score for straight hair had an odds ratio of 2.7 and these predictors explained approximately 8.2% of the total variance. The selected three SNPs were found to predict straight hair with a high sensitivity but low specificity when a 10-fold cross validation procedure was applied and the best results were obtained using the neural networks approach (AUC=0.688, sensitivity=91.2%, specificity=23.0%). Application of the neural networks model with 65% probability threshold on an additional test set gave high sensitivity (81.4%) and improved specificity (50.0%) with a total of 78.7% correct calls, but a high non-classification rate (66.9%). The combined TTGGGG SNP genotype for rs11803731, rs7349332, rs1268789 (European frequency=4.5%) of all six straight hair-associated alleles was identified as the best predictor, giving >80% probability of straight hair. Finally, association testing of 44 SNPs previously identified to be associated with male pattern baldness revealed a suggestive association with hair morphology for rs4679955 on 3q25.1. The study results reported provide the starting point for the development of a predictive test for hair morphology in Europeans. More studies are now needed to discover additional determinants of hair morphology to improve the predictive accuracy of this trait in forensic analysis.

Forensic Sci Int Genet 2015 19:280-8
PubMed: 26414620