Exhaustive Annotation of Human Exome Gene Variants with Consensus Pathogenicity

No alt text provided for this image

A novel approach is developed to address the challenge of annotating with phenotypic effects those exome variants for which relevant empirical data are lacking or minimal. The predictive annotation method is implemented as a stacked ensemble of supervised base-learners (), including distributed random forest and gradient boosting machines. Ensemble models were applied to 84 million non-synonymous single-nucleotide-variants (SNVs). The consensus model combined 39 functional mutation impacts, cross-species conservation score, and gene indispensability score. The indispensability score, accounting for differences in variant pathogenicities including in essential and mutation-tolerant genes, considerably improved the predictions. The consensus combination is consistent with as many input scores as possible while minimizing false predictions. The input scores are ranked based on their ability to predict effects. The score rankings and categorical phenotypic variant effect predictions are aimed for direct use in clinical and biological applications to prioritize human exome variants and mutations. https://doi.org/10.3390/genes11091076

要查看或添加评论,请登录

Victor Zharavin的更多文章

社区洞察

其他会员也浏览了