NAR Genomics and Bioinformatics,| Volume 7, Issue 2, June 2025 | doi.org/10.1093/nargab/lqaf080
Anil Prakash, Moinak Banerjee
Large-scale quantitative studies have identified significant genetic associations for various neurological disorders. Expression quantitative trait locus (eQTL) studies have shown the effect of single-nucleotide polymorphisms (SNPs) on the differential expression of genes in brain tissues. However, a large majority of the associations are contributed by SNPs in the noncoding regions that can have significant regulatory function but are often ignored. Besides, mutations that are in high linkage disequilibrium with actual regulatory SNPs will also show significant associations. Therefore, it is important to differentiate a regulatory noncoding SNP with a nonregulatory one. To resolve this, we developed a deep learning model named Neur-Ally, which was trained on epigenomic datasets from nervous tissue and cell line samples. The model predicts differential occurrence of regulatory features like chromatin accessibility, histone modifications, and transcription factor binding on genomic regions using DNA sequence as input. The model was used to predict the regulatory effect of neurological condition-specific noncoding SNPs using in silico mutagenesis. The effect of associated SNPs reported in genome-wide association studies of neurological condition, brain eQTLs, autism spectrum disorder, and reported probable regulatory SNPs in neurological conditions were predicted by Neur-Ally.