Genomic Approaches to Studying Neurodegenerative Diseases

The Genomic Revolution in Neurodegenerative Disease Research

Neurodegenerative diseases—including Alzheimer’s disease, Parkinson’s disease, amyotrophic lateral sclerosis (ALS), and frontotemporal dementia (FTD)—represent some of the most challenging and devastating conditions in modern medicine. These disorders are characterized by progressive loss of neuronal structure and function, leading to cognitive decline, motor impairment, and ultimately death. Despite decades of research, effective disease-modifying therapies remain scarce. The advent of high-throughput genomic technologies, however, has fundamentally transformed our understanding of the molecular underpinnings of neurodegeneration. By systematically interrogating the entire genome, transcriptome, and epigenome, researchers can now pinpoint inherited risk factors, somatic mutations, and regulatory alterations that drive disease pathogenesis. This article provides a comprehensive overview of the genomic approaches currently employed in neurodegenerative disease research, highlighting key discoveries, methodological advances, and the path toward translational applications.

From Candidate Genes to Genome-Wide Surveys

Early genetic studies of neurodegenerative diseases focused on rare, highly penetrant mutations identified through family-based linkage analysis. For Alzheimer’s disease (AD), mutations in APP, PSEN1, and PSEN2 were found to cause early-onset familial forms. Similarly, in Parkinson’s disease (PD), mutations in SNCA, LRRK2, and GBA were linked to both familial and sporadic cases. While these discoveries provided crucial insights, they explained only a small fraction of disease heritability. The majority of cases are sporadic, resulting from a complex interplay of multiple genetic variants and environmental factors. This realization drove the need for unbiased, genome-wide approaches to capture the full spectrum of genetic risk.

Genome-Wide Association Studies (GWAS)

GWAS have become the workhorse of complex disease genetics. By genotyping hundreds of thousands to millions of single nucleotide polymorphisms (SNPs) across large cohorts of cases and controls, researchers can identify common genetic variants associated with disease risk. For Alzheimer’s disease, the strongest and most replicated signal lies in the APOE gene; the ε4 allele increases risk by three- to twelvefold depending on zygosity. Beyond APOE, GWAS have now identified over 70 risk loci for AD, implicating pathways in lipid metabolism, immune function, endocytosis, and amyloid-β processing (Kunkle et al., 2019). In Parkinson’s disease, GWAS have uncovered more than 90 independent risk variants, with the GBA and LRRK2 loci showing the largest effect sizes in certain populations (Nalls et al., 2019). These studies have also revealed shared genetic architecture between neurodegenerative diseases and other conditions, such as inflammatory bowel disease and cancer, suggesting common pathogenic mechanisms.

Importantly, GWAS are limited in their ability to pinpoint causal variants because associated SNPs often fall in non-coding regions. Fine-mapping studies, coupled with functional genomics data (e.g., expression quantitative trait loci, or eQTLs), are required to identify the specific genes and regulatory elements driving each association. Resources such as the GWAS Catalog and the NIAGADS knowledge base provide centralized access to summary statistics and functional annotations.

Whole-Genome and Whole-Exome Sequencing

While GWAS capture common variants (minor allele frequency typically >1%), rare and low-frequency variants are better detected through sequencing approaches. Whole-exome sequencing (WES) targets the coding regions of the genome—approximately 1–2% of the total sequence—where most known disease-causing mutations reside. Whole-genome sequencing (WGS) captures both coding and non-coding regions, providing a complete picture of genetic variation, including structural variants, copy number alterations, and mutations in regulatory elements. Large-scale sequencing consortia such as the Alzheimer’s Disease Sequencing Project (ADSP) have identified rare missense mutations in genes like TREM2, ABCA7, and PLCG2 that confer risk for late-onset AD. In ALS, hexanucleotide repeat expansions in C9orf72, the most common genetic cause, were discovered through linkage and repeat-primed PCR, but WGS has since revealed the full mutational spectrum at that locus.

WGS also enables detection of somatic mutations—changes that occur post-zygotically in individual cells or tissues. Emerging evidence suggests that somatic mutations accumulate in the aging brain and may contribute to neurodegeneration. For example, studies have found increased rates of somatic single-nucleotide variants (SNVs) in neurons from individuals with Alzheimer’s disease compared to controls, with enrichment in genes involved in synaptic function. While the causal role of these mutations remains under investigation, they represent an exciting new frontier in genomic research.

Transcriptomics: Capturing the Dynamic Brain

Understanding how genetic variations influence gene expression is critical for translating genomic findings into disease mechanisms. Transcriptomic analyses—the comprehensive study of RNA transcripts—provide a snapshot of cellular activity at the molecular level. In neurodegenerative disease research, transcriptomics has been applied to postmortem brain tissue, cerebrospinal fluid (CSF), and induced pluripotent stem cell (iPSC)-derived neurons and glia.

Bulk Tissue Transcriptomics

Initial transcriptomic studies used microarrays and later RNA sequencing (RNA-seq) to profile gene expression in bulk tissue homogenates from affected brain regions, such as the hippocampus and prefrontal cortex in AD. These studies have consistently identified dysregulation of pathways related to synaptic transmission, mitochondrial function, neuroinflammation, and protein homeostasis. For example, reduced expression of synaptic genes like SYT1 and DLG4 is a hallmark of AD, while increased expression of glial and immune-related genes (e.g., GFAP, TREM2) reflects the neuroinflammatory component. Co-expression network analyses, such as weighted gene co-expression network analysis (WGCNA), have further identified modules of co-regulated genes that correlate with disease severity and pathological hallmarks.

Single-Cell and Single-Nucleus RNA Sequencing

The brain is a highly heterogeneous organ composed of diverse cell types—neurons, astrocytes, microglia, oligodendrocytes, and more. Bulk RNA-seq averages signals across these populations, obscuring cell-type-specific changes. Single-cell RNA sequencing (scRNA-seq) and single-nucleus RNA sequencing (snRNA-seq) overcome this limitation by profiling individual cells or nuclei. These technologies have transformed our understanding of cellular vulnerability in neurodegeneration. For instance, single-cell studies in AD have revealed that excitatory neurons in the entorhinal cortex are particularly susceptible to tau pathology, while microglia undergo dramatic transcriptional reprogramming, shifting from a homeostatic to a disease-associated state characterized by upregulation of APOE, TREM2, and CLEC7A (Mathys et al., 2019). In multiple sclerosis, which shares neurodegenerative features, single-cell analyses have identified oligodendrocyte heterogeneity and a microglial signature linked to lesion progression.

Emerging Multi-Omics at Single-Cell Resolution

The field is now moving beyond transcriptomics to integrate other modalities at the single-cell level. Single-cell ATAC-seq (assay for transposase-accessible chromatin) measures chromatin accessibility, revealing regulatory elements active in specific cell types. Single-cell DNA methylation profiling captures epigenetic landscapes. Efforts like the BRAIN Initiative Cell Census Network (BICCN) and the Human Cell Atlas are systematically mapping cell types and states across the human brain, providing an essential reference for neurodegenerative disease research.

Epigenomics: Beyond the DNA Sequence

Epigenetic modifications regulate gene expression without altering the DNA sequence itself. Aberrant epigenetic marks are increasingly recognized as contributors to neurodegeneration, potentially mediating the effects of aging, environmental exposures, and lifestyle factors. Two major areas of investigation are DNA methylation and histone modifications.

DNA Methylation in Neurodegeneration

DNA methylation typically occurs at cytosine residues in CpG dinucleotides and is associated with transcriptional repression. Genome-wide methylation arrays (e.g., Illumina EPIC) and whole-genome bisulfite sequencing allow researchers to survey methylation patterns at single-base resolution. In Alzheimer’s disease, studies have identified differential methylation at genes involved in amyloid processing (e.g., BACE1, APP), tau phosphorylation, and immune function. A notable example is the ANK1 gene, whose hypomethylation in AD brains was consistently replicated across multiple cohorts. Methylation changes also occur in mitochondrial DNA, linking energy metabolism to neurodegeneration.

Interestingly, many AD-associated methylation differences are located at CpG sites that overlap with GWAS risk loci, suggesting that genetic variants may influence disease risk in part by affecting local methylation. This interplay between genetics and epigenetics is an active area of research, with studies leveraging Mendelian randomization to infer causal relationships.

Histone Modifications and Chromatin Remodeling

Histone modifications—such as acetylation, methylation, and phosphorylation—modulate chromatin structure and gene accessibility. In neurodegenerative diseases, global changes in histone acetylation have been observed; for example, reduced histone H4 acetylation in AD models correlates with memory deficits. Enzymes that write, read, and erase these marks, including histone deacetylases (HDACs) and acetyltransferases (HATs), are being explored as therapeutic targets. However, because these enzymes are broadly expressed and regulate many cellular processes, developing brain-specific modulators remains a challenge.

Advanced techniques like ChIP-seq (chromatin immunoprecipitation followed by sequencing) and CUT&Tag enable genome-wide mapping of histone marks and transcription factor binding sites in brain tissue. These methods have been applied to uncover regulatory regions that drive cell-type-specific gene expression and to identify how disease-associated variants alter chromatin states.

Integrating Multi-Omics Data for Mechanistic Insights

No single omics layer can fully explain the complexity of neurodegeneration. The integration of genomic, transcriptomic, epigenomic, proteomic, and metabolomic data—collectively termed multi-omics—provides a systems-level understanding of disease biology. Bioinformatics tools for data integration include partial least squares regression, network-based approaches (e.g., similarity network fusion), and machine learning models that prioritize features across datasets. For example, the AMP-AD Knowledge Portal hosts harmonized multi-omics data from human brain samples, enabling researchers to identify modules of co-regulated molecules that are enriched for genetic risk factors and predict causal drivers.

A powerful integrative strategy is the use of expression quantitative trait loci (eQTLs) and epigenetic QTLs (eQTLs) to link GWAS signals to specific genes. By mapping variants that affect the expression or methylation of nearby genes, researchers can prioritize candidate genes for functional validation. For instance, a non-coding variant in an AD risk locus may be linked to altered expression of BIN1 or CD2AP in microglia, pointing to cell-type-specific mechanisms. Co-localization and Mendelian randomization analyses further strengthen causal inference.

Proteomics and Metabolomics: The Functional Endpoint

Genomic and transcriptomic changes do not always correlate with protein levels due to post-transcriptional and post-translational regulation. Direct measurement of proteins and metabolites provides a closer approximation to disease phenotype. Mass spectrometry-based proteomics has identified hundreds of proteins altered in AD brain tissue, including tau isoforms, amyloid precursor protein fragments, and synaptic proteins. Targeted approaches such as SomaLogic and Olink panels allow high-throughput protein quantification in CSF and plasma, facilitating biomarker discovery. Similarly, metabolomic profiling reveals dysregulation of energy metabolism, lipid metabolism, and amino acid pathways. Integrating these data with genetic variants (e.g., protein quantitative trait loci, or pQTLs) can reveal potential drug targets.

Challenges and Future Directions

Despite remarkable progress, genomic approaches to neurodegenerative diseases face several obstacles. First, the genetic architecture of these disorders is highly heterogeneous, with many variants of small effect. Large sample sizes are required for sufficient statistical power, and population diversity remains a major limitation; the vast majority of studies have been conducted in individuals of European ancestry. Efforts to include African, Asian, and Latin American cohorts are underway but need expansion to ensure that findings are globally applicable and that therapeutic strategies benefit all populations.

Second, functional validation of genomic findings is time-consuming yet essential. CRISPR-based gene editing in iPSC-derived neurons and glia, along with animal models, must be scaled to keep pace with the volume of genetic discoveries. Third, the ethical implications of genomic risk prediction, particularly for incurable diseases, demand careful consideration. Disclosure of APOE status for Alzheimer’s risk remains controversial due to potential psychological harm and discrimination. Polygenic risk scores (PRS)—which combine the effects of thousands of variants into a single metric—offer the promise of early risk stratification, but their clinical utility, reproducibility, and equity across populations are still being evaluated.

Looking ahead, genomic technologies will likely drive the next wave of therapeutic innovation. Antisense oligonucleotides (ASOs) and gene therapy approaches for monogenic forms of disease, such as HTT in Huntington’s disease and C9orf72 in ALS, are already in clinical trials. For complex forms, CRISPR-based epigenome editing could, in principle, modulate risk gene expression without altering the DNA sequence. Meanwhile, single-cell multi-omics, spatial transcriptomics, and long-read sequencing promise to reveal new types of variation, including structural variants, repeat expansions, and RNA modifications, that may be involved in neurodegeneration. The integration of artificial intelligence and machine learning will further accelerate the analysis of vast datasets, identifying patterns invisible to traditional statistics.

Ultimately, the goal of genomic research in neurodegeneration is to decode the molecular logic that transforms healthy aging into devastating disease. With continued investment in collaborative, data-sharing initiatives and rigorous functional follow-up, the path from genetic discovery to effective therapies becomes ever more tangible.

Conclusion

Genomic approaches have fundamentally reshaped our understanding of neurodegenerative diseases. From GWAS that uncovered dozens of risk variants to single-cell transcriptomics that mapped cellular vulnerability, these technologies provide a molecular roadmap of disease. Integration across omics layers, coupled with advances in functional genomics and computational biology, will be essential for translating these discoveries into diagnostics and treatments. As the field moves toward precision medicine, the genomic insights gained today will form the foundation for tomorrow’s therapies, offering hope to millions affected by these devastating conditions.