Functional Genomics

Research Overview

Our research group in the Department of Neurology at Columbia University aims to understand the genetic basis of molecular functions underlying neurological disorders, particularly Alzheimer’s disease and multiple sclerosis. We use computational and experimental approaches to integrate population-scale genomics, epigenomics, transcriptomics, proteomics, and metabolomics data. Our work spans a range of subjects at the intersection of genetics, statistics, machine learning, bioinformatics, and neurological sciences. We are particularly interested in developing and applying advanced computational methods to address significant problems in our field. Therefore, much of our research involves the thoughtful analysis of large-scale ‘omics data to generate new hypotheses on the molecular mechanisms of disease etiology, or to inspire the development of novel computational methodology.

One focus of our work is the statistical analysis of molecular quantitative trait loci (QTL). We use statistical mapping tools to quantify the genetic regulation of molecular traits in a large number of cells and tissues collected from aging brains, cerebrospinal fluid, and blood. This allows us to create a detailed picture of the functional genomic regulation landscape in the brain and central nervous system. Through collaborations with research groups across the country, we study the QTL of various molecular phenotypes, including chromatin accessibility, histone acetylation, transcription factor binding activity, methylation, small RNAs, gene expression, alternative polyadenylation, alternative splicing, proteomics, lipidomics, and metabolomics.

Another focus of our research is the computational discovery of novel regulatory genomics mechanisms. We use existing sequencing and other assay data, along with new computational methods, to uncover new molecular phenotypes. For example, we quantify novel spatial patterns of histone acetylation and methylation QTL. We also develop new methods for generating alternative polyadenylation data. We investigate mechanisms of genomic regulation through the development and application of improved computational methods in statistical modeling and machine learning for fine-mapping, colocalization, functional variant annotation and prediction, epigenome and transcriptome-wide association, and gene-set and pathway discovery.

We also investigate the ancestry-specific regulation of functional genomic features. By integrating the growing collection of molecular QTL resources with molecular data from minority population samples across the country, we aim to uncover the differential genetic regulations at the variants, genes, and pathways levels. We study the heterogeneity of cross-population QTL, and use the similarities between populations to improve the prediction of molecular phenotypes in minority populations.

Furthermore, we use functional genomics knowledge to study neurodegenerative disorders. We integrate whole genome sequences, brain and CNS QTL data, and a broad range of epigenomic data to study Alzheimer’s disease. We develop and apply multi-omics integration methods to understand the synergy of ‘omics regulations in conferring disease risk. We integrate functional genomic data to aid in the discovery of rare and/or non-coding genetic variants (including copy number variations). We also develop approaches for profiling AD families and patients using genetic, functional genomic, and pathology information at all levels to better characterize and uncover the heterogeneity within AD. Finally, we study the genetic comorbidity between AD and other complex traits in diverse cohorts.

An important aspect of our group’s mission is the development of functional genomics data and computational resources. We curate data resources and build user-friendly computational tools such as software packages and bioinformatics pipelines. These tools and resources are designed to facilitate the analysis and interpretation of large-scale ‘omics data in the context of neurological disorders. We also actively engage in educational and outreach efforts, with the goal of fostering a vibrant and collaborative research community.

On the group wiki page you can find more details on the ongoing projects and links to the code repositories associated with these projects.

If you are interested in joining our team related to any of these research directions, please contact us at: wang.gao@columbia.edu.