John Moult
Professor
Moult Group (240) 314-6241 jmoult@umd.eduResearch in Dr. John Moult’s laboratory is focused on computational modeling of biological systems, including:
- Investigating the effects of missense single nucleotide variants on protein structure and function to elucidate their role in human disease
- Integrating knowledge of the biological mechanisms underlying the relationship between human genetic variation and disease, particularly complex trait diseases such as Alzheimer’s, diabetes, and Crohn’s disease
- Using novel neural network architectures derived from deep biological knowledge to probe aspects of disease mechanism, including the evaluation of potential drug targets and the best choice of drug for any patient, given their genome sequence
- Conducting large-scale community experiments to assess and advance the state of the art in areas of computational biology, particularly genome interpretation and protein structure modeling
CURRENT RESEARCH
Missense Single Nucleotide Variants and Disease
In more than 7000 rare Mendelian diseases (e.g. cystic fibrosis, sickle cell anemia) and in most cancers, the most common underlying genetic causes are changes of a single DNA base resulting in an amino acid substitution, in turn resulting in altered protein function. The Moult lab develops new computational methods to determine the impact of these variants, using machine learning together with evolutionary and protein structure information. Emphasis is on maximizing the number of cases in which interpretation meets the high standards of reliability required in the clinic.
Integrating Knowledge of Disease Mechanism
Sequencing of the human genome drove an explosion of research into the relationship between genetic variants and disease phenotypes. The resulting deluge of information is overwhelming and knowledge is scattered throughout the literature. Dr. Moult’s group has developed a formal description of biological mechanism and they are utilizing it to capture and represent what is known (and not known) for key genotype/disease phenotype relationships. The disease mechanism framework is being developed in collaboration with a philosopher of science, Professor Lindley Darden, Department of Philosophy, University of Maryland, College Park. Knowledge is entered and accessed through a web resource, www.MecCog.org.
Novel Neural Network Architectures for Studying Disease Mechanism
Integrated knowledge of mechanism provides a basis for new quantitative analyses of disease. To this end, the Moult lab is developing a new neural network protocol, in which the network architecture is dictated by the disease mechanism, and nodes represent system perturbations at different stages of biological organization (e.g. RNA, protein, cell, tissue). Networks are trained using data from disease-related, genome-wide association studies (GWAS). Unlike conventional neural networks, direct representation of mechanism allows many aspects of a disease to be probed, including evaluation of uncertain mechanism components, exploration of non-linear and emergent properties, evaluation of potential drug target efficacy, and how drug response will depend on the genetic variants present in a patient.
Community Experiments to Advance Computational Biology
Computational biology is an extremely rapidly moving field, with new large data sets and algorithms continually appearing. In parallel with this, the advent of facile electronic communications and database access facilitate a new form of science in which participants from around the world work on common problems. Community-wide critical assessment experiments, such as Critical Assessment of Structure Prediction (CASP) and Critical Assessment of Genome Interpretation (CAGI) provide a platform for assessing the state-of-the-art in each area rapidly and clearly, thereby driving progress.
Publications
- Progress and Bottlenecks for Deep Learning in Computational Structure Biology: CASP Round XVI.
- Modeling Alternative Conformational States in CASP16.
- Protein Target Highlights in CASP16: Insights From the Structure Providers.
- Modeling Alternative Conformational States in CASP16.
- Functional Relevance of CASP16 Nucleic Acid Predictions as Evaluated by Structure Providers.
- Updates to the CASP Infrastructure in 2024.
- Redefining druggable targets with artificial intelligence.
- Functional relevance of CASP16 nucleic acid predictions as evaluated by structure providers.
- Critical assessment of methods of protein structure prediction (CASP)-Round XV.
- Breaking the conformational ensemble barrier: Ensemble structure modeling challenges in CASP15.
- Protein target highlights in CASP15: Analysis of models by structure providers.
- RNA target highlights in CASP15: Evaluation of predicted models by structure providers.
- More than just pattern recognition: Prediction of uncommon protein structure features by AI methods.
- New prediction categories in CASP15.
- Target highlights in CASP14: Analysis of models by structure providers.
- Critical assessment of methods of protein structure prediction (CASP)-Round XIV.
- Modeling SARS-CoV-2 proteins in the CASP-commons experiment.
- Computational models in the service of X-ray and cryo-electron microscopy structure determination.
- MecCog: A knowledge representation framework for genetic disease mechanism.
- Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype-weighted knowledge in the CAGI SickKids5 clinical genomes challenge.
- Critical assessment of methods of protein structure prediction (CASP)-Round XIII.
- Target highlights in CASP13: Experimental target structures through the eyes of their authors.
- Assessment of predicted enzymatic activity of α-N-acetylglucosaminidase variants of unknown significance for CAGI 2016.
- Reports from the fifth edition of CAGI: The Critical Assessment of Genome Interpretation.
- CAGI SickKids challenges: Assessment of phenotype and variant predictions derived from clinical and genomic data of children with undiagnosed diseases.
- Assessing computational predictions of the phenotypic effect of cystathionine-beta-synthase variants.
- Performance of computational methods for the evaluation of pericentriolar material 1 missense variants in CAGI-5.
- Assessment of methods for predicting the effects of PTEN and TPMT protein variants.
- Assessment of patient clinical descriptions and pathogenic variants from gene panel sequences in the CAGI-5 intellectual disability challenge.
- Predicting venous thromboembolism risk from exomes in the Critical Assessment of Genome Interpretation (CAGI) challenges.
- Mapping genetic variations to three-dimensional protein structures to enhance variant interpretation: a proposed framework.
- Evaluation of the template-based modeling in CASP12.
- Critical assessment of methods of protein structure prediction (CASP)-Round XII.
- A tribute to Anna Tramontano (1957-2017).
- Target highlights from the first post-PSI CASP experiment (CASP12, May-August 2016).
- Reply to HU et al.: On the interpretation of gasdermin-B expression quantitative trait loci data.
- Reports from CAGI: The Critical Assessment of Genome Interpretation.
- Working toward precision medicine: Predicting phenotypes from exomes in the Critical Assessment of Genome Interpretation (CAGI) challenges.
- Matching phenotypes to whole genomes: Lessons learned from four iterations of the personal genome project community challenges.
- Ensemble variant interpretation methods to predict enzyme activity and assign pathogenicity in the CAGI4 NAGLU (Human N-acetyl-glucosaminidase) and UBE2I (Human SUMO-ligase) challenges.
- CAGI4 Crohn's exome challenge: Marker SNP versus exome variant models for assigning risk of Crohn disease.
- CAGI4 SickKids clinical genomes challenge: A pipeline for identifying pathogenic variants.
- Determination of disease phenotypes and pathogenic variants from exome sequence data in the CAGI 4 gene panel challenge.
- Performance of in silico tools for the evaluation of p16INK4a (CDKN2A) variants in CAGI.
- Lessons from the CAGI-4 Hopkins clinical panel challenge.
- Some of the most interesting CASP11 targets through the eyes of their authors.
- Insights from GWAS: emerging landscape of mechanisms underlying complex trait disease.
- Genetic Basis of Common Human Disease: Insight into the Role of Missense SNPs from Genome-Wide Association Studies.
- Increasing the stability of the bacteriophage endolysin PlyC using rationale-based FoldX computational modeling.
- Critical assessment of methods of protein structure prediction (CASP)--round x.
- Challenging the state of the art in protein structure prediction: Highlights of experimental target structures for the 10th Critical Assessment of Techniques for Protein Structure Prediction Experiment CASP10.
- CASP10 results compared to those of previous CASP experiments.
- Assessment of protein disorder region predictions in CASP10.
- Protein characterization of a candidate mechanism SNP for Crohn's disease: the macrophage stimulating protein R689C substitution.
- Target highlights in CASP9: Experimental target structures for the critical assessment of techniques for protein structure prediction.
- Critical assessment of methods of protein structure prediction (CASP)--round IX.
- CASP9 results compared to those of previous CASP experiments.
- Protein stability and in vivo concentration of missense mutations in phenylalanine hydroxylase.
- Evaluation of disorder predictions in CASP9.
- Structural and functional impact of cancer-related missense somatic mutations.
- Critical assessment of methods of protein structure prediction - Round VIII.
- Evaluation of template-based models in CASP8 with standard measures.
- CASP8 results in context of previous experiments.
- A survey of proteins encoded by non-synonymous single nucleotide polymorphisms reveals a significant fraction with altered stability and activity.
- Stochastic noise in splicing machinery.
- Structural implication of splicing stochastics.
- Community-wide assessment of GPCR structure modelling and ligand docking: GPCR Dock 2008.
- Outcome of a workshop on applications of protein models in biomedical research.
- Outcome of a workshop on archiving structural models of biological macromolecules.
- Detection of operons.
- SNPs3D: candidate gene and SNP selection for association studies.
- Rigorous performance evaluation in protein structure modelling and implications for computational biology.
- Towards computing with proteins.
- Identification and analysis of deleterious human SNPs.
- Critical assessment of methods of protein structure prediction (CASP)--round 6.
- Progress over the first decade of CASP experiments.
- Protein family clustering for structural genomics.
- Loss of protein structure stability as a major causative factor in monogenic disease.
- The psychrophilic lifestyle as revealed by the genome sequence of Colwellia psychrerythraea 34H through genomic and proteomic analyses.
- A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction.
- Critical assessment of methods of protein structure prediction (CASP)-round V.
- Three-dimensional structural location and molecular functional effects of missense SNPs in the T cell receptor Vbeta domain.
- Assessment of progress over the CASP experiments.
- Evaluation of disorder predictions in CASP5.
- CAPRI: a Critical Assessment of PRedicted Interactions.
- Assisting functional assignment for hypothetical Heamophilus influenzae gene products through structural genomics.
- Predicting protein three-dimensional structure.
- Critical assessment of methods of protein structure prediction (CASP): round III.
- Some measures of comparative performance in the three CASPs.
- Processing and analysis of CASP3 protein structure predictions.
- Local electrostatic optimization in proteins.
- Determinants of side chain conformational preferences in protein structures.