Learning Monotonic Genotype-Phenotype Maps

Niko Beerenwinkel; Patrick Knupfer; Achim Tresch

doi:10.2202/1544-6115.1603

Home Life Sciences Learning Monotonic Genotype-Phenotype Maps

Article

Licensed

Unlicensed Requires Authentication

Learning Monotonic Genotype-Phenotype Maps

Niko Beerenwinkel , Patrick Knupfer and Achim Tresch

Published/Copyright: January 6, 2011

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Statistical Applications in Genetics and Molecular Biology Volume 10 Issue 1

MLA
APA
Harvard
Chicago
Vancouver

MLA
APA
Harvard
Chicago
Vancouver

Beerenwinkel, Niko, Knupfer, Patrick and Tresch, Achim. "Learning Monotonic Genotype-Phenotype Maps" Statistical Applications in Genetics and Molecular Biology, vol. 10, no. 1. https://doi.org/10.2202/1544-6115.1603

Beerenwinkel, N., Knupfer, P. & Tresch, A. (). Learning Monotonic Genotype-Phenotype Maps. Statistical Applications in Genetics and Molecular Biology, 10(1). https://doi.org/10.2202/1544-6115.1603

Beerenwinkel, N., Knupfer, P. and Tresch, A. () Learning Monotonic Genotype-Phenotype Maps. Statistical Applications in Genetics and Molecular Biology, Vol. 10 (Issue 1). https://doi.org/10.2202/1544-6115.1603

Beerenwinkel, Niko, Knupfer, Patrick and Tresch, Achim. "Learning Monotonic Genotype-Phenotype Maps" Statistical Applications in Genetics and Molecular Biology 10, no. 1 (). https://doi.org/10.2202/1544-6115.1603

Beerenwinkel N, Knupfer P, Tresch A. Learning Monotonic Genotype-Phenotype Maps. Statistical Applications in Genetics and Molecular Biology. ;10(1). https://doi.org/10.2202/1544-6115.1603

Copy

Copied to clipboard

BibTeX EndNote RIS

Evolutionary escape of pathogens from the selective pressure of immune responses and from medical interventions is driven by the accumulation of mutations. We introduce a statistical model for jointly estimating the dynamics and dependencies among genetic alterations and the associated phenotypic changes. The model integrates conjunctive Bayesian networks, which define a partial order on the occurrences of genetic events, with isotonic regression. The resulting genotype-phenotype map is non-decreasing in the lattice of genotypes. It describes evolutionary escape as a directed process following a phenotypic gradient, such as a monotonic fitness landscape. We present efficient algorithms for parameter estimation and model selection. The model is validated using simulated data and applied to HIV drug resistance data. We find that the effect of many resistance mutations is non-linear and depends on the genetic background in which they occur.

Keywords: genotype-phenotype map; conjunctive Bayesian networks; HIV drug resistance; isotonic regression

Published Online: 2011-1-6

You are currently not able to access this content.

Articles in the same Issue

Invited Editorial
Measurement of Evidence and Evidence of Measurement
Article
Fully Moderated T-statistic for Small Sample Size Gene Expression Arrays
Determining Coding CpG Islands by Identifying Regions Significant for Pattern Statistics on Markov Chains
Assessing Modularity Using a Random Matrix Theory Approach
Choice of Summary Statistic Weights in Approximate Bayesian Computation
Genetic Linkage Analysis in the Presence of Germline Mosaicism
Fitting Boolean Networks from Steady State Perturbation Data
Adaptive Elastic-Net Sparse Principal Component Analysis for Pathway Association Testing
Bayesian Learning from Marginal Data in Bionetwork Models
Unsupervised Classification for Tiling Arrays: ChIP-chip and Transcriptome
Multiple Testing in Candidate Gene Situations: A Comparison of Classical, Discrete, and Resampling-Based Procedures
Modeling Read Counts for CNV Detection in Exome Sequencing Data
Multiscale Characterization of Signaling Network Dynamics through Features
A Calibrated Multiclass Extension of AdaBoost
False Discovery Rate Estimation for Stability Selection: Application to Genome-Wide Association Studies
A Markov-Chain Model for the Analysis of High-Resolution Enzymatically ¹⁸O-Labeled Mass Spectra
Repeated Measures Semiparametric Regression Using Targeted Maximum Likelihood Methodology with Application to Transcription Factor Activity Discovery
Learning Monotonic Genotype-Phenotype Maps
A Comparison of Multifactor Dimensionality Reduction and L₁-Penalized Regression to Identify Gene-Gene Interactions in Genetic Association Studies
Accuracy and Computational Efficiency of a Graphical Modeling Approach to Linkage Disequilibrium Estimation
Learning from Past Treatments and Their Outcome Improves Prediction of In Vivo Response to Anti-HIV Therapy
A Three Component Latent Class Model for Robust Semiparametric Gene Discovery
Log-Linear Modelling of Protein Dipeptide Structure Reveals Interesting Patterns of Side-Chain-Backbone Interactions
A Robust Statistical Method to Detect Null Alleles in Microsatellite and SNP Datasets in Both Panmictic and Inbred Populations
Large Sample Approximations of Probabilities of Correct Evolutionary Tree Estimation and Biases of Maximum Likelihood Estimation
Interval Estimation of Familial Correlations from Pedigrees
Information Metrics in Genetic Epidemiology
Linear Combination Test for Hierarchical Gene Set Analysis
Exploratory Analysis of Multiple Omics Datasets Using the Adjusted RV Coefficient
Application of the Lasso to Expression Quantitative Trait Loci Mapping
A Variance-Components Model for Distance-Matrix Phylogenetic Reconstruction
Imputation Estimators Partially Correct for Model Misspecification
On the Statistical Properties of SGoF Multitesting Method
Meta-Analysis of Family-Based and Case-Control Genetic Association Studies that Use the Same Cases
A Non-Parametric Method for Detecting Specificity Determining Sites in Protein Sequence Alignments
Performance of Matrix Representation with Parsimony for Inferring Species from Gene Trees
Disequilibrium Coefficient: A Bayesian Perspective
Analyzing Time-Course Microarray Data Using Functional Data Analysis - A Review
The NBP Negative Binomial Model for Assessing Differential Gene Expression from RNA-Seq
Inferring Gene Networks using Robust Statistical Techniques
A Two-Stage Poisson Model for Testing RNA-Seq Data
Quantifying the Relative Contribution of the Heterozygous Class to QTL Detection Power
The Joint Null Criterion for Multiple Hypothesis Tests
Multiple Imputation of Missing Phenotype Data for QTL Mapping
Sparse Canonical Covariance Analysis for High-throughput Data
Comparison of Clinical Subgroup aCGH Profiles through Pseudolikelihood Ratio Tests
Random Forests for Genetic Association Studies
Deviance Information Criteria for Model Selection in Approximate Bayesian Computation
High-Dimensional Regression and Variable Selection Using CAR Scores
Surveying the Manifold Divergence of an Entire Protein Class for Statistical Clues to Underlying Biochemical Mechanisms
Smoothing Gene Expression Data with Network Information Improves Consistency of Regulated Genes
Entropy Based Genetic Association Tests and Gene-Gene Interaction Tests
Weighted Lasso with Data Integration
MA-SNP -- A New Genotype Calling Method for Oligonucleotide SNP Arrays Modeling the Batch Effect with a Normal Mixture Model
A Modified Maximum Contrast Method for Unequal Sample Sizes in Pharmacogenomic Studies

Search journal Search the content of this journal

https://doi.org/10.2202/1544-6115.1603

Keywords for this article

genotype-phenotype map; conjunctive Bayesian networks; HIV drug resistance; isotonic regression

Articles in the same Issue

Invited Editorial
Measurement of Evidence and Evidence of Measurement
Article
Fully Moderated T-statistic for Small Sample Size Gene Expression Arrays
Determining Coding CpG Islands by Identifying Regions Significant for Pattern Statistics on Markov Chains
Assessing Modularity Using a Random Matrix Theory Approach
Choice of Summary Statistic Weights in Approximate Bayesian Computation
Genetic Linkage Analysis in the Presence of Germline Mosaicism
Fitting Boolean Networks from Steady State Perturbation Data
Adaptive Elastic-Net Sparse Principal Component Analysis for Pathway Association Testing
Bayesian Learning from Marginal Data in Bionetwork Models
Unsupervised Classification for Tiling Arrays: ChIP-chip and Transcriptome
Multiple Testing in Candidate Gene Situations: A Comparison of Classical, Discrete, and Resampling-Based Procedures
Modeling Read Counts for CNV Detection in Exome Sequencing Data
Multiscale Characterization of Signaling Network Dynamics through Features
A Calibrated Multiclass Extension of AdaBoost
False Discovery Rate Estimation for Stability Selection: Application to Genome-Wide Association Studies
A Markov-Chain Model for the Analysis of High-Resolution Enzymatically ¹⁸O-Labeled Mass Spectra
Repeated Measures Semiparametric Regression Using Targeted Maximum Likelihood Methodology with Application to Transcription Factor Activity Discovery
Learning Monotonic Genotype-Phenotype Maps
A Comparison of Multifactor Dimensionality Reduction and L₁-Penalized Regression to Identify Gene-Gene Interactions in Genetic Association Studies
Accuracy and Computational Efficiency of a Graphical Modeling Approach to Linkage Disequilibrium Estimation
Learning from Past Treatments and Their Outcome Improves Prediction of In Vivo Response to Anti-HIV Therapy
A Three Component Latent Class Model for Robust Semiparametric Gene Discovery
Log-Linear Modelling of Protein Dipeptide Structure Reveals Interesting Patterns of Side-Chain-Backbone Interactions
A Robust Statistical Method to Detect Null Alleles in Microsatellite and SNP Datasets in Both Panmictic and Inbred Populations
Large Sample Approximations of Probabilities of Correct Evolutionary Tree Estimation and Biases of Maximum Likelihood Estimation
Interval Estimation of Familial Correlations from Pedigrees
Information Metrics in Genetic Epidemiology
Linear Combination Test for Hierarchical Gene Set Analysis
Exploratory Analysis of Multiple Omics Datasets Using the Adjusted RV Coefficient
Application of the Lasso to Expression Quantitative Trait Loci Mapping
A Variance-Components Model for Distance-Matrix Phylogenetic Reconstruction
Imputation Estimators Partially Correct for Model Misspecification
On the Statistical Properties of SGoF Multitesting Method
Meta-Analysis of Family-Based and Case-Control Genetic Association Studies that Use the Same Cases
A Non-Parametric Method for Detecting Specificity Determining Sites in Protein Sequence Alignments
Performance of Matrix Representation with Parsimony for Inferring Species from Gene Trees
Disequilibrium Coefficient: A Bayesian Perspective
Analyzing Time-Course Microarray Data Using Functional Data Analysis - A Review
The NBP Negative Binomial Model for Assessing Differential Gene Expression from RNA-Seq
Inferring Gene Networks using Robust Statistical Techniques
A Two-Stage Poisson Model for Testing RNA-Seq Data
Quantifying the Relative Contribution of the Heterozygous Class to QTL Detection Power
The Joint Null Criterion for Multiple Hypothesis Tests
Multiple Imputation of Missing Phenotype Data for QTL Mapping
Sparse Canonical Covariance Analysis for High-throughput Data
Comparison of Clinical Subgroup aCGH Profiles through Pseudolikelihood Ratio Tests
Random Forests for Genetic Association Studies
Deviance Information Criteria for Model Selection in Approximate Bayesian Computation
High-Dimensional Regression and Variable Selection Using CAR Scores
Surveying the Manifold Divergence of an Entire Protein Class for Statistical Clues to Underlying Biochemical Mechanisms
Smoothing Gene Expression Data with Network Information Improves Consistency of Regulated Genes
Entropy Based Genetic Association Tests and Gene-Gene Interaction Tests
Weighted Lasso with Data Integration
MA-SNP -- A New Genotype Calling Method for Oligonucleotide SNP Arrays Modeling the Batch Effect with a Normal Mixture Model
A Modified Maximum Contrast Method for Unequal Sample Sizes in Pharmacogenomic Studies