Startseite A Classification Model for the Leiden Proteomics Competition
Artikel
Lizenziert
Nicht lizenziert Erfordert eine Authentifizierung

A Classification Model for the Leiden Proteomics Competition

  • Huub C. J. Hoefsloot , Suzanne Smit und Age K. Smilde
Veröffentlicht/Copyright: 19. Februar 2008

A strategy is presented to build a discrimination model in proteomics studies. The model is built using cross-validation. This cross-validation step can simply be combined with a variable selection method, called rank products. The strategy is especially suitable for the low-samples-to-variables-ratio (undersampling) case, as is often encountered in proteomics and metabolomics studies. As a classification method, Principal Component Discriminant Analysis is used; however, the methodology can be used with any classifier. A data set containing serum samples from breast cancer patients and healthy controls is analysed. Double cross-validation shows that the sensitivity of the model is 82% and the specificity 86%. Potential putative biomarkers are identified using the variable selection method. In each cross-validation loop a classification model is built. The final classification uses a majority voting scheme from the ensemble classifier.

Published Online: 2008-2-19

©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston

Artikel in diesem Heft

  1. Editorial
  2. International Competition on Mass Spectrometry Proteomic Diagnosis
  3. Introduction Paper
  4. Case-Control Breast Cancer Study of MALDI-TOF Proteomic Mass Spectrometry Data on Serum Samples
  5. Organizing a Competition on Clinical Mass Spectrometry Based Proteomic Diagnosis
  6. Competition Paper
  7. Application of the Random Forest Classification Method to Peaks Detected from Mass Spectrometric Proteomic Profiles of Cancer Patients and Controls
  8. Developing a Discrimination Rule between Breast Cancer Patients and Controls Using Proteomics Mass Spectrometric Data: A Three-Step Approach
  9. Principal Component Discriminant Analysis
  10. Classification of Breast Cancer versus Normal Samples from Mass Spectrometry Profiles Using Linear Discriminant Analysis of Important Features Selected by Random Forest
  11. A Classification Model for the Leiden Proteomics Competition
  12. Empirical Bayes Logistic Regression
  13. Autocorrelated Logistic Ridge Regression for Prediction Based on Proteomics Spectra
  14. Support Vector Machine Approach to Separate Control and Breast Cancer Serum Samples
  15. A Cross-Validation Study to Select a Classification Procedure for Clinical Diagnosis Based on Proteomic Mass Spectrometry
  16. Clinical Mass Spectrometry Proteomic Diagnosis by Conformal Predictors
  17. Article
  18. Assessing the Validity Domains of Graphical Gaussian Models in Order to Infer Relationships among Components of Complex Biological Systems
  19. Assessment
  20. Breast Cancer Diagnosis from Proteomic Mass Spectrometry Data: A Comparative Evaluation
Heruntergeladen am 26.9.2025 von https://www.degruyterbrill.com/document/doi/10.2202/1544-6115.1351/html
Button zum nach oben scrollen