State of the human proteome in 2014/2015 As viewed through peptideatlas: Enhancing accuracy and coverage through the atlas prophet

Eric W. Deutsch, Zhi Sun, David Campbell, Ulrike Kusebauch, Caroline S. Chu, Luis Mendoza, David Shteynberg, Gilbert S. Omenn, Robert L. Moritz

Research output: Contribution to journalArticlepeer-review

66 Citations (Scopus)

Abstract

The Human PeptideAtlas is a compendium of the highest quality peptide identifications from over 1000 shotgun mass spectrometry proteomics experiments collected from many different laboratories, all reanalyzed through a uniform processing pipeline. The latest 2015-03 build contains substantially more input data than past releases, is mapped to a recent version of our merged reference proteome, and uses improved informatics processing and the development of the AtlasProphet to provide the highest quality results. Within the set of ∼20 000 neXtProt primary entries, 14 070 (70%) are confidently detected in the latest build, 5% are ambiguous, 9% are redundant, leaving the total percentage of proteins for which there are no mapping detections at just 16% (3166), all derived from over 133 million peptide-spectrum matches identifying more than 1 million distinct peptides using AtlasProphet to characterize and classify the protein matches. Improved handling for detection and presentation of single amino-acid variants (SAAVs) reveals the detection of 5326 uniquely mapping SAAVs across 2794 proteins. With such a large amount of data, the control of false positives is a challenge. We present the methodology and results for maintaining rigorous quality along with a discussion of the implications of the remaining sources of errors in the build.

Original languageEnglish
Pages (from-to)3461-3473
Number of pages13
JournalJournal of Proteome Research
Volume14
Issue number9
DOIs
Publication statusPublished - 4 Sept 2015
Externally publishedYes

Keywords

  • Human Proteome Project
  • PeptideAtlas
  • observed proteome
  • repositories
  • shotgun proteomics
  • tandem mass spectrometry

Cite this