TY - JOUR
T1 - Trans-Proteomic Pipeline, a standardized data processing pipeline for large-scale reproducible proteomics informatics
AU - Deutsch, Eric W.
AU - Mendoza, Luis
AU - Shteynberg, David
AU - Slagel, Joseph
AU - Sun, Zhi
AU - Moritz, Robert L.
N1 - Publisher Copyright:
© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
PY - 2015/8/1
Y1 - 2015/8/1
N2 - Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include MS to define protein sequence, protein:protein interactions, and protein PTMs. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative MS proteomics. It supports all major operating systems and instrument vendors via open data formats. Here, we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of MS/MS datasets, as well as some major upcoming features.
AB - Democratization of genomics technologies has enabled the rapid determination of genotypes. More recently the democratization of comprehensive proteomics technologies is enabling the determination of the cellular phenotype and the molecular events that define its dynamic state. Core proteomic technologies include MS to define protein sequence, protein:protein interactions, and protein PTMs. Key enabling technologies for proteomics are bioinformatic pipelines to identify, quantitate, and summarize these events. The Trans-Proteomics Pipeline (TPP) is a robust open-source standardized data processing pipeline for large-scale reproducible quantitative MS proteomics. It supports all major operating systems and instrument vendors via open data formats. Here, we provide a review of the overall proteomics workflow supported by the TPP, its major tools, and how it can be used in its various modes from desktop to cloud computing. We describe new features for the TPP, including data visualization functionality. We conclude by describing some common perils that affect the analysis of MS/MS datasets, as well as some major upcoming features.
KW - Bioinformatics
KW - Mass spectrometry
UR - http://www.scopus.com/inward/record.url?scp=84948987628&partnerID=8YFLogxK
U2 - 10.1002/prca.201400164
DO - 10.1002/prca.201400164
M3 - Review article
C2 - 25631240
AN - SCOPUS:84948987628
SN - 1862-8346
VL - 9
SP - 745
EP - 754
JO - Proteomics - Clinical Applications
JF - Proteomics - Clinical Applications
IS - 7-8
ER -