TY - CHAP
T1 - Chapter 11
T2 - Data Formats of the Proteomics Standards Initiative
AU - Vizcaíno, Juan Antonio
AU - Perkins, Simon
AU - Jones, Andrew R.
AU - Deutsch, Eric W.
N1 - Publisher Copyright:
© 2017 Royal Society of Chemistry.
PY - 2017/1/1
Y1 - 2017/1/1
N2 - The existence and adoption of data standards in computational proteomics, as in any other field, is generally perceived to be crucial for the further development of the discipline. We here give an up-to-date overview of the open standard data formats that have been developed under the umbrella of the Proteomics Standards Initiative (PSI). We will focus in those formats related to mass spectrometry (MS). Most of them are based in XML (Extensible Markup Language) schemas: mzML (for primary MS data, the output of mass spectrometers), mzIdentML (for peptide and protein identification data), mzQuantML (for peptide and protein quantification data) and TraML (for reporting transition lists for selected reaction monitoring approaches). In addition, mzTab was developed as a simpler tab-delimited file to support peptide, protein and small molecule identification and quantification data in the same file. In all cases, we will explain the main characteristics of each format, describe the main existing software implementations and give an update of the ongoing work to extend the formats to support new use cases. Additionally, we will discuss other data formats that have been inspired by the PSI formats. Finally, other PSI data standard formats (not MS related) will be also outlined in brief.
AB - The existence and adoption of data standards in computational proteomics, as in any other field, is generally perceived to be crucial for the further development of the discipline. We here give an up-to-date overview of the open standard data formats that have been developed under the umbrella of the Proteomics Standards Initiative (PSI). We will focus in those formats related to mass spectrometry (MS). Most of them are based in XML (Extensible Markup Language) schemas: mzML (for primary MS data, the output of mass spectrometers), mzIdentML (for peptide and protein identification data), mzQuantML (for peptide and protein quantification data) and TraML (for reporting transition lists for selected reaction monitoring approaches). In addition, mzTab was developed as a simpler tab-delimited file to support peptide, protein and small molecule identification and quantification data in the same file. In all cases, we will explain the main characteristics of each format, describe the main existing software implementations and give an update of the ongoing work to extend the formats to support new use cases. Additionally, we will discuss other data formats that have been inspired by the PSI formats. Finally, other PSI data standard formats (not MS related) will be also outlined in brief.
UR - http://www.scopus.com/inward/record.url?scp=85006801811&partnerID=8YFLogxK
U2 - 10.1039/9781782626732-00229
DO - 10.1039/9781782626732-00229
M3 - Chapter
AN - SCOPUS:85006801811
T3 - New Developments in Mass Spectrometry
SP - 231
EP - 258
BT - Proteome Informatics
A2 - Gaskell, Simon J.
A2 - Bessant, Conrad
PB - Royal Society of Chemistry
ER -