TY - JOUR
T1 - The mzIdentML data standard version 1.2, supporting advances in proteome informatics
AU - Vizcaíno, Juan Antonio
AU - Mayer, Gerhard
AU - Perkins, Simon
AU - Barsnes, Harald
AU - Vaudel, Marc
AU - Perez-Riverol, Yasset
AU - Ternent, Tobias
AU - Uszkoreit, Julian
AU - Eisenacher, Martin
AU - Fischer, Lutz
AU - Rappsilber, Juri
AU - Netza, Eugen
AU - Walzer, Mathias
AU - Kohlbacher, Oliver
AU - Leitner, Alexander
AU - Chalkley, Robert J.
AU - Ghali, Fawaz
AU - Martínez-Bartolome, Salvador
AU - Deutsch, Eric W.
AU - Jones, Andrew R.
PY - 2017/7/1
Y1 - 2017/7/1
N2 - The first stable version of the Proteomics Standards Initiative mzIdentML open data standard (version 1.1) was published in 2012 capturing the outputs of peptide and protein identification software. In the intervening years, the standard has become well-supported in both commercial and open software, as well as a submission and download format for public repositories. Here we report a new release of mzIdentML (version 1.2) that is required to keep pace with emerging practice in proteome informatics. New features have been added to support: (1) scores associated with localization of modifications on peptides; (2) statistics performed at the level of peptides; (3) identification of cross-linked peptides; and (4) support for proteogenomics approaches. In addition, there is now improved support for the encoding of de novo sequencing of peptides, spectral library searches, and protein inference. As a key point, the underlying XML schema has only undergone very minor modifications to simplify as much as possible the transition from version 1.1 to version 1.2 for implementers, but there have been several notable updates to the format specification, implementation guidelines, controlled vocabularies and validation software. mzIdentML 1.2 can be described as backwards compatible, in that reading software designed for mzIdentML 1.1 should function in most cases without adaptation. We anticipate that these developments will provide a continued stable base for software teams working to implement the standard.
AB - The first stable version of the Proteomics Standards Initiative mzIdentML open data standard (version 1.1) was published in 2012 capturing the outputs of peptide and protein identification software. In the intervening years, the standard has become well-supported in both commercial and open software, as well as a submission and download format for public repositories. Here we report a new release of mzIdentML (version 1.2) that is required to keep pace with emerging practice in proteome informatics. New features have been added to support: (1) scores associated with localization of modifications on peptides; (2) statistics performed at the level of peptides; (3) identification of cross-linked peptides; and (4) support for proteogenomics approaches. In addition, there is now improved support for the encoding of de novo sequencing of peptides, spectral library searches, and protein inference. As a key point, the underlying XML schema has only undergone very minor modifications to simplify as much as possible the transition from version 1.1 to version 1.2 for implementers, but there have been several notable updates to the format specification, implementation guidelines, controlled vocabularies and validation software. mzIdentML 1.2 can be described as backwards compatible, in that reading software designed for mzIdentML 1.1 should function in most cases without adaptation. We anticipate that these developments will provide a continued stable base for software teams working to implement the standard.
UR - http://www.scopus.com/inward/record.url?scp=85021813762&partnerID=8YFLogxK
U2 - 10.1074/mcp.M117.068429
DO - 10.1074/mcp.M117.068429
M3 - Article
C2 - 28515314
AN - SCOPUS:85021813762
SN - 1535-9476
VL - 16
SP - 1275
EP - 1285
JO - Molecular and Cellular Proteomics
JF - Molecular and Cellular Proteomics
IS - 7
ER -