Skip to main content
Premium Trial:

Request an Annual Quote

HUPO’s Proteomics Standards Initiative to Merge mzData, mzXML Formats to Create New 'dataXML' Format

SEATTLE, June 1 (GenomeWeb News) - The Human Proteome Organization's Proteomics Standards Initiative announced this week that it will combine the current HUPO-PSI format, mzData, with the mzXML format developed by the Institute for Systems Biology.


The new, combined format will be called "dataXML." PSI officials said they expect the dataXML project to be mostly completed by the end of the year. They made their announcement at the American Society for Mass Spectrometry conference, held here this week.


"This is a major undertaking for the proteomics informatics community and represents widespread agreement on the need to improve data interchange," said PSI officials, who met here this week at the American Society for Mass Spectrometry conference.


The new format will incorporate features from both mzData and mzXML, including an interchange schema that has split data vectors compatible with other analytical interchange formats. It will also support both random access indexes and digital signatures via a wrapper schema.


The new format will also include tools to support developers and users, including a conocalization program to format legal XML documents before binary indexes or signatures are computed; a validation program to insure that the use of controlled vocabulary terms matches MIAPE requirements; an "Application Programming Interface" including language bindings for popular programming languages; and abstract data models and other documentation to help software developers who want to implement systems based on the interchange format.


PSI officials said they expect to complete a data model and ontology models in August, while documentation, draft specification of schema, and language bindings will be done in September. In December, they expect to complete binary indexing and signature programs, a validation program, and reference implementations of converters.

The Scan

Study Reveals New Details About Genetics of Major Cause of Female Infertility

Researchers in Nature Medicine conducted a whole-exome sequencing study of mote than a thousand patients with premature ovarian insufficiency.

Circulating Tumor DNA Shows Potential as Biomarker in Rare Childhood Cancer

A study in the Journal of Clinical Oncology has found that circulating tumor DNA levels in rhabdomyosarcoma may serve as a biomarker for prognosis.

Study Recommends Cancer Screening for Dogs Beginning Age Seven, Depending on Breed

PetDx researchers report in PLOS One that annual cancer screening for dogs should begin by age seven.

White-Tailed Deer Harbor SARS-CoV-2 Variants No Longer Infecting Humans, Study Finds

A new study in PNAS has found that white-tailed deer could act as a reservoir of SARS-CoV-2 variants no longer found among humans.