Hutch Diagnostics Push Could Be Microsoft's Segue into Biofx


The Fred Hutchinson Cancer Research Center’s new $4.4 million initiative to identify proteins in human serum that indicate early-stage cancer should add fuel to the fire in the field of protein disease-marker discovery. It might also signal the arrival of a new face in bioinformatics: Microsoft.

The heart of the project will be a large-scale human serum proteome database to capture, store, and analyze data. Researchers from the Hutchinson Center and Ruedi Aebersold’s group at the Institute for Systems Biology will work together to analyze samples and identify new protein markers by mass spectrometry. Then, together with Microsoft, they will co-develop the database, which will eventually serve as a GenBank-like public resource for proteomics data, according to Martin McIntosh, a Hutch biostatistician.

But first, they need to build it, which is where Microsoft comes in. Jim Gray, a relational database pioneer and senior researcher in Microsoft’s Bay Area Research Center — fresh from similar stints in other scientific domains — will lend his skills to the project. Gray will focus on optimizing Microsoft’s SQL Server database technology for the nuances of biological data, which is “much more complex than any other kind of data that I’ve had to deal with,” he says.

The early detection project should put Gray’s database skills to the test. As with microarray gene expression data, the correlation of proteomics data across multiple experiments at different locations is a bioinformatics nightmare. However, proteomics experiments present an additional obstacle over gene expression experiments, according to McIntosh: “With SAGE and cDNA arrays, you’re trying to measure something you’ve already identified — you know there’s a gene, and you’re trying to measure its expression. What we’re doing now is trying to identify what’s there, and then we can talk about combining measurements that quantify it. … So the challenge is combining databases that do both discovery and quantification at the same time.”

And Gray admits that Microsoft sees a real opportunity in optimizing its database technology for the bioinformatics market. “I don’t expect to see BioInfo 1.0 as a product from Microsoft any time soon,” Gray quips, “but certainly applications like this have unusual needs, and to the extent we can see ways of meeting those needs, we can improve our products.”

— Bernadette Toner
