CHICAGO (GenomeWeb) – The Simons Genome Diversity Project (SGDP) is migrating to the cloud, courtesy of a deal with biomedical data analysis company Seven Bridges, the groups announced today.
Cambridge, Massachusetts-based Seven Bridges is making the SGDP data available through both its platform and the Cancer Genomics Cloud. Researchers will now be able to analyze the 35 terabytes of Simons data in conjunction with their own datasets and with other large sets already hosted on the Seven Bridges platform, including The Cancer Genome Atlas and the Cancer Cell Line Encyclopedia.
The SGDP is the largest dataset of human genetic variation collected to date, Seven Bridges noted. It includes whole genomes from 279 individuals representing 130 diverse populations. The samples were selected with the intent of capturing a wide variety of geographic, anthropological, and linguistic diversity that may be missing from other genomics datasets, the firm added.
"One of the most important components of scientific practice is the ability of scientists to replicate analysis, reanalyze data, build on it, and come to their own conclusions," Harvard Medical School's David Reich, a director of SGDP, said in a statement. "The Seven Bridges platform and tools provide a new way for researchers all over the world to leverage our data and make new discoveries."
Financial and other terms of the collaboration were not disclosed.
This deal is the latest of several similar partnerships for Seven Bridges. In November 2016, the company integrated SolveBio's genomic data analysis into its platform. It was also one of three platforms selected for the National Cancer Institute's Cancer Genomics Cloud Pilot initiative, and it has a contract with the US Department of Veterans Affairs to develop hybrid cloud genome data analysis and a genotype-phenotype graph-analysis engine for the VA's Million Veteran Program.