And Back

The SARS-CoV-2 genome sequences that disappeared from a database about a year ago are back online at a different database, according to the New York Times.

In June, Jesse Bloom from the Fred Hutchinson Cancer Research Center reported in a preprint posted to bioRxiv that, though he was unable to find a number of viral genomic sequences that were supposed to be at the Sequence Read Archive, he was able to reconstruct 13 missing sequences by recovering files from Google Cloud. According to Bloom, the reconstructed data suggested that early pandemic viral samples had characteristics similar to bat coronaviruses and the data may have been removed from the SRA to "obscure their existence." Others, though, were skeptical of a cover-up, as Science reported then.

The Times now reports that the viral genome sequences were uploaded in early July to a China National Center for Bioinformation database. It adds that the sequences' disappearance seems to stem from an editorial error: the journal Small, which published the initial viral sequencing work, accidentally deleted a data availability statement, leading the researchers to think the data did not have to be stored at the SRA. The journal tells the Times that it is issuing a correction and a link to where the data is now kept.
