NEW YORK (GenomeWeb News) – The National Human Genome Research Institute plans to fund the development of new methods to manage and analyze the massive amounts of data to be generated by the international 1000 Genomes Project.
NHGRI will spread up to $3 million for each of the next two years, beginning July 2009, to between four and six proposals to develop the new technologies.
Specifically, the goal of these grants is to develop, evaluate, and implement new methods to monitor data quality, produce a set of data types from sequence data, and to produce new tools for additional analysis that will be used to produce the project’s final dataset.
Sequencing for the pilot projects began in January 2008, and sequencing for the full-scale project is expected to being this fall and last through 2010, when the entire dataset is expected to be around 20 terabases.
The 1000 Genomes Project is the first large-scale project to use next-generation sequencers for extensive human resequencing studies, according to the National Institutes of Health.
The sequencing will take place at centers funded by NHGRI, including Baylor College of Medicine, the Broad Institute of MIT and Harvard, Washington University in St. Louis, the Wellcome Trust Sanger Institute, the Beijing Genomics Institute Shenzhen, the Max Planck Institute for Molecular Genetics, as well as Applied Biosystems, Illumina, and Roche.
Examples of the kind of studies this funding will support include the development of methods for monitoring data quality; methods for validating gene variants of different frequencies and types by genotyping; methods for combining data across many samples; methods for integrating data on SNPs and structural variants into one map and placing them on one set of haplotypes; and tools to visualize, simulate, and summarize data.
More information about the 1000 Genomes Project Data Processing grants is available here.