NEW YORK (GenomeWeb News) – Two teams of California scientists will use a $1.4 million grant from the National Science Foundation to create a complete bioinformatic workflow and other tools for supporting large volumes of data from next-generation sequencing studies, the University of California, San Diego said yesterday.
The funding to UCSD's San Diego Supercomputer Center (SDSC) and the California Institute for Telecommunications and Information Technology (Calit2) specifically will be used to create a Kepler Scientific Workflow System module.
The so-called bioKepler project seeks to address three central challenges related to how large-scale sequencing data can be analyzed by the scientific community, how those analyses can be configured or programmed by end-users, and how workflows can be executed in an efficient and intuitive manner.
"Next-generation DNA sequencing is now creating such a large amount of sequence data that it is overwhelming current computational tools and resources," principal investigator Ilkay Altintas, director of the SDSC Scientific Workflow Automation Technologies Lab, said in a statement. "New computational techniques and efficient implementation mechanisms for this data-intensive workload are needed to enable rapid analysis of these next-generation sequence data."
The bioKepler project will create scientific workflow components to develop an array of bioinformatics tools that will be used on multiple distributed platforms, such as cloud and grid platforms. These tools will be aimed at meeting an array of needs in a range of applications, such as sequence database searches; mapping; gene prediction; sequence assembly; gene prediction; clustering; multiple sequence alignment; phylogeny and taxonomy; protein annotation, and others.
All of the resources, materials, and software products the bioKepler project creates will be integrated into Calit2's Community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis, a data repository and bioinformatics resource for metagenomic analysis.