NEW YORK, Oct. 23-The National Human Genome Research Institute today announced a three-year, $15-million grant to integrate the three major international protein sequence databases into a 2-million-protein powerhouse.
The United Protein Database (or UniProt) will combine the two European databases, Swiss-PROT and TrEMBL, with the U.S.-based Protein Information Resource. The project should make data from these three resources easier to access.
UniProt will be divided into two segments: Swiss-PROT will include fully annotated records, as it does now, and TrEMBL will maintain the 700,000 data records that have not yet been hand-curated. The 283,000 entries from the Protein Information Resource will be folded into UniProt.
Swiss-PROT coordinator Rolf Apweiler will be the principal investigator, working with Swiss Institute of Bioinformatics' Amos Bairoch and Georgetown University Medical Center's Cathy Wu, who oversees the Protein Information Resource.
For more information, see the NHGRI statement.