GenBank release 164.0 is available via FTP from the National Center for Biotechnology Information. It contains data as of Feb. 12. The new release contains 85,759,586,764 base pairs and 82,853,685 entries. Uncompressed, the release 164.0 flat files require about 321 GB for the sequence files alone; the ASN.1 data require approximately 295 GB.
GATC Biotech has introduced a cross-platform barcoding system for multi-sample sequencing on Roche’s 454 GS FLX and Illumina’s Genome Analyzer. The barcodes are available to its next-generation sequencing service customers. According to the company, the technique is highly efficient, tagging 99.9 percent of sequences successfully. GATC has already used the barcodes in BAC sequencing projects with 50 individual tags using the GS FLX. The company plans to use the barcodes with its ABI SOLiD system as well.