The Genomics Technology Infrastructure team seeks a talented and motivated software developer to work in the Ensembl project at the European Bioinformatics Institute (EMBL-EBI).
Ensembl (http://www.ensembl.org/) is one of the most successful large-scale bioinformatics projects and one of the leading projects for genome annotation. Ensembl has developed one of the most mature database schemas and APIs for genomic annotation storage and access. This is extensively used in our sister project Ensembl Genomes (http://www.ensemblgenomes.org).
You will be responsible for designing a new annotation storage system to sit alongside our existing infrastructure. This new system is responsible for the archiving, checksums and retrieval of transcripts across multiple releases of Ensembl and Ensembl Genomes annotation sets. It will provide infrastructure to track transcripts in a stable manner, which is critical for implementing genomics in the clinic. In addition this system will be capable of loading transcript annotation directly from an Ensembl database and from externally submitted files and will form a key part of our internal production infrastructure. The system will also be capable of tagging transcripts across multiple sources and disseminating all of this information via a public REST API to be feedback into the Ensembl ecosystem. You will report to the Ensembl Core Project Leader.