We are seeking an experienced software developer to develop a new system for the capture of complex experimental data from model organism genetics research. You will join the Non-vertebrate Genomics team at the European Bioinformatics Institute (EMBL-EBI), located on the Wellcome Genome Campus near Cambridge in the UK, working on the WormBase project.
You will work on a new modern and scalable back-end database and associated software for WormBase (www.wormbase.org) project. The system will be developed for, and applied to, the curation, organisation and dissemination of the complex interconnected data from the vast body of literature on the genetics, genomics and biology of the model nematode Caenorhabditis elegans. However, our aim is that it should be sufficiently general to act as a unifying framework for all model organism database projects.
The role involves the development of the following:
- A graph-based data model for the representation of data from model organism genetics and genomics research;
- Implementation of the data model using the modern NoSQL database Datomic (www.datomic.com);
- Software for converting existing model organism data (initially from WormBase) into the new database;
- APIs for interacting with the database programmatically;
- Tools to allow curators to interact with the database from a web-browser.
WormBase is an international consortium comprising groups at EMBL-EBI, the Wellcome Trust Sanger Institute, the California Institute of Technology and the Ontario Institute for Cancer Research, and you will work closely with developers and curators at all four sites. The WormBase team at EMBL-EBI are part of a wider genomics group that produces Ensembl and Ensembl Genomes (www.ensembl.org, www.ensemblgenomes.org) and participates in numerous global collaborative genomics projects (e.g. VectorBase, PomBase). You will work in an environment that has a proven track-record of producing world-leading infrastructure and tools for the management of biological data.