Scientific Data Architect/Analyst | GenomeWeb

Scientific Data Architect/Analyst

Regeneron Pharmaceuticals
Job Location
777 Old Saw Mill River Road
Tarrytown, NY 30907
Depending on experience
Job Description

The Therapeutic Target Discovery R&D group at Regeneron is seeking a Data Scientist to develop infrastructure and software and facilitate/conduct cleaning, curation, modeling and analysis of large scale data and metadata screen generated by our “Tier1” high-throughput gene knockout embryonic and adult mouse phenotypic screen, with the ultimate goal of revealing insights into human biology.  The Tier1 process is a broad-based phenotyping screen that includes: lacZ gene expression mapping to determine patterns of reporter gene expression, morphological phenotyping to determine developmental implications of gene removal, hematology/serum chemistry to assess blood cell, metabolic and other activities, PIXI/microCT evaluation to identify any alterations in bone density or lean/fat tissue composition, tumor phenotyping to explore the effects of gene removal on tumor growth and development, immunophenotyping to identify role in development and response of the immune system, and next generation sequencing-based Transcriptome analysis to identify changes in the expression levels of all gene messages.

The successful candidate will work within the team of biologists, data analysts and other scientists who collectively compose the Tier1 program.  The position will closely coordinate and collaborate with other scientists throughout discovery, including bioinformaticians conducting transcriptome analyses, programmers and scientists at the Regeneron Genetics Center conducting human genomics and phenotypic data analyses, and scientific database programmers and administrators building and maintaining custom systems of mice, genes, constructs, process and observational data.

Responsibilities include, but are not limited to:

• working with programmers, database administrators and scientists to clean and extract data from custom scientific databases and commercial LIMS systems
• facilitate and conduct data analysis, including mining and curating of phenotypic datasets with primary responsibility in developing infrastructure, data models software, and statistically sound algorithms to facilitate genotype:phenotype associations and gene:anatomy reporter expression annotations
• integration and comparison of Tier1 results to human genomic and phenotypic data provided by the Regeneron Genetics Center, and public mouse phenotypic and genomic data repositories including MGI, IMPC, Sanger, BioGPS, EMAP, Allen Brian Atlas, and others
• implement GUIs and GUXs or other software to enable a scalable data warehousing and informatics framework, quality control and data mining/querying by department team members and broader Regeneron scientists
• close collaboration and coordination with scientific database programmers and Molecular Profiling team members mining transcriptome data.  Work with these collaborators to structure data and develop algorithms, rules engines, and querying tools to access and curate the phenotypic datasets.


The position requires a minimum of a BS in Computer Science, Bioinformatics or other relevant analytical/data field, a minimum of three years of relevant experience, and interest and familiarity with mouse phenotype data generated by some or all of the assays listed above.
Additional requirements:
• Familiarity with data mining, mouse and biological ontologies, and database architecture, development and administration
• Experience working with genomic and bioinformatics investigators
• Proficiency with user interface development, and database driven Web-based front ends to curate and explore biological data, including web programming tools and software such as Javascript, Python, Ruby on Rails, RESTful, JSON
• Proficiency with databases and database concepts, SQL and NoSQL databases and query tools, and previous experience with bioinformatics tools and software such as R, SAS, SpotFire, Perl, MATLAB
• Demonstrates critical thinking and strong analytical and problem solving skills
• Demonstrates the ability to multitask and manage simultaneous projects to meet deadlines with a strong attention to detail
• Demonstrates the ability to work both independently, a strong team spirit, and commitment to share and collaborate

Preferable but not necessary:

• Experience with data analyses of high-throughput phenotyping projects
• Experience working with public mouse phenotyping data from repositories such as IMPC, MGI, Sanger, or similar

How to Apply


This is an opportunity to join our select team that is already leading the way in the Pharmaceutical/Biotech industry.  Apply today and learn more about Regeneron’s unwavering commitment to combining good science & good business.

Regeneron is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability status, protected veteran status, or any other characteristic protected by law.

About Our Organization

Known for its scientific and operational excellence, Regeneron is a leading science-based biopharmaceutical company that discovers, invents, develops, manufactures, and commercializes medicines for the treatment of serious medical conditions. Regeneron commercializes medicines for eye diseases, high LDL-cholesterol, and a rare inflammatory condition and has product candidates in development in other areas of high unmet medical need, including oncology, rheumatoid arthritis, asthma, atopic dermatitis, pain, and infectious diseases.

Bitesize Bio offers some word of wisdom for designing a new lab.

A study finds that some women choose science majors later in their college careers.

The US National Labor Relations Board rules that graduate assistants have the right to unionize.

Sociologists find that dual-career programs are important for recruiting female academics, Inside Higher Ed reports.