Programmer Analyst - Analytical Genetics

Regeneron Pharmaceuticals
Job Location
777 Old Saw Mill Road
Tarrytown, NY 10591
Commensurate with experience
Job Description

Positions for Statistical Data Analysts are available at the RGC reporting to the Analytical Genetics team. Analysts will work under the direction of managers to:

- perform quality control checks and data management of large genomic sequencing and phenotypic datasets derived from electronic health records, display and report summary statistics for these data, and create analysis-ready datasets by implementing study designs developed for population and family-based projects, applying inclusion/exclusion criteria, and carrying out phenotype adjustments and transformations

- carry out statistical genetic analyses to determine genotype-phenotype relationships for quantitative biomarkers and disease traits, including regression, logistic regression, rare variant aggregation tests, and family-based tests in pedigrees and trios

- work with computational biologists and others to develop and provide programming support for the creation of algorithms and automated pipelines for analysis of NGS and array data

- support the creation of queryable databases summarizing results across multiple projects.

The successful candidate will have strong programming and statistical skills, experience in the analysis of large datasets, strong communication and collaborative skills, and meticulous attention to detail. Ideally, the candidate will have experience in the analysis of genomic data, such as whole exome, whole genome, or targeted sequencing data, or array-based genotype data; experience with statistical methods for genetic association analysis; familiarity with concepts of study design for epidemiologic and family studies; and knowledge of online tools and databases for genetic analyses and interpretation. Excellent communication skills are required to present and convey results and findings to the statistical genetics team and the broader RGC group.


• Perform a variety of statistical genetic analyses of quantitative and qualitative traits, including population and family-based genomic association studies (variant-, gene-, and pathway-wise), genome-wide association scans of common haplotypes, tests of gene x gene or gene x environment interactions, and assessment of gene sets or pathways.

• Develop, maintain and evaluate quality control reports for high throughput sequence data.

• Develop and maintain analysis data sets across multiple phenotypes and studies, according to the appropriate study design.

• Carry out various programming tasks as directed by your manager, for example, power calculations or simulation experiments.

• Interact with biostatisticians, epidemiologists and clinical scientists to carry out statistical data analysis to promote gene mapping and target discovery.

• Assist in the preparation of scientific presentations, manuscripts, reports, and grant proposals including the preparation of tables, figures, and graphs depicting research results.




This position requires a minimum of an MS degree and 2 years of related experience.  

• Strong programming skills in R, Perl/Python, with experience in statistical genomic analysis, proficiency in the management of large genomic datasets generated by microarrays and high throughput sequencing, and QC best practices.

• Experience in developing and applying statistical analysis in the context of genomic studies. Experience in development of computational algorithms in statistical genetics.

• Competence with high performance Linux/Unix computing environments; cloud-based computing experience is a benefit.

• Strong collaborative and communication skills.

• Familiarity with genome databases (particularly those relating to genome annotation, variant annotation, and biological pathways) and biological & statistical software packages (such as gene annotation programs, Plink, Plink/Seq, BCFtools) that are relevant for the study of computational genetics and genomics.

• Experience in large-scale analysis of next generation DNA sequencing data (e.g. whole exome and whole genome) as well as array based genotype data.

• Ability to communicate clearly and succinctly about quantitative science in written and oral formats.

• Meticulous attention to quality assurance and quality control in all activities.

How to Apply

Visit and search 5470BR .

For job inquiry, please contact .

About Our Organization

The Regeneron Genetics Center, a wholly-owned subsidiary of Regeneron Pharmaceuticals, is organized to collaborate with health systems and research groups to elucidate, on a large scale, genetic factors that cause or influence a range of human diseases.  Building upon Regeneron's strengths in mouse genetics and genetics-driven drug discovery and development, the Center will specialize in ultra-high-throughput exome sequencing and computational biology; discovery of genotype-phenotype associations through linkage to well-annotated de-identified patient electronic medical records; and validation of discoveries using Regeneron’s VelociGene® technology.  Our interests encompass a breadth of different areas such as Mendelian and family frameworks, large-scale population genetics (both common and rare variants), and gene-gene interactions.  Program goals include target discovery, indication discovery, and patient-disease stratification.  Objectives include advancing basic science around the world through public sharing of discoveries, providing clinically-valuable insights to physicians and patients of collaborating health-care systems, and identifying novel targets for drug development.

RGC is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability status, protected veteran status, or any other characteristic protected by law.

