Statistical Genetics Analyst - Regeneron Genetics Center

Regeneron Pharmaceuticals
Job Location
777 Old Saw Mill River Road
Tarrytown, NY 10591
Commensurate to experience

Health insurance, annual bonuses, stock option, 401k match, Pay Time Off,  tuition reimbursement, and more.

Job Description

Job ID: 6596BR

Location: Tarrytown, New York

Department: Analytical Genetics, Genetics Center

The Analytical Genetics team in the Genetics Center is responsible for the design, statistical analysis, and interpretation of all genetic studies conducted by the center. Analysts will work under the direction of managers and other scientific collaborators to (i) perform quality control checks and data management of large genomic sequencing and phenotypic datasets derived from electronic health records, display and report summary statistics for these data, creating analysis-ready datasets by implementing study designs developed for population and family-based projects, implementing inclusions / exclusions, and carrying out phenotype adjustments and transformations; (ii) carry out statistical genetic analyses to determine genotype-phenotype relationships including regression, logistic regression, rare variant aggregation tests, family-based tests in pedigrees and trios for quantitative biomarkers and disease traits; (iii) work with computational biologists and others to develop and provide programming support for the creation of algorithms and automated pipelines for analysis of NGS and array data (iv) support the creation of queryable databases summarizing results across multiple projects. The successful candidate will have strong programming and statistical skills, experience in the analysis of large datasets, have strong communication and collaborative skills, and exhibit meticulous attention to details. Ideally, the candidate will have experience in the analysis of genomic data, such as whole exome, whole genome, targeted sequencing data, or array based genotype data; experience with statistical methods for genetic association analysis, familiarity with concepts of study design for epidemiologic and family studies, and knowledge of online tools and databases for genetic analyses and interpretation. Excellent communication skills are required to present and convey results and findings to the statistical genetics team and the broader RGC group.  This position will provide exciting opportunities to collaborate with industry and academic investigators. The RGC hosts a vast amount of data encompassing thousands of phenotypes derived from electronic medical records, integrated with genomic data. Together, these represent a landmark collection of information that will move precision medicine and novel therapeutic discovery forward as a new data-driven paradigm in healthcare.

• Perform a variety of statistical genetic analyses of quantitative and qualitative traits, including population and family-based genomic association studies (variant-, gene – and pathway-wise), genome wide association scans of common haplotypes, tests of gene x gene or gene x environment interactions, assessment of gene sets or pathways.
• Develop, maintain and evaluate quality control reports for high throughput sequence data.
• Develop and maintain analysis data sets across multiple phenotypes and studies, by the appropriate study design.
• Carry out various programming tasks as directed by your manager, for example, power calculations or simulation experiments.
• Interact with biostatisticians, epidemiologists and clinical scientists to carry out statistical data. analysis to promote gene mapping and target discovery.
• Assist in the preparation of scientific presentations, manuscripts, reports, and grant proposals including the preparation of tables, figures, and graphs depicting research results.


This position requires a minimum of an MS, or BS with 3 years relevant experience in Bioinformatics, Biostatistics, Genetic Epidemiology, Genetics, Genomics or related field.  Additional requirements include:
• Experience with handling & managing large datasets/software associated with high-throughput instruments for genetics/genomics.
• Excellent oral and written communication skills as well as documented skills in communicating scientific findings.
• Strong work ethic and enthusiasm for collaborative science.
• Experience working independently and in a team environment.
• Statistical knowledge and experience as well as strong background in computer programming.
• Strong knowledge in use of statistical software (e.g. R, Perl/Python, etc.), with ability to perform statistical analysis, interpretation, and effective communication results.
• Essential skills include strong programming skills in R, Perl/Python, with experience in statistical genomic analysis, proficiency in the management of large genomic datasets generated by microarrays and high throughput sequencing, and QC best practices.
• Experience in developing and applying statistical analysis in the context of genomic studies. Experience in development of computational algorithms in statistical genetics.
• Competence with high performance Linux/Unix computing environments, cloud-based computing experience a benefit.
• Strong collaborative and communication skills.
• Familiarity with genome databases (particularly those relating to genome annotation, variant annotation, and biological pathways) and biological & statistical software packages (such as gene annotation programs, Plink, Plink/Seq, BCFtools) that are relevant for the study of computational genetics and genomics.
• Experience in large-scale analysis of next generation DNA sequencing data (e.g. whole exome and whole genome) as well as array based genotype data.
• Ability to communicate clearly and succinctly about quantitative science in written and oral formats.
• Meticulous attention to quality assurance and quality control in all activities.

Level will be commensurate with experience.

How to Apply

Regeneron Genetics Center is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability status, protected veteran status, or any other characteristic protected by law.

About Our Organization

The Regeneron Genetics Center is a wholly-owned subsidiary of Regeneron Pharmaceuticals organized to collaborate with health systems and research groups to elucidate, on a large scale, genetic factors that cause or influence a range of human diseases.  Building upon Regeneron's strengths in mouse genetics and genetics-driven drug discovery and development, the Center will specialize in ultra-high-throughput exome sequencing and computational biology; discovery of genotype-phenotype associations through linkage to well-annotated de-identified patient electronic medical records; and validation of discoveries using Regeneron’s VelociGene® technology.  Our interests encompass a breadth of different areas such as Mendelian and family frameworks, large-scale population genetics (both common and rare variants), and gene-gene interactions.  Program goals include target discovery, indication discovery, and patient-disease stratification.  Objectives include advancing basic science around the world through public sharing of discoveries, providing clinically-valuable insights to physicians and patients of collaborating health-care systems, and identifying novel targets for drug development.

An analysis appearing in PeerJ finds that social media mentions of a paper may lead to increased citations.

NIH's Michael Lauer looks at the number of grants, their amount, and funding success rates at the agency for last year.

At Nature, Johns Hopkins' Gundula Bosch describes her graduate program that aims to get doctoral students thinking about the big picture.

Patricia Fara writes about childcare funding, and women in science and science history at NPR.