Data Scientist (Oncology)

Inspirata, Inc.
Job Location
1 N Dale Mabry Hwy
Suite 600
Tampa, FL 33609
Commensurate to experience

We offer an attractive and competitive compensation package including: salary commensurate to experience, performance incentive, and stock options.  Employees enjoy 100% employer paid benefits for medical, dental, vision, life, and disability coverage; PTO, paid holidays, 401(k), and EAP program.

Job Description

We are seeking a Data Scientist with deep expertise in Oncology, who will combine his or her knowledge of cancer-related data with technology, informatics, big data and analytics skills to help assemble a data platform that provides insights into the vast amounts of real world cancer data. 


You will be responsible for all activities necessary to access, mine and categorize structured and unstructured cancer related data and for presentation of that data in the Company’s big data platform.  You will be engaged in all data related activities, including data acquisition, design and implementation of data processing tools, data modeling, data architecture, data mining and statistical analysis to ensure the Company’s products access, organize and deliver accurate and meaningful information to researchers, clinicians and patients. 


You must be comfortable digging into databases, health records, data architecture, data processing tools with software engineering colleagues to ensure that data is correctly captured, managed and delivered.  You must be highly motivated, comfortable in a fast-paced entrepreneurial environment, with experience designing and developing highly dimensional data intensive products, strong product leadership and team working skills and an aptitude for new technologies. 



  • Align the Company’s business and product strategies with the required underlying data assets
  • Support the development and expansion of the Company’s oncology data assets to promote research, clinical and consumer use
  • Lead the definition, development, implementation and standardization of cancer data-related asset.
  • Provide expertise to fetch, process, cleanse, verify and QA raw data from various sources
  • Design data collection processes, data quality programs and analytic tools to optimize creation and delivery of clinical oncology content
  • Lead efforts to update, standardize and centralize disease factors such as cancer subtype, stage, therapy, and diagnostic factors
  • Select features, build, and optimize classifiers, using machine learning techniques or big data tools
  • Data mining, statistical analysis and visualization using state-of-the-art methods
  • Create automated anomaly detections systems and constant performance tracking
  • Collaborate with customers, product stakeholders and engineering to gather and document data-related requirements for accessing, cleaning, categorizing, organizing and mining data
  • Contribute expertise in medical informatics to support the utilization of the CIDT in research and clinical settings
  • Assist with the development, maintenance and adherence to policies, SOPs and data management plans related to data acquisition, use, security and compliance
  • M.S. or Ph.D. in a technical, scientific or quantitative field
  • 5+ years’ experience in a healthcare setting and familiarity with pathology and oncology data and medical records
  • Excellent understanding of machine learning techniques and algorithms, such as Naive Bayes, SVM, Decision Forests, etc.
  • Strong experience with common data science toolkits, such as R, Weka, MatLab, etc.
  • Proficiency in using query languages such as SQL, Hive, Pig
  • Experience with NoSQL databases, such as MongoDB, Cassandra, HBase
  • Good applied statistics skills, such as distributions, statistical testing, regression, etc.
  • Good scripting and programming skills to perform ETL and data cleansing operations
    • Data-oriented personality with hands on experience using Relational models, Object Models, Hierarchy data models and network data models
  • Familiarity with medical ontologies, lexicons and tools required
  • Strong project management skills and demonstrated ability to influence, motivate and collaborate with software development colleagues
How to Apply

Please reply via email to

About Our Organization

Inspirata is leading the transformation of pathology into a digital and automated workflow, so patients can get diagnosed faster and be given options for more precise therapies based on image analytics. Our efforts are focused on developing novel diagnostic, theragnostic, prognostic and predictive assays, as well as a global image-based cancer research and education data repository.

Inspirata is more than just another career stop; our team is committed to the company’s long-term success and the vital role we play. Our culture is based on trust and collaboration, where opinions matter and contributions make a difference.

If you have the skills and desire to be part of a dynamic team working on cutting edge of technologies that are transforming cancer diagnostics, then Inspirata is the place for you!

NIH's Michael Lauer looks at the number of grants, their amount, and funding success rates at the agency for last year.

At Nature, Johns Hopkins' Gundula Bosch describes her graduate program that aims to get doctoral students thinking about the big picture.

Patricia Fara writes about childcare funding, and women in science and science history at NPR.

National Institute of Environmental Health Sciences researchers have visualized the career paths of former postdocs.