Overview
Skills
Job Details
No Sub-Contracting! Direct W/2's only! This is fully remote.
Standout skills in resume: Chemistry and/or Pharma (nice to haves), core skills required: Python, SQL Data Base and Kubernetes and familiar with Cloud Native Computing.
Project Scope: Automation Platform (full term of contract) End-to-end, input-agnostic data pipeline for RAPTR automation platform Point person to interact with Labman software engineer on robotics/analytical back end Build platform to interface with ELN, robotics, database, and analytics systems Containerized with Docker and deployed in Kubernetes based cloud solution Currently using Flask for frontend and FastAPI for backend, a switch from Flask to React is being evaluated Design and implement extraction protocol for raw Agilent LC-MS data Adopt and deploy RoSL/HORSS Spotfire analytics dashboard for use with RAPTR Requires use of IronPython for scripting ML-centric Database Development (rolling basis – full term of contract) AWS database for reaction data, analytical data, and molecule ID/parameter storage Carry on work of client apprentice to develop AWS database Emphasis on analytical and reaction data and ML parameter storage Technologies used: PostgreSQL, Fast API, Flask, Docker, Kubernetes Build pipeline to ingest data directly from Signals ELN MLOps / Web Apps (time-permitting) Production and maintenance of ML models within SMPS Work with gCS to host current ML models on Weights & Biases platform Focus on general purpose solution that will be extensible to other SMPS functions Build and deploy web apps for ingesting data into / querying AWS database POC in progress, can expand to various applications as needed Currently in Flask/FastAPI but can be switched to different platform Potential visualization libraries: RDKit for molecular structure; Bokeh, Plotly, Matplotlib for interactive graphs
Required skills:
4+ years as Data Scientist
Extensive experience in Python, Javascript, and SQL
Knowledge of cloud native development including familiarity with Docker and Kubernetes
Experience with web application development, especially three tier applications
Good coding practices including experience with Git Nice to have:
Experience with IronPython
Experience with Google Apps Script
Experience with the Spotfire platform
Libraries: React, Flask, FastAPI, RDKit
Experience with organic chemistry
General knowledge of machine learning