Data Engineer/Scientist

Overview

Remote
$77 - $78
Contract - W2
Contract - 11 Month(s)
No Travel Required

Skills

Python
SQL
Cloud native development including docker and Kubernetes
3 tier applications
Coding practices with Git
Organic chemistry is a big plus
Machine learning is a big plus
Process Chemistry and Catalysis is a big plus
4 years a Data Scientist required
Data Engineer
Software Engineer
IronPython nice to have
Spotfire nice to have
Chemistry is a plus
Pharma is a plus

Job Details

No Sub-Contracting! Direct W/2's only! This is fully remote.

Standout skills in resume: Chemistry and/or Pharma (nice to haves), core skills required: Python, SQL Data Base and Kubernetes and familiar with Cloud Native Computing.

Project Scope: Automation Platform (full term of contract) End-to-end, input-agnostic data pipeline for RAPTR automation platform Point person to interact with Labman software engineer on robotics/analytical back end Build platform to interface with ELN, robotics, database, and analytics systems Containerized with Docker and deployed in Kubernetes based cloud solution Currently using Flask for frontend and FastAPI for backend, a switch from Flask to React is being evaluated Design and implement extraction protocol for raw Agilent LC-MS data Adopt and deploy RoSL/HORSS Spotfire analytics dashboard for use with RAPTR Requires use of IronPython for scripting ML-centric Database Development (rolling basis – full term of contract) AWS database for reaction data, analytical data, and molecule ID/parameter storage Carry on work of client apprentice to develop AWS database Emphasis on analytical and reaction data and ML parameter storage Technologies used: PostgreSQL, Fast API, Flask, Docker, Kubernetes Build pipeline to ingest data directly from Signals ELN MLOps / Web Apps (time-permitting) Production and maintenance of ML models within SMPS Work with gCS to host current ML models on Weights & Biases platform Focus on general purpose solution that will be extensible to other SMPS functions Build and deploy web apps for ingesting data into / querying AWS database POC in progress, can expand to various applications as needed Currently in Flask/FastAPI but can be switched to different platform Potential visualization libraries: RDKit for molecular structure; Bokeh, Plotly, Matplotlib for interactive graphs

Required skills:

4+ years as Data Scientist
Extensive experience in Python, Javascript, and SQL
Knowledge of cloud native development including familiarity with Docker and Kubernetes
Experience with web application development, especially three tier applications
Good coding practices including experience with Git Nice to have:
Experience with IronPython
Experience with Google Apps Script
Experience with the Spotfire platform
Libraries: React, Flask, FastAPI, RDKit
Experience with organic chemistry
General knowledge of machine learning

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Advanced Software Talent