Machine Learning Engineer

  • Posted 60+ days ago | Updated 15 days ago

Overview

Remote
Depends on Experience
Full Time
No Travel Required
Able to Provide Sponsorship

Skills

natural language processing
vector databases
LLMs
HuggingFace
Langchain
Open AI
pandas
scikit-learn
gensim
nltk
tensorflow/pytorch
Machine Learning
Spark
data science
databricks
AI
scikit - learn
TensorFlow
big data

Job Details

Background:

Computomic was founded in 2019 with a vision to design and deploy mission critical and highly differentiated Data & AI solutions for companies looking to migrate their data stack from legacy data platforms such as Teradata, Cloudera, Datastage and Informatica to modern data platforms like Databricks. Over the last few years, we have become a leading Databricks partner, building a strong and mature Databricks practice and delivering Databricks migration projects for some of the world s leading companies.

Our consulting services and tooling help companies realize value from their data by modernizing their data stack to the latest data platform technologies like Databricks. We use accelerators that save companies up to 80% of time and money moving data from legacy platforms to Databricks. We help customers find margins of improvement in everything they do, which when aggregated creates a transformative effect on their business.

We are a nimble and rapidly growing company with a global footprint, and a focus on Data & AI. We have built an impressive depth of Data & AI experts and experts in BI/ Dashboarding tools such as PowerBI and Tableau

Our name Computomic is derived from two words: Computers and Atomic. We believe in making one small change (Atomic) at a time, so that the cumulative effect of all the changes delivers a massive impact.

The Role:

We are one of the fastest growing partners for Databricks and are looking for a Machine Learning Engineer to help us build out one of the most impactful GenAI practices in the industry!

You will have the opportunity to shape the future of the GenAI landscape at leading Fortune 500 companies and cutting-edge startups. You will work on the industry s most challenging customer engagements to solve big data problems using leading cloud platforms such as AWS/Azure/Google Cloud Platform and Databricks.

You will have the opportunity to work with some of the leading experts in the data and AI industry and develop a deep understanding of Databricks and adjacent technologies. You will be empowered to architect, scope, negotiate and lead data GenAI projects, develop best practices and thought leadership, and represent both Computomic and Databricks at Industry forums and events.

You will join a team of experts with the autonomy and flexibility to make quick decisions, forge your own paths and adapt to the changing market and customers needs.


The impact you will have:

  • Develop LLM solutions on customer data such as RAG architectures on enterprise knowledge repos, querying structured data with natural language, and content generation
  • Build, scale, and optimize customer data science workloads and apply best in class MLOps to productionize these workloads across a variety of domains
  • Advise data teams on various data science such as architecture, tooling, and best practices
  • Present at conferences such as Data+AI Summit
  • Provide technical mentorship to the larger ML SME community in Databricks
  • Collaborate cross-functionally with the product and engineering teams to define priorities and influence the product roadmap

What we look for:

  • Experience with the latest techniques in natural language processing including vector databases, fine-tuning LLMs, and deploying LLMs with tools such as HuggingFace, Langchain, and OpenAI
  • 4+ years of hands-on industry ML engineering experience, leveraging typical machine learning and data science tools including pandas, scikit-learn, gensim, nltk, and TensorFlow/PyTorch
  • Experience building production-grade machine learning deployments on AWS, Azure, or Google Cloud Platform
  • Graduate degree in a Computer Science/Engineering or equivalent practical experience
  • Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike
  • Passion for collaboration, life-long learning, and driving business value through ML
  • [Preferred] Experience working with Apache Spark to process large-scale distributed datasets

What we offer:

  • Excellent Compensation, including bonuses
  • 401k Plan and Employer matching
  • Robust Health and Wellness Benefits including employer paid health insurance for employees
  • Referral Program
  • Paid Time Off and Sick Leave
  • Professional Development Assistance
  • Excellent work culture
  • Remote/Hybrid working option (location is Princeton, NJ)
  • Training & Certification Plan, on Databricks + adjacent technologies
  • Career Development Plan, including management/ leadership training

Are you passionate about this opportunity, but speculating that you don t meet 100% of the experience we re looking for? We still want to hear from you!

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Computomic