Job Details
Data Scientist
Location: Washington, DC (hybrid, 3 days onsite)
Experience: 5-7+ years
Must have:
PhD in Computer Science from a U.S. university.
Experience in NLP and generative AI.
Qualifications & Requirements
Education: MS in Computer Science, Statistics, Math, Engineering, or a related field; PhD preferred
3+ years of relevant experience building large-scale machine learning or deep learning models and/or systems
1+ year of experience specifically with deep learning (e.g., CNN, RNN, LSTM)
1+ year of experience building NLP and NLG tools.
Experience with a wide range of LLMs (Llama, Claude, OpenAI, Cohere, etc.), LoRA, LangChain, RAG, LLM fine-tuning, and PEFT is preferred
Demonstrated skills with Jupyter Notebook, AWS SageMaker, Domino Data Lab, or comparable environments
Passion for solving complex data problems and generating cross-functional solutions in a fast-paced environment
Knowledge of Python and SQL, object-oriented programming, and service-oriented architectures
Strong scripting skills in shell and SQL
Strong coding skills and experience with Python (including SciPy, NumPy, and/or PySpark) and/or Scala.
Knowledge and implementation experience with NLP techniques (topic modeling, bag of words, text classification, TF-IDF, sentiment analysis) and NLP technologies such as NLTK, spaCy, or comparable libraries
Knowledge and implementation experience with statistical and machine learning models (regression, classification, clustering, graph models, etc.)