Job Details
Job Title: Data Scientist with R Programming Statistical Machine Learning Model Expert
Location: New Jersey
Duration: Long Term
Job Description
We are seeking a mid-level Data Science professional with at least 3 years of hands-on experience in R Programming, machine learning model development, and ETL processes, with proven expertise in Bitbucket and SQL. The ideal candidate will have a strong analytical mindset and the ability to develop and productionize models that drive real-world impact.
Role Overview
Design, develop, and deploy statistical and machine learning models using R programming to solve business problems across various domains.
Apply 3+ years of data science experience to support data-driven decision-making.
Key Responsibilities
Model Development
- Build and implement predictive models (e.g., regression, random forests) using R packages such as caret, dplyr, and tidyr.
- Analyze datasets to identify patterns, trends, and actionable insights.
Productionization
- Productionize R models, using Bitbucket for version control, and support CI/CD pipelines for deployment.
- Collaborate with data engineering teams to ensure scalable and robust implementation of models.
ETL Pipelines
- Design and maintain ETL workflows in R using tools like data.table and dplyr to ingest and transform data from various sources.
SQL Integration
- Write and optimize complex SQL queries to preprocess and integrate data with R-based workflows.
Data Analysis & Visualization
- Conduct exploratory data analysis and statistical assessments.
- Create intuitive visualizations and dashboards using ggplot2 or shiny to present insights to business stakeholders.
Collaboration & Documentation
- Engage with cross-functional teams to understand requirements and translate them into R-based solutions.
- Maintain detailed documentation of code, workflows, and processes within Bitbucket repositories.
Required Skills
- Strong expertise in R Programming and statistical modeling.
- Proficient with Bitbucket for version control and collaboration.
- Solid experience with ETL processes, data wrangling, and pipeline development.
- Advanced SQL skills for data extraction and preprocessing.
- Proficiency in R libraries: caret, dplyr, tidyr, data.table, ggplot2, shiny.
- Strong communication skills and ability to work collaboratively in a team environment.