Overview
On Site
BASED ON EXPERIENCE
Contract - W2
Contract - Independent
Contract - 5+ mo(s)
Skills
ETL ADMINISTRATION
ETL ADMIN TASKS
AB INITIO GDE
METADATA HUB
OPERATIONAL CONSOLE
CLOUDERA
HADOOP
HDFS
HIVE
AWS
AMAZON WEB SERVICES
SHELL SCRIPTING
LINUX
SQL
CI/CD TOOLS
JENKINS
GITLAB CI
PYTHON
DOCKER
KUBERNETES
Job Details
Job Description:
This is an Ab Initio Administration Lead position and not a developer position.
This is an Ab Initio Administration Lead position and not a developer position.
We are seeking a highly skilled Ab Initio Admin with a robust background in Ab Initio as well as with some experience with Cloudera and AWS to join our dynamic IT team. The Lead Ab Initio ETL Administrator is responsible for leading all the tasks involved in administration of ETL tool (Ab-Initio) in the Cloud. This position also required hands on work. Candidate will support the implementation of a Data Integration/Data Warehouse for the Data products on-prem and in AWS. Position does not have direct reports but is expected to assist in guiding and mentoring less experienced staff. May lead a team of matrixed resources.
Qualification Requirements:
- Advanced (expert preferred) level experience in administrating and engineering relational databases (ex. MySQL, PostgreSQL, Mongo DB, RDS, DB2), Big Data systems (ex. Cloudera Data Platform Private Cloud and Public Cloud), automation tools (ex. Ansible, Terraform, Bit Bucket) and experience working cloud solutions (specifically data products on AWS) are necessary.
- Require prior experience with AWS Cloud.
- At least 10 years of Experienced with all the tasks involved in administration of ETL Tool (Ab Initio)
- At least 10 years of Experienced with Advance knowledge of Ab Initio Graphical Development Environment (GDE), Meta Data Hub, Operational Console
- Created Big Data pipelines (ETL) from on-premises to Data Factories, Data Lakes, and Cloud Storage such as EBS or S3.
- Experience with Advance knowledge of UNIX, Linux, Shell scripting and SQL.
- Experience working and troubleshooting issues related to Hive, ICFF, HDFS.
- Experience with manage metadata hub-MDH, Operational Console and troubleshoot environmental issues which affect these components
- Experience with scripting and automation such as design and develop automated ETL process and architecture and unit testing of the ETL code
- Troubleshoot potential issues with Kerberos, TLS/SSL, Models, and Experiments, as well as other workload issues that data scientists might encounter once the application is running.
- Strong knowledge of Cloudera (Hadoop, HDFS, YARN, Hive, Spark) administration and management.
- Hands-on experience with AWS services (EC2, S3, RDS, VPC, IAM, etc.).
- Familiarity with scripting languages (Python, Shell, etc.)
- Experience with CI/CD tools (Jenkins, GitLab CI, etc.).
- Understanding of containerization technologies (Docker, Kubernetes) is a plus.
- Strongly demonstrated knowledge of DB2
- Represents team in all architectural and design discussions. Knowledgeable in the end-to-end process and able to act as an SME providing credible feedback and input in all impacted areas. Require tracking and monitoring projects and tasks as the lead.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.