Overview
On Site
Full Time
Skills
Information Technology
TypeScript
JavaScript
Amazon DynamoDB
Amazon S3
Amazon EC2
Couchbase
Capacity management
Software performance management
DevOps
Problem management
Mentorship
Educate
Trade shows
Training
Scripting
Amazon Web Services
High availability
Configuration Management
Ansible
Reporting
Dashboard
SAP BASIS
Microsoft Windows Server
Microsoft Operating Systems
Microsoft IIS
Docker
Kubernetes
Continuous Integration and Development
Continuous integration
Version control
Git
Apache Subversion
Communication
Writing
Management
Health information management
Teamwork
Job Details
Position: DevOps Engineer with Node, Cluster, AWS Native Service (Must)
Location: Chicago IL. - only Locals who can go onsite.
10+ years working in Information Technology
4+ years running production systems on AWS
Datadog
Experience with site monitoring and log monitoring tools, specifically Datadog.
Certified AWS SysOps Administrator a plus.
Experience with TypeScript & JavaScript
Experience with DynamoDB, S3 and Cognito
Good understanding of Serverless and CloudFormation
Good understanding of Serverless and CloudFormation
Experience in Node, Cluster, AWS Native Service
Understanding of best practices for AWS, EC2, EKS, Couchbase and monitoring of containers
Manages capacity planning, updates, upgrades and internal integration.
Responsible for administration of monitoring tools in the APM space
Coordinates with DevOps, Problem Management, etc escalations to support monitoring
Manages uptime and availability and reporting
Mentor, educate, and train support personnel on how to use tools
Maintains knowledge on current technology by reading technology periodicals, evaluating new technologies and attending trade-shows, technical seminars and training sessions.
Performs other duties as assigned and required. Duties and responsibilities may change from time to time without notice and include but are not limited to the duties described above
REQUIRED QUALIFICATIONS - KNOWLEDGE/SKILLS
Manage monitoring of overall application availability, latency and system health
Determine alert standards for production environments and implement them
Develop strategies for logging and indexing to improve visibility to development teams
Develop and manage configuration scripts for Amazon hosted infrastructure
Work with the development team and management to ensure high availability
Familiarity with configuration management software such as Ansible
Build reporting dashboards to assist visibility of cost and stability
Provide support to teams for alarms and outages on an as-needed basis
Experience with Windows Server, IIS, Docker/Kubernetes
Strong understanding of systems, networks and troubleshooting techniques.
Experience in automated build pipeline, and continuous integration. Source control, branching, & merging: git/svn/etc (Repository Management)
Communication Skills- The ability to communicate verbally and in writing with all levels of employees and management, speaks and writes clearly and understandably at the right level.
Integrity and Trust- Involves being widely trusted, being seen as a direct, truthful individual, can present the unvarnished truth in an appropriate and helpful manner, keeps confidences, admits mistakes, and doesn't misrepresent him/herself for personal gain.
Teamwork- Works well in a collaborative setting, volunteering for and completing assignments, acting as a positive team member by contributing to discussions, developing and maintaining relations.
Technical Expertise- A commitment to increasing knowledge and skills in current technical/functional area, keeping up to date on technical developments, staying informed as to industry practices, knowing how to apply relevant technical processes to appropriate business needs.
Location: Chicago IL. - only Locals who can go onsite.
10+ years working in Information Technology
4+ years running production systems on AWS
Datadog
Experience with site monitoring and log monitoring tools, specifically Datadog.
Certified AWS SysOps Administrator a plus.
Experience with TypeScript & JavaScript
Experience with DynamoDB, S3 and Cognito
Good understanding of Serverless and CloudFormation
Good understanding of Serverless and CloudFormation
Experience in Node, Cluster, AWS Native Service
Understanding of best practices for AWS, EC2, EKS, Couchbase and monitoring of containers
Manages capacity planning, updates, upgrades and internal integration.
Responsible for administration of monitoring tools in the APM space
Coordinates with DevOps, Problem Management, etc escalations to support monitoring
Manages uptime and availability and reporting
Mentor, educate, and train support personnel on how to use tools
Maintains knowledge on current technology by reading technology periodicals, evaluating new technologies and attending trade-shows, technical seminars and training sessions.
Performs other duties as assigned and required. Duties and responsibilities may change from time to time without notice and include but are not limited to the duties described above
REQUIRED QUALIFICATIONS - KNOWLEDGE/SKILLS
Manage monitoring of overall application availability, latency and system health
Determine alert standards for production environments and implement them
Develop strategies for logging and indexing to improve visibility to development teams
Develop and manage configuration scripts for Amazon hosted infrastructure
Work with the development team and management to ensure high availability
Familiarity with configuration management software such as Ansible
Build reporting dashboards to assist visibility of cost and stability
Provide support to teams for alarms and outages on an as-needed basis
Experience with Windows Server, IIS, Docker/Kubernetes
Strong understanding of systems, networks and troubleshooting techniques.
Experience in automated build pipeline, and continuous integration. Source control, branching, & merging: git/svn/etc (Repository Management)
Communication Skills- The ability to communicate verbally and in writing with all levels of employees and management, speaks and writes clearly and understandably at the right level.
Integrity and Trust- Involves being widely trusted, being seen as a direct, truthful individual, can present the unvarnished truth in an appropriate and helpful manner, keeps confidences, admits mistakes, and doesn't misrepresent him/herself for personal gain.
Teamwork- Works well in a collaborative setting, volunteering for and completing assignments, acting as a positive team member by contributing to discussions, developing and maintaining relations.
Technical Expertise- A commitment to increasing knowledge and skills in current technical/functional area, keeping up to date on technical developments, staying informed as to industry practices, knowing how to apply relevant technical processes to appropriate business needs.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.