Overview
On Site
Depends on Experience
Accepts corp to corp applications
Contract - Independent
Contract - W2
Contract - 12 Month(s)
Skills
Cloud Computing
Collaboration
Google Cloud Platform
Google Cloud
Kubernetes
Microsoft Azure
Amazon Web Services
Conflict Resolution
Job Details
Job Title: Site Reliability Engineer (SRE)
Job Description: We are looking for a talented Site Reliability Engineer (SRE) to join our team. The SRE will work to ensure the reliability, scalability, and performance of our infrastructure and services. You will collaborate with development teams to design, implement, and support systems that are fault-tolerant, highly available, and efficient.
Responsibilities:
- Ensure the uptime, reliability, and performance of production systems
- Automate operational processes and eliminate manual intervention
- Collaborate with developers to build scalable and resilient infrastructure
- Monitor and troubleshoot systems, identifying and resolving issues proactively
- Implement and maintain monitoring, logging, and alerting systems
- Participate in on-call rotation for production incident response
Requirements:
- Experience with cloud platforms (AWS, Google Cloud Platform, Azure)
- Proficient in programming/scripting languages (Python, Go, Shell, etc.)
- Strong knowledge of Linux/Unix systems and networking
- Familiarity with containerization and orchestration tools (Docker, Kubernetes)
- Solid understanding of CI/CD, automation, and infrastructure-as-code principles
- Strong problem-solving and troubleshooting skills
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.