Overview
Hybrid4days onsite and 1 day remote
Up to $100
Contract - W2
Contract - 4 Month(s)
Skills
distributed storage technologies
Python
Java
C/C++
Ruby
JavaScript
GitHub
GitLab
SQL
MySQL
databases
services
network
Dynatrace
Grafana
Job Details
Position Title: Site Reliability Engineer Lead (1)
Provide locations/flexible work by preference: 1. Phoenix Hub 2. Pittsburg Hub
Ability to work remotely: 4 days in the office, 1 remote
Industry background:
- Banking and financial preferred but open to diverse backgrounds
Roles and Responsibilities:
- Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding.
- Partner with development teams to improve services through rigorous testing and release procedures.
- Participate in system design consulting, platform management, and capacity planning.
- Create sustainable systems and services through automation and uplifts.
- Balance feature development speed and reliability with well-defined service-level objectives.
Level 4: 6+ years
Must-Have Technical Skills:
- Dynatrace
- SQL or other forms such as MySQL - understand the database concepts and run queries
- Prometheus - basic understanding
- Full stack engineer background experience - databases, services, network, know if there is a problem
- CICD - know how to navigate
- GitHub and GitLab - Aversion control storing and development
Flex Skills/Nice to Have:
- Grafana
Soft Skills:
Need to know how to collaborate - team player
Open to learning
Handle pressure
A good frame of mind
Screening Questions:
- What made you interested in this role?
- Why are you looking for a new position?
- What do you know about our company?
- Define Observability in 2-3 sentences.
Summary:
The main function is to monitor, automate, and improve the reliability, performance, and availability of software systems in an organization.
Job Responsibilities:
- Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding.
- Partner with development teams to improve services through rigorous testing and release procedures.
- Participate in system design consulting, platform management, and capacity planning.
- Create sustainable systems and services through automation and uplifts.
- Balance feature development speed and reliability with well-defined service-level objectives.
Skills:
- Ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript.
- Experience with distributed storage technologies.
- A proactive approach to identifying problems, performance bottlenecks, and areas for improvement.
- Previous success in technical engineering.
- Coding experience beyond simple scripts.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.