Remote
•
Today
Job Description: Design, deploy, and scale our Prometheus architecture to handle 100+ million active series and beyond.Deploy and operate large, high-performance Elasticsearch clusters holding 2000+TB of data.Deploy and grow high-throughput data pipelines built on Kafka, handling hundreds of thousands of events per second.Design and build an alerting system that allows engineering teams to construct alerts from multiple data sources and alerting workflows.Write libraries and APIs that give engin
Easy Apply
Full-time
Depends on Experience