Overview
Skills
Job Details
******************ONLY W2 CONTRACT******************
Title: Grafana & Telemetry Engineer
Location: Austin, TX and Sunnyvale, CA (Onsite)
Role Overview:Application Monitoring and Observability Engineers to lead the migration of our monitoring, tracing, and dashboarding solutions from Dynatrace to Sentry and Grafana. The ideal candidates will have expertise in monitoring systems, distributed tracing, and observability tools to ensure our applications' performance, stability, and user experience.
Key Responsibilities:
Lead the migration from Dynatrace to Sentry and Grafana, ensuring continuity of application monitoring, tracing, and alerting.
Design, build, and maintain monitoring dashboards and alerts using Grafana to visualize metrics and KPIs.
Integrate Sentry for error tracking and performance monitoring across applications.
Configure Prometheus or Loki for metrics and log aggregation, if required.
Collaborate with development, DevOps, and infrastructure teams to integrate monitoring tools into CI/CD pipelines.
Define and implement SLA-based alerts and notifications to track application performance and errors.
Provide root cause analysis (RCA) for critical incidents using distributed tracing and monitoring data.
Automate monitoring and alerting tasks through scripting (Python, Bash, or similar languages).
Ensure secure and compliant access to monitoring tools by configuring roles and permissions.
Document the migration process and develop knowledge base articles to streamline operations.
Required Skills & Experience:
Proven experience in application monitoring, tracing, and observability tools (Dynatrace, Grafana, Sentry, Prometheus, Loki).
Strong understanding of APM (Application Performance Management) concepts, distributed tracing, and error tracking.
Hands-on experience building custom Grafana dashboards and configuring alerts.
Proficiency in Sentry setup and integration across multiple applications.
Familiarity with PrometheLoki for metrics and log aggregation.
Knowledge of DevOps practices and experience integrating monitoring tools in CI/CD pipelines.
Strong programming/scripting skills (e.g., Python, Bash) to automate monitoring tasks.
Solid understanding of incident management, root cause analysis, and SLA tracking.
Experience with API integration and data transformation between observability tools.
Knowledge of security and compliance principles for access management and data governance.Preferred Qualifications:
Prior experience in migrating from Dynatrace to other observability tools.
Experience with microservices or cloud-native monitoring solutions.
Familiarity with Agile methodologies and collaborative team environments.
Certification in monitoring or observability platforms (e.g., Grafana or Prometheus) is a plus.
Note: If Interested to pursue, reach out at kirti at galaxy i tech dot com / six zero two two four Seven Seven eight one nine