More jobs:
Job Description & How to Apply Below
6+ Years
Permanent/ Bangalore - Hybrid
Job Description
Must Have:
- Atleast 5 years of relevant experience in working on Observability stack as defined above.
- Has managed and operated Datadog Platform.
- Strong communication skills to interact with global teams.
- Fundamental knowledge of working and operating on AWS using IAC practices.
Beginning March, we need to start a new project for migration of our Observability Infra Stack from self hosted AWS ( Prometheus/Grafana, Loki,Mimir) to Datadog Solution ( SAAS).
The good resources that will focus on Engineering deliverables set by the organization SRE Team for migration.
SKILLS:
1. Working Knowledge of Prometheus and PromQL:
- Ability to read, understand, and modify existing PromQL queries, dashboards, and alerting rules, including common aggregations and label usage.
2. Grafana and Alert manager Familiarity:
- Experience navigating Grafana dashboards and Alert manager configurations to understand intent, thresholds, and alert routing.
3. Datadog Dashboarding and Monitors
- Hands-on experience creating Datadog dashboards and monitors based on defined requirements, using existing patterns and guidance.
4. Query and Alert Semantics Translation
- Ability to accurately map PromQL queries and Alert manager rules to Datadog equivalents, recognising non-1:1 translations, validating statistical correctness, and documenting functional differences where exact parity is not possible.
5. Observability Concepts
- Understanding of metrics vs logs vs traces, alert thresholds, and standard monitoring practices in production environments.
6. Team Collaboration
- Ability to work with engineering teams to validate migrated dashboards and alerts, following structured validation checklists.
7. Clear Execution and Documentation
- Documenting migrated assets, assumptions, and validation outcomes in a consistent, predefined format.
8. Automation Skills
- Proficient is building tooling using python to reduce engineering toil for these migration activities.
Nice to Have:
- AWS Administrator Certifications.
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×