Grafana Observability Architect

Job in Dallas, Dallas County, Texas, 75215, USA
Listing for: Lorven Technologies, Inc.
Full Time, Seasonal/Temporary position
Listed on 2026-01-12
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, IT Support, Cybersecurity
Salary/Wage Range or Industry Benchmark: 100,000 - 125,000 USD yearly
Job Description & How to Apply Below

Hi,

Our client is looking for a Grafana Observability Architect for a Contract/Full Time position in Dallas, TX (Hybrid). The detailed requirements are below.

Kindly share your updated resume to proceed further.

Position

Grafana Observability Architect

Location

Dallas, TX / Tampa, FL (Hybrid)

Job Mode

Contract/Full Time

Job Summary

We are seeking a highly skilled and motivated Grafana Observability Architect with experience in the design, implementation, and optimization of observability solutions using the Grafana ecosystem. The ideal candidate will work closely with platform engineers, SREs, developers, and business stakeholders to ensure end-to-end visibility into system performance, reliability, and user experience across distributed systems.

Required Qualifications
  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
  • 5 years of experience in DevOps, SRE, or infrastructure automation roles.
  • 3 years of hands‑on experience with Grafana and dashboard development.
  • Strong proficiency in scripting languages (Python, Bash, Go).
  • Experience with monitoring tools (Grafana Cloud, Prometheus, Loki, Dynatrace, Splunk, etc.).
  • Deep understanding of CI/CD and cloud platforms (AWS and Azure).
  • Expertise in Kubernetes, Docker, and container orchestration.
  • Familiarity with security and compliance in automated environments.
  • Hands‑on experience with OpenTelemetry instrumentation and data collection.
Preferred Qualifications
  • Grafana certification or equivalent experience.
  • Experience with custom Grafana plugins or panel development.
  • Knowledge of business intelligence tools and data visualization principles.
  • Contributions to open‑source DevOps or observability projects.
  • Strong communication and stakeholder management skills.
  • Experience with OpenTelemetry Collector configuration and integration.
  • Familiarity with distributed tracing concepts.
Key Responsibilities
  • Architect and implement observability platforms using Grafana, Tempo, Loki, Mimir, and Prometheus.
  • Design and maintain scalable telemetry pipelines using OpenTelemetry and Grafana Agent.
  • Define and enforce observability standards, SLIs/SLOs, and alerting strategies.
  • Collaborate with application and infrastructure teams to instrument services for metrics, logs, and traces.
  • Develop reusable dashboards and templates for performance monitoring and incident response.
  • Design and implement visually compelling and data‑rich Grafana dashboards for Observability.
  • Integrate Grafana Cloud with data sources such as Prometheus, Loki, ServiceNow, PagerDuty, Snowflake, and AWS.
  • Integrate telemetry data sources such as Tomcat, Liberty, Ping, Linux, Windows, databases (Oracle, PostgreSQL), and REST APIs.
  • Create alerting mechanisms for SLA breaches, latency spikes and transaction anomalies.
  • Develop custom panels and alerts to monitor infrastructure, applications, and business metrics.
  • Collaborate with stakeholders to understand monitoring needs and translate them into KPIs and visualization requirements.
  • Optimize dashboard performance and usability across teams.
  • Implement and manage OpenTelemetry instrumentation across services to collect distributed traces, metrics, and logs.
  • Integrate OpenTelemetry data pipelines with Grafana and other observability platforms.
  • Develop and maintain OpenTelemetry collectors and exporters for various environments.
  • Develop and implement monitoring solutions for applications and infrastructure to ensure high availability and performance.
  • Collaborate with development, operations, and other IT teams to ensure monitoring solutions are integrated and aligned with business needs.
DevOps & Automation
  • Architect, design and maintain CI/CD pipelines using tools such as Jenkins, Bitbucket, and Nexus.
  • Implement Infrastructure as Code (IaC) using Terraform and Ansible.
  • Automate deployment, scaling, and monitoring of both cloud‑native and on‑premises environments.
  • Ensure system reliability, scalability, and security through automated processes.
  • Collaborate with development and operations teams to streamline workflows and reduce manual intervention.
SME Responsibilities
  • Act as a technical advisor on automation and observability best practices.
  • Lead initiatives to improve system performance, reliability, and developer productivity.
  • Conduct training sessions and create documentation for internal teams.
  • Stay current with industry trends and emerging technologies in DevOps and observability.
  • Advocate for and guide the adoption of OpenTelemetry standards and practices across engineering teams.
  • Optimize monitoring processes and tools to enhance efficiency and effectiveness.