Federal Observability Engineer; Clearance Required - Secret), Hybrid Remote & On-Site AL, UT
Oklahoma City, Oklahoma County, Oklahoma, 73116, USA
Listed on 2026-01-16
-
IT/Tech
Systems Engineer, Cloud Computing
Federal Observability Engineer, (Clearance Required - Secret), Hybrid Remote & On-Site AL, UT, PA, OK. This role has been designated as ‘Remote/Teleworker’, which means you will primarily work from home.
Who We Are:Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next.
We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.
You will be part of a larger technical team, working as an Observability Engineer in an Ops Ramp environment. You will be responsible for designing, implementing, and maintaining the observability infrastructure that provides deep insights into the health, performance, and behavior of HPE’s PCE environment and Cloud infrastructure in support of a Federal Customer. You will work closely with development, operations, and other teams to proactively identify and resolve issues, improve system performance, and optimize resource utilization.
- US Citizenship Required
- Clearance Required:
Secret
- Flexible Hybrid Role with requirement to work onsite at customer location as required
- M-F 9-5, Candidate must be flexible to work evening and weekends if required
- Flexibility to work on a monthly rotation schedule required
Ops Ramp Platform Expertise:
- Deeply understand and effectively utilize the full capabilities of the Ops Ramp platform for:
- Metrics Collection & Analysis:
Configure and manage data sources, define and monitor key performance indicators (KPIs), and analyze performance trends. - Log Management:
Configure log collection, aggregation, and analysis within the Ops Ramp platform. - Alerting & Notifications:
Create and manage alerts, define escalation paths, and integrate with incident management systems. - Automation:
Develop and implement automated workflows and remediation actions within the Ops Ramp platform. - Reporting & Dashboards:
Design and build custom dashboards and reports to provide key insights into system health and performance.
Infrastructure:
- Design, implement, and maintain observability solutions utilizing the Ops Ramp platform as the core technology.
- Integrate Ops Ramp with other monitoring and observability tools as needed (e.g., Prometheus, Datadog, Elastic Stack).
- Ensure data quality and integrity within the Ops Ramp platform.
- Utilize Ops Ramp data to troubleshoot and resolve performance issues, application errors, and other operational problems.
- Collaborate with development and operations teams to identify and fix root causes of issues.
- Participate in incident response activities, leveraging Ops Ramp data to accelerate resolution.
- Continuously evaluate and optimize the performance and effectiveness of the Ops Ramp platform.
- Stay updated on the latest features, enhancements, and best practices within the Ops Ramp ecosystem.
- Proactively identify areas for improvement and implement necessary changes.
Required:
- Certifications Required
- DD8750 - Security Plus or higher Security Certification (CISSP, CASP, etc)
- OS Level certifications are a plus
- Bachelor's degree preferred or Associate degree holder (technical field) with 6-8 years working experience in related fields desired.
Skills:
Technical
Skills:
- Strong understanding of cloud computing platforms (AWS, Azure, GCP).
- Experience with containerization technologies (Docker, Kubernetes).
- Proficiency in scripting languages (Python, Go, Bash).
- Experience with SQL and No
SQL databases. - Knowledge of networking protocols (TCP/IP, HTTP).
- Proven experience with the Ops Ramp platform is a strong plus.
- Experience with at least 1 other observability tool…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).