Site Reliability Engineer; SRE Artificial Intelligence; AI Engineer
Listed on 2026-02-28
-
IT/Tech
Systems Engineer, Cybersecurity, Cloud Computing
The U.S. Navy’s Service Management, Integration, and Transport (SMIT) program has an opening for a Site Reliability Automation and Orchestration Engineer on a high-visibility DoD program that provides engineering support to the Navy Marine Corps Intranet (NMCI), the largest information technology (IT) network in the world. This position will provide many opportunities to challenge and grow your skills.
Key Responsibilities AIOps & Observability Intelligence- Design, develop, and maintain AI/ML models for anomaly detection, trend analysis, and signal correlation across metrics, logs, traces, and events.
- Reduce alert noise through intelligent alert grouping, suppression, and prioritization.
- Enhance observability platforms with AI-generated insights supporting SLO and error-budget management.
- Implement AI-driven incident classification, enrichment, and summarization.
- Provide probable root-cause analysis recommendations based on historical and real-time telemetry.
- Support on-call and incident response teams with AI-guided remediation suggestions.
- Contribute AI insights to post-incident reviews and reliability improvement plans.
- Apply AI techniques to identify repetitive operational tasks and automation opportunities.
- Assist in generating, validating, and optimizing automation playbooks and workflows.
- Analyze automation execution data to improve success rates, resiliency, and reuse.
- Build and maintain AI-searchable knowledge repositories containing runbooks, SOPs, lessons learned, and historical incident data.
- Enable natural-language access to operational knowledge for SREs and operations staff.
- Reduce dependency on tribal knowledge through intelligent documentation and discovery.
- Develop predictive models for capacity planning, failure forecasting, configuration risk, and reliability debt identification.
- Support proactive remediation strategies to prevent incidents before customer impact.
- Assist SRE leadership in data-driven prioritization of reliability investments.
- Ensure AI solutions adhere to organizational security, compliance, and data-handling policies.
- Establish guardrails for AI recommendations, human-in-the-loop decision making, and automation execution.
- Promote transparency, explainability, and auditability of AI-driven operational decisions.
Education and Requirements:
- Bachelor’s degree in computer science, Engineering, Information Systems, Data Science, or related discipline
- 5+ years in Site Reliability Engineering, Dev Ops, IT Operations, or Systems Engineering
- 2+ years applying AI/ML techniques in operational, analytics, or automation contexts
- Demonstrated experience supporting production systems in high-availability environments
- Must have an active Secret Clearance in order to be considered for the position
- Proficiency in data analysis tooling
- Experience with machine learning fundamentals (anomaly detection, clustering, time-series analysis, NLP)
- Familiarity with observability platforms (metrics, logs, traces, events)
- Experience with automation frameworks and infrastructure-as-code concepts
- Strong understanding of distributed systems and operational telemetry
Pay Range -
The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
OriginalPosting
February 18, 2026
For U.S. Positions:
While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
All qualified applicants will receive consideration for employment without regard to sex, race, ethnicity, age, national origin, citizenship, religion, physical or mental disability, medical condition, genetic information, pregnancy, family structure, marital status, ancestry, domestic partner status, sexual orientation, gender identity or expression, veteran or military status, or any other basis prohibited by law. Leidos will also consider for employment qualified applicants with criminal histories consistent with relevant laws.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).