Senior Engineer, Infrastructure; DevOps
Listed on 2026-03-14
IT/Tech
Cloud Computing, SRE/Site Reliability
About Pryon:
We’re a team of AI, technology, and language experts whose DNA lives in Alexa, Siri, Watson, and virtually every human language technology product on the market. Now we’re building an industry-leading knowledge management and Retrieval-Augmented Generation (RAG) platform. Our proprietary, cutting‑edge natural language processing capabilities transform unstructured data into meaningful experiences that increase productivity with unmatched accuracy and speed.
We are growing and adding a DevOps/Infrastructure Engineer to our team, focused on platform architecture, CI/CD, and observability infrastructure. In this role, you will own the technical architecture and implementation of our cloud‑native, highly scalable RAG applications. You will design and manage the infrastructure, deployment pipelines, and operational procedures for delivering enterprise‑grade AI/ML products.
We’re looking for someone who will drive DevOps best practices and work with engineering teams to implement them effectively. You will own the platform’s reliability, scalability, and operational excellence across multiple cloud environments and on‑premises deployments. If you are looking for an opportunity to architect and drive modern DevOps practices for cutting‑edge AI/ML products in industry, we would love to hear from you!
In This Role, You Will:
- Design and implement cloud‑native architectures for AI/ML applications using Kubernetes (GKE, EKS, AKS)
- Architect and maintain CI/CD pipelines using modern GitOps practices with tools like FluxCD and Bitbucket
- Design and implement observability solutions using Prometheus, Grafana, and other monitoring tools
- Create and maintain Infrastructure as Code (IaC) using Terraform
- Implement container orchestration strategies using Docker, Kubernetes, and Helm
- Design and implement multi‑cloud deployment strategies
- Establish SLOs/SLIs and implement SRE best practices
- Automate operational tasks and create self‑healing systems
- Mentor team members on DevOps best practices
- Collaborate with ML engineers and researchers to optimize model deployment and serving infrastructure
- Stay current with emerging technologies and best practices in the DevOps/MLOps space
Requirements:
- 7+ years of experience in DevOps/Platform engineering
- Deep expertise in Kubernetes, Helm, and container orchestration
- Strong experience with a major cloud provider (GCP, AWS, Azure)
- Experience with CI/CD tools and GitOps practices
- Proficiency in Go, Python, or similar programming languages
- Experience with observability tools (Prometheus, Grafana, etc.)
- Knowledge of security best practices and compliance requirements
- Experience with Infrastructure as Code and configuration management
- Experience with MLflow, Airflow, Kubeflow, or Ray (desirable)
- BS degree in Computer Science or related field
- Excellent communication and collaboration skills
- Strong problem‑solving abilities and systematic thinking
- Experience working in an Agile environment
$180,000 - $200,000 a year
Benefits for Full Time Employees:
- Remote first organization
- 100% Company paid Health/Dental/Vision benefits for you and your dependents
- Life Insurance, Short‑term and Long‑term Disability
- 401k
- Unlimited PTO
We are interested in every qualified candidate who is authorized to work in the United States. However, we are not able to sponsor or take over sponsorship of employment visas at this time.
Pryon will not consider race, religion, sex, sexual preference, or national origin in ways that violate the Nation’s civil rights laws.