Platform/Site Reliability Engineer
Job in
Seattle, King County, Washington, 98127, USA
Listed on 2026-01-12
Listing for:
Axiom Software Solutions Limited
Full Time
position Listed on 2026-01-12
Job specializations:
-
IT/Tech
Cloud Computing, Systems Engineer, Data Engineer, SRE/Site Reliability
Job Description & How to Apply Below
Overview
We are looking for a skilled Platform Engineer / SRE to design, implement, and maintain our cloud infrastructure and platforms. The ideal candidate will have a strong background in Kubernetes administration, Azure cloud services, infrastructure as code, and automation. You will play a crucial role in ensuring the scalability, reliability, and security of our systems while supporting our AI/ML initiatives.
Responsibilities- Design, deploy, and manage infrastructure solutions using Terraform, ensuring scalability, security, and reliability.
- Develop and maintain infrastructure as code scripts to automate the provisioning and configuration of resources.
- Ensure version-controlled, repeatable deployments using IaC best practices.
- Implement and manage Kubernetes clusters for containerized applications.
- Collaborate with development teams to deploy, scale, and optimize applications in Kubernetes environments.
- Leverage scripting languages (e.g. Python) to automate routine tasks and streamline workflows.
- Implement continuous integration and continuous deployment (CI/CD) pipelines for efficient software delivery.
- Ensure seamless integration of infrastructure components with CI/CD pipelines.
- Design, deploy, and maintain scalable and reliable infrastructure for AI/ML platforms.
- Implement containerization (Docker) and orchestration (Kubernetes) solutions for deploying and managing AI/ML applications.
- Ensure containerized applications are secure, scalable, and easily deployable.
- Enable seamless integration of AI/ML models into the platform, ensuring data pipelines are efficient and reliable.
- Establish monitoring and alerting systems to ensure the health and performance of AI/ML platforms.
- Implement security best practices for AI/ML platforms, ensuring data privacy and compliance with industry standards.
- Bachelor's degree in computer science, Engineering, or a related field.
- Proven experience in Kubernetes administration, specifically with Azure Kubernetes Service (AKS).
- Strong proficiency in Azure cloud services and Azure ARM templates.
- Expert-level scripting skills in Power Shell and Python.
- Hands-on experience with Terraform for infrastructure as code.
- Solid understanding of CI/CD principles and experience with Azure Dev Ops.
- Experience with containerization technologies, particularly Docker.
- Strong problem-solving skills and ability to work in a fast-paced environment.
- Excellent communication and collaboration skills.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×