More jobs:
Job Description & How to Apply Below
Role:
SRE & Devops (Ray.io) Support Engineer
Duration: Contract
Location:
India
About the Team
We are the AI Platform Team , enabling next-generation machine learning infrastructure for the Client's researchers and data scientists globally. Our mission is to build, operate, and support highly available, scalable AI platforms while driving Dev Ops and SRE best practices across the Client's AI organization .
We are looking for a highly motivated, self-reliant, and experienced SRE / Support Engineer who is passionate about AI platforms, operational excellence, and customer support.
Required Experience & Skills
Strong experience designing, building, and supporting software written in Python (C++ is a plus)
Hands-on experience with Ray.io , including cluster deployment, workload management, scheduling, and troubleshooting
Strong knowledge of Ray Dashboard and CLI tools for monitoring and debugging distributed jobs
Experience supporting distributed systems in production environments
Solid understanding of Kubernetes, Docker, and Linux
Strong debugging and issue triaging skills
Experience with Dev Ops practices , CI/CD pipelines, and automation
Familiarity with Jenkins and test automation frameworks
Excellent communication and collaboration skills
Ability to manage multiple priorities in a fast-paced environment
Strong written and spoken English skills
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×