Senior Engineering Manager - ML Ops
Listed on 2026-03-01
-
IT/Tech
AI Engineer, Machine Learning/ ML Engineer, Cloud Computing
Everseen
: A leader in vision AI solutions for the world’s leading retailers.
The role:
We are seeking an experienced software engineering lead for the Everseen ML Operations department. You and your team will extend the capabilities of our scalable ML Ops infrastructure that empowers our data scientists and machine learning engineers to develop, train, benchmark, and monitor our machine learning models efficiently. You will be instrumental in enhancing our internal Machine Learning Platform and driving automation, reproducibility, and performance across the machine learning lifecycle.
Whatyou'll do
- Leadership
- Manages and leads the team reporting to them. Manages the day-to-day running of the team and the projects
- Has overall responsibility for the projects that the team is working on. Acts as a point of escalation for the team.
- Working with senior leadership to create strategic goals for their team. Measures performance of those goals.
- Monitors and supports team members' career paths. Evaluate team members' performance and propose promotions.
- Assesses and proposes team headcount adjustments based on the project roadmap and team capacity.
- Teaching and Sharing Culture
- Ensures the sharing of skills, knowledge, and expertise between members of the engineering team.
- Fosters a culture of collaboration and continuous learning by organizing training sessions, workshops, and knowledge-sharing sessions.
- Design and Development
- Coordinate sand drive progress with cross-functional teams in designing and developing new features and functionalities.
- Ensures that the developed solutions meet project objectives and enhance user experience.
- Coding
- Ensures design and implementation of reusable, testable, efficient, and elegant code based on requirements and a longer-term product and feature strategy
- Ensures adherence to coding standards and best practices.
- Integration of Third-Party Solutions
- Ensures the evaluation, integration, and maintenance of third-party software solutions to optimize system performance.
- Ensures expansion of product capabilities by integrating compatible third-party solutions. Be aware of and promote.
- Monitoring and Troubleshooting
- Ensures seamless operation and timely resolution of any anomalies to maintain system reliability.
- Documentation
- Responsible for ensuring that documentation is clear, comprehensive, and up-to-date while overseeing its creation, maintenance, and adherence to organizational standards.
- 5+ years of experience in either ML infrastructure, MLOps, or Platform Engineering.
- 5+ years of experience in leadership roles
- Inspirational leadership, strategic vision, culture shaping approach.
- Strong programming skills – Python / Go
- Hands‑on experience with Kubernetes, Docker, and cloud services.
- Experience with CI/CD tools (e.g., Git Lab, Jenkins).
- Understanding of ML training pipelines, data lifecycle, and model serving concepts
- Excellent communication and collaboration skills.
- Familiarity with workflow orchestration tools (e.g., Airflow, Kubeflow, Ray, Vertex AI, Azure ML).
- Proven ability to monitor and optimize cloud cost
- Good Understanding of data privacy, RBAC, and model governance
- Experience in Microsoft Azure or Google GCP cloud infrastructure and Machine Learning tools
- Experience with ML frameworks (e.g., Tensor Flow, PyTorch).
- Experience with GPU orchestration (e.g., NVIDIA GPU Operator, MIG).
- Experience with Infrastructure as Code (e.g., Terraform).
- Knowledge of data engineering tools (e.g., Snowflake, Databricks, Big Query, Airbyte, Kafka).
- Familiarity with feature stores and model registries.
- Exposure to large-scale distributed systems and performance optimization.
Everseen is a leader in vision AI. We are transforming business operations for global retailers, driving measurable business value and improving the customer experience.
We are a dedicated team of inventors, research scientists, engineers, AI experts and retail industry veterans. Our mission is to protect people, process, products and profitability within the retail sector and beyond.
We are trusted by major food, drug, mass, and specialty retailers around the world— including Kroger, Meijer, and…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).