×
Register Here to Apply for Jobs or Post Jobs. X

AI DevOps and Cloud Infrastructure Engineer

Job in Cleveland, Cuyahoga County, Ohio, 44101, USA
Listing for: Crowe
Full Time position
Listed on 2026-01-16
Job specializations:
  • IT/Tech
    AI Engineer, Cloud Computing
Job Description & How to Apply Below

AI Dev Ops and Cloud Infrastructure Engineer

Join to apply for the AI Dev Ops and Cloud Infrastructure Engineer role at Crowe
.

At Crowe, you can build a meaningful and rewarding career. With real flexibility to balance work with life moments, you’re trusted to deliver results and make an impact. We embrace you for who you are, care for your well‑being, and nurture your career. Everyone has equitable access to opportunities for career growth and leadership. Over our 80‑year history, delivering excellent service through innovation has been a core part of our DNA across our audit, tax, and consulting groups.

That’s why we continuously invest in innovative ideas, such as AI‑enabled insights and technology‑powered solutions, to enhance our services. Join us at Crowe and embark on a career where you can help shape the future of our industry.

Job Description About Crowe AI Transformation

Everything we do is about making the future of human work more purposeful. We do this by leveraging state‑of‑the‑art technologies, modern architecture, and industry experts to create AI‑powered solutions that transform the way our clients do business. The new AI Transformation team will build on Crowe’s established AI foundation, furthering the capabilities of our Applied AI / Machine Learning team. By combining Generative AI, Machine Learning and Software Engineering, this team empowers Crowe clients to transform their business models through AI, irrespective of their current AI adoption stage.

As a member of AI Transformation, you will help distinguish Crowe in the market and drive the firm’s technology and innovation strategy. The future is powered by AI, come build it with us.

About The Team
  • We invest in expertise. You’ll have the time, space, and support to yılı deep in your projects and build lasting technical and strategic mastery. You’ll work with developers, product stakeholders, and project managers as a trusted leader and domain expert.
  • We believe in continuous growth. Our team is committed to professional development and knowledge‑sharing.
  • We protect balance. Our distributed team culture is grounded in trust and flexibility. We offer unlimited PTO, a flexible remote work policy, and a supportive environment that prioritizes sustainable, long‑term performance.
About

The Role

The AI Dev Ops and Cloud Infrastructure Manager әһ leads teams responsible for designing, operating, and scaling AI/ML infrastructure, cloud platforms, and Dev Ops automation that support enterprise model training, inference, and generative AI workloads. This role is the strategy and execution of cloud‑native, Kubernetes‑based platforms that enable reliable, secure, and cost‑efficient AI systems.

As a manager, this position combines hands‑on technical leadership with people management, delivery ownership, and strategic decision‑making. The manager oversees distributed compute environments, GPU clusters, CI/CD pipelines, and vector‑search infrastructure while ensuring high availability, resilience, and compliance with security and responsible AI standards. The manager partners closely with AI engineering, data engineering, product, and security teams, serves as the primary technical owner for assigned initiatives, and communicates system risks, tradeoffs, and progress to leadership.

Key Responsibilities
  • Leading engineering teams responsible for AI/ML infrastructure, cloud operations, and MLOps automation.
  • Defining cloud, Kubernetes, and infrastructure strategy to support scalable model training, inference, and generative AI platforms.
  • Guiding the design and operation of distributed compute environments, GPU clusters, and vector database infrastructure.
  • Overseeing CI/CD pipelines that automate model training, testing, deployment, monitoring, and lifecycle management.
  • Managing incident response, failure analysis, and reliability engineering across AI platforms.
  • Directing performance testing, capacity planning, and cost optimization for AI infrastructure.
  • Ensuring compliance with cloud security, IAM practices, governance requirements, and responsible AI frameworks.
  • Implementing multi‑cloud resilience patterns, high availability, and automated failover…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary
Learn4Good is currently undergoing necessary server maintenance.
We hope to have the Login & Registration options back in 5 minutes, and apologize for any inconvenience.