Director of Platform Engineering & Operations (MAZDC) | Charlotte area, North Carolina, USA | IT/Tech

Position: Director of Platform Engineering & Operations -- MAZDC5764907

Role Summary

The Director of Platform Engineering & Operations is responsible for COMPANY’s entire technology platform—overseeing both customer-facing systems and internal infrastructure to ensure 24x7 availability, security, and scalability across Azure cloud and on-premise environments. This hands‑on leadership role balances technical execution and strategic management, including building a high-performing team, driving operational excellence, implementing security controls, and supporting the company’s rapid growth.

Key Responsibilities

Build, mentor, and retain a team of 8 engineers across infrastructure/network, DevOps/SRE, and desktop/end‑user support, providing technical coaching, career development, and performance management.
Define and track operational KPIs (availability, MTTR, change success rate, incident volume, cloud cost efficiency) and present regular updates to the CTO and executive team.
Take full ownership of platform strategy, roadmap, and execution aligned with business objectives, product needs, and customer SLAs.
Establish an operational cadence of incident reviews, a change advisory board, service desk metrics, and team retrospectives, and foster a culture of continuous improvement.
Own the design, implementation, and 24x7 operation of COMPANY’s hybrid infrastructure (Azure + on‑premise) supporting both production and internal corporate systems.
Ensure high availability, scalability, performance, security, and cost efficiency across all environments.
Provide hands‑on architecture and implementation of cloud infrastructure, networking, identity management (Azure AD/Entra, RBAC), storage, backup, monitoring, and observability.
Drive cloud optimization initiatives: rightsizing, reserved capacity, architectural improvements, and cost governance across Azure workloads.
Define and enforce platform standards for networking, security, identity, logging, alerting, and operational discipline.

DevOps & Site Reliability Engineering

Lead the DevOps and SRE transformation: implement CI/CD pipelines, Infrastructure as Code (Terraform, ARM/Bicep), containerization (Kubernetes), and modern deployment practices.
Provide hands‑on implementation of Kubernetes clusters, container orchestration, service mesh, and cloud‑native architecture patterns.
Establish SRE principles: error budgets, SLOs/SLIs, blameless postmortems, observability (metrics/logs/traces), and a reliability engineering culture.
Build and optimize CI/CD tooling and workflows to improve release velocity, reduce deployment risk, and increase developer productivity.
Implement robust change management processes (risk assessment, testing, communication, rollback procedures) that balance speed, safety, and audit readiness.

Information Security & Compliance

Implement security and compliance controls, including access management, logging and monitoring, vulnerability management, incident response, and audit evidence collection.
Establish security best practices across infrastructure: network segmentation, firewall rules, encryption (data at rest/in transit), secrets management, and privileged access management.
Lead incident response for infrastructure and platform issues, including root cause analysis, remediation, and process improvements.
Own Disaster Recovery strategy and execution: define RPO/RTO targets, architect multi‑region and hybrid DR solutions, develop runbooks, and conduct regular DR testing.
Ensure backup and restore capabilities across all critical systems with documented procedures and validated recovery processes.

Desktop & End‑User Support

Oversee desktop, endpoint, and telecom services (laptops, mobile devices, productivity tools, collaboration platforms, voice/conferencing) to deliver reliable, secure employee experiences.
Implement IT service management practices (incident, request, problem, asset management) with clear SLAs and user satisfaction metrics.
Manage vendor relationships across infrastructure, telecom, SaaS, and managed services—evaluate contracts, optimize licensing, and ensure service quality.

Required Qualifications

10+ years of progressive experience in IT infrastructure and…

