More jobs:
Director of Platform Engineering & Operations MAZDC
Job in
Charlotte, Mecklenburg County, North Carolina, 28245, USA
Listed on 2026-03-13
Listing for:
Compunnel Inc.
Full Time
position Listed on 2026-03-13
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing
Job Description & How to Apply Below
Role Summary
The Director of Platform Engineering & Operations is responsible for COMPANY’s entire technology platform—overseeing both customer-facing systems and internal infrastructure to ensure 24x7 availability, security, and scalability across Azure cloud and on-premise environments. This hands‑on leadership role balances technical execution and strategic management, including building a high-performing team, driving operational excellence, implementing security controls, and supporting the company’s rapid growth.
Key Responsibilities- Build, mentor, and retain a team of 8 engineers across infrastructure/network, Dev Ops/SRE, and desktop/end‑user support, providing technical coaching, career development, and performance management.
- Own platform strategy, roadmap, and execution to meet business goals and customer SLAs.
- Define and track operational KPIs (availability, MTTR, change success rate, incident volume, cloud cost efficiency) and present regular updates to the CTO and executive team.
- Take full ownership of platform strategy, roadmap, and execution aligned with business objectives, product needs, and customer SLAs.
- Establish operational cadence: incident reviews, change advisory board, service desk metrics, team retrospectives, and continuous improvement culture.
- Own the design, implementation, and 24x7 operation of COMPANY’s hybrid infrastructure (Azure + on‑premise) supporting both production and internal corporate systems.
- Ensure high availability, scalability, performance, security, and cost efficiency across all environments.
- Hands‑on architecture and implementation of cloud infrastructure, networking, identity management (Azure AD/Entra, RBAC), storage, backup, monitoring, and observability.
- Drive cloud optimization initiatives: rightsizing, reserved capacity, architectural improvements, and cost governance across Azure workloads.
- Define and enforce platform standards for networking, security, identity, logging, alerting, and operational discipline.
- Lead Dev Ops and SRE transformation: implement CI/CD pipelines, Infrastructure as Code (Terraform, ARM/Bicep), containerization (Kubernetes), and modern deployment practices.
- Hands‑on implementation of Kubernetes clusters, container orchestration, service mesh, and cloud‑native architecture patterns.
- Establish SRE principles: error budgets, SLOs/SLIs, blameless postmortems, observability (metrics/logs/traces), and reliability engineering culture.
- Build and optimize CI/CD tooling and workflows to improve release velocity, reduce deployment risk, and increase developer productivity.
- Implement robust change management processes (risk assessment, testing, communication, rollback procedures) that balance speed, safety, and audit readiness.
- Implement security and compliance controls, including access management, logging and monitoring, vulnerability management, incident response, and audit evidence collection.
- Establish security best practices across infrastructure: network segmentation, firewall rules, encryption (data at rest/in transit), secrets management, privileged access management.
- Lead incident response for infrastructure and platform issues, including root cause analysis, remediation, and process improvements.
- Own Disaster Recovery strategy and execution: define RPO/RTO targets, architect multi‑region and hybrid DR solutions, develop runbooks, and conduct regular DR testing.
- Ensure backup and restore capabilities across all critical systems with documented procedures and validated recovery processes.
- Oversee desktop, endpoint, and telecom services (laptops, mobile devices, productivity tools, collaboration platforms, voice/conferencing) to deliver reliable, secure employee experiences.
- Implement IT service management practices (incident, request, problem, asset management) with clear SLAs and user satisfaction metrics.
- Manage vendor relationships across infrastructure, telecom, SaaS, and managed services—evaluate contracts, optimize licensing, and ensure service quality.
- 10+ years of progressive experience in IT infrastructure and…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×