Azure Cloud Infrastructure Manager
Listed on 2026-03-07
-
IT/Tech
Cloud Computing, Cybersecurity, Systems Engineer, IT Support
Hybrid 4 in either New York City or Jersey City
Our client is a global investments company that provides investment management and investment services to institutions, corporations, and individual investors in multiple countries. The organization serves as a single point of contact for clients to create, trade, hold, manage, service, distribute, or restructure investments and operates recognized subsidiaries that support advisors and financial services firms with clearing, custody, and technology solutions.
Due to client requirements, applicants must be willing and able to work on a w2 basis. For our w2 consultants, we offer a great benefits package that includes Medical, Dental, and Vision benefits, 401k with company matching, and life insurance.
Rate: $80.00 to $95.00/hr. w2
Role SummaryThe Principal Azure Capacity Manager (Consultant) is responsible for leading capacity planning, forecasting, and optimization for a mission‑critical Azure public cloud environment operating with High regulatory and resilience requirements. This role ensures adequate and reliable capacity across compute, storage, network, and platform services, supports SRE teams with performance and reliability objectives, and maintains evidence‑based compliance with High‑impact control expectations.
Core Responsibilities Capacity Planning & Optimization- Develop and maintain service‑level capacity models for App Services, databases, storage, messaging, networking, Key Vault/HSM, and other Azure/PaaS services.
- Establish capacity buffer standards (e.g., N+1/N+2, percentage headroom, runway‑in‑weeks) and validate against demand patterns, failover scenarios, and planned maintenance.
- Implement and tune autoscaling strategies—horizontal/vertical—using SLI/SLO‑driven triggers (latency, error rate, saturation).
- Perform baseline and trend analysis for utilization, throughput, and performance; recommend tuning changes, architectural adjustments, and reservations/savings plans.
- Forecast future demand using product roadmaps, release calendars, and business growth inputs; translate into actionable capacity plans and reservation procurement timelines.
- Participate in CABs as the capacity and security representative; enforce gated approvals based on documented capacity and security impact analyses.
- Maintain configuration management for cryptographic services (Key Vault, managed HSM) with versioned inventories and FIPS‑validated module tracking.
- Document capacity‑related changes, including confidentiality/integrity/availability impacts; update SSP and system artifacts when changes affect control implementations.
- Integrate vulnerability and flaw remediation workflows with capacity risk considerations; maintain POA&M entries and continuous monitoring documentation.
- Ensure third‑party services related to capacity management meet High program requirements with documented oversight and continuous monitoring.
- Enforce strict U.S. or U.S.
-Territories‑only processing, storage, logging, backup, DR, and support operations; validate region selection as part of capacity planning. - Contribute to reviews and updates of policies and procedures related to capacity governance (SA‑1 alignment).
- Balance performance, resilience, and cost efficiency using Reservations/Savings Plans, rightsizing, storage tiering, and scheduled scaling.
- Integrate capacity signals into incident, change, and problem management processes to enable proactive adjustments before production risk manifests.
- Provide clear, transparent reporting on utilization trends, cost impacts, and optimization outcomes.
- Conduct criticality assessments to prioritize capacity for high‑criticality components; ensure alignment across monitoring, hardening, backup/DR, and buffer policies.
- Validate DR capacity (warm/cold/hot) to support failover requirements; ensure reserved quotas and buffers meet both steady‑state and disaster‑event needs.
- Define and publish capacity KPIs such as utilization, saturation, headroom %, runway…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).