Senior Cloud Infrastructure Engineer
Listed on 2026-01-24
IT/Tech
Cybersecurity, Systems Engineer
Company Overview
By Light Professional IT Services LLC readies warfighters and federal agencies with technology and systems engineered to connect, protect, and prepare individuals and teams for whatever comes next. Headquartered in McLean, VA, By Light supports defense, civilian, and commercial IT customers worldwide.
Cole Engineering Services (CESI), a By Light company, is recognized as a premier provider of modeling and simulation (M&S) training solutions to the Federal Government and industry. Since 2004, CESI has been at the forefront of developing, maintaining, and integrating simulation-based training, serious gaming, technical services, training and other support in live, virtual, constructive, and gaming (LVCG) domains. CESI also designs, builds and runs infrastructure, platforms, applications and processes that enable cyber training for the integrated multi-domain force.
Our vision is to become a worldwide full spectrum LVCG and cyber training/analysis developer, integrator and services provider.
Cole Engineering Services, Inc. is seeking a highly qualified Senior Cloud Infrastructure Engineer to lead implementation, security, and operations of mission-critical cloud environments that power DoD cyber training capabilities and applications. You will manage and develop resilient, compliant, and cost-optimized cloud platforms supporting cyber ranges, training orchestration, and multi-tenant applications in FedRAMP-approved cloud environments. You will partner closely with cybersecurity, DevSecOps, networking, and training operations teams to deliver secure, scalable capabilities aligned to DoD RMF, DISA STIGs, and the DoD Cloud Computing SRG (Impact Levels IL2–IL6).
In this role, you will be a key technical leader ensuring the DoD’s cyber training enterprise platforms are secure, resilient, and efficient, enabling cyber operators to execute complex cyber exercises at scale while meeting stringent compliance and mission requirements.
Responsibilities
Primary Position Functions:
- Support the design and maintenance of landing zones using cloud services such as AWS Organizations, Control Tower, SCP guardrails, Identity and Access Management (IAM) multi-account patterns, and VPC architectures (Transit Gateway, PrivateLink, NAT, IGW) for enclave isolation and cross-domain needs.
- Engineer high-availability, multi-Region solutions leveraging cloud tools such as EC2, EKS/ECS Fargate, RDS/Aurora, DynamoDB, S3/EFS/FSx, Load Balancers, Route 53, and API Gateway.
- Implement Zero Trust-aligned patterns (micro-segmentation, strong identity, continuous verification) consistent with DoD Zero Trust guidance.
- Implement security controls and evidence generation for RMF ATO packages (SSP, SAR, POA&M) in coordination with cybersecurity teams.
- Apply DISA STIGs (OS, DB, Kubernetes, Container) and SRG requirements for workloads at IL2–IL6.
- Tailor and automate STIG application using IaC and configuration management.
- Integrate encryption and key management with cloud tools such as AWS KMS/HSM; enforce IAM least privilege, SCPs, permission boundaries, ABAC, and robust secrets management.
- Implement cloud logging and metrics tools such as CloudTrail/CloudWatch/GuardDuty/Config for comprehensive audit and detection.
- Align architectures with FedRAMP Moderate/High baselines when required and ensure boundary compliance for controlled workloads.
- Develop secure connectivity (AWS Direct Connect/VPN), hybrid routing, and segmentation; implement TLS mutual auth, certificate management, and private service endpoints.
- Design logging and telemetry pipelines (CloudWatch, OpenTelemetry, Kinesis, S3, SIEM integration such as Splunk/ELK) with retention, metadata/tagging, and data lifecycle policies.
- Own SLOs/SLAs for platform services.
- Implement autoscaling, health checks, and proactive capacity management.
- Lead cost management and alerting practices for cloud environments in coordination with project leads.
- Provide Tier 3 support, on-call rotations during exercises, and incident response coordination with cybersecurity and training operations.
- Collaborate with agile…