AWS Consultant
Listed on 2026-01-16
-
IT/Tech
Systems Engineer, Cloud Computing
Job Title
: AWS Consultant
Location
:
Foster City, CA
Duration
: FTE
- Any Bachelor’s Degree
- 10 years
- 7 years of experience with AWS services including Glue, EKS, Athena, S3, ECS, and ASG, with strong capabilities in monitoring, validation, deep‑Dive troubleshooting, and advanced incident resolution.
- 7 years of experience working with AWS Serverless services such as Lambda, API Gateway, and Dynamo
DB, including log analysis, root cause identification, and complex issue resolution. - 7 years of experience with Terraform / Infrastructure as Code (IaC), capable of designing, reviewing, and troubleshooting infrastructure deployments and managing environment‑level issues.
- 7 years of experience with containerization and orchestration using Docker, Helm, and Kubernetes, including advanced pod/service troubleshooting and collaboration with platform teams.
- 7 years of experience with Git Flow and CI/CD pipelines, handling pipeline design, failure analysis, and release coordination across teams.
- 7 years of experience with microservices and event‑driven architectures, enabling end‑to‑end system analysis, incident root cause analysis, and ownership of L3 support resolution.
The L3 Cloud & Platform Support Engineer is responsible for providing advanced technical support and ownership of complex incidents across cloud‑native platforms. The role requires deep hands‑on expertise in AWS, serverless and container technologies, Infrastructure as Code, CI/CD pipelines, and distributed systems to ensure platform stability, scalability, and reliability.
Soft skills / other skills – To be Evaluated by Hiring Manager- Communication Skills
:
Communicate effectively with internal and customer stakeholders (technical and non‑technical); communication approach: verbal, emails and instant messages. - Interpersonal Skills
:
Strong interpersonal skills to build and maintain productive relationships with team members; provide constructive feedback during reviews and be open to receiving the feedback. - Problem‑Solving and Analytical Thinking
:
Capability to troubleshoot and resolve issues efficiently; analytical mindset. - Task / Work Updates
:
Prior experience in working on Agile/Scrum projects with exposure to tools like Jira/Azure Dev Ops; provides regular updates, proactive and due diligent to carry out responsibilities.
The expected outcome of this role is to ensure high availability, stability, and reliability of cloud platforms by owning and resolving complex L3 incidents end to end. The role will drive faster recovery and reduced repeat issues through strong root cause analysis, preventive fixes, and well‑governed infrastructure deployments using Terraform. It will enable smooth and predictable releases, optimized performance of microservices and event‑driven systems, and improved operational maturity through enhanced monitoring, automation, documentation, and effective knowledge transfer to L2 teams.
SecondarySkills to be planned Post Hiring – Training Plan
Secondary skills include scripting (Python/Bash), monitoring and observability tools (Cloud Watch, Prometheus, Grafana), and a solid understanding of security, networking, and compliance best practices. The role also benefits from experience in incident management, documentation, and mentoring L2 teams, along with exposure to data platforms and analytics workloads.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).