×
Register Here to Apply for Jobs or Post Jobs. X

Production Support Cloud Engineer

Job in Rosebank, Western Cape, South Africa
Listing for: Khonology (Pty) Ltd
Full Time position
Listed on 2026-01-10
Job specializations:
  • IT/Tech
    Cloud Computing, IT Support
Job Description & How to Apply Below
Location: Rosebank

Khonology is a digital services company focused on software development, Application Support, data analytics and engineering.

We are looking for two Production Support Cloud Engineers to ensure the stability, performance, and operational continuity of a platform and its hosted workloads during the November – January period, coinciding with the year-end change freeze and early-year restart window
.

The platform is an internal developer platform designed to accelerate secure, compliant, and scalable application delivery on AWS. It provides teams with self-service onboarding, reusable golden paths, runtime patterns, and built-in Fin Ops and observability guardrails.

Key Responsibilities
  • Provide L2/L3 production support for platform components running on Amazon EKS, RDS, Lambda, S3, and Cloud Flare.
  • Monitor workloads, troubleshoot incidents, and coordinate resolution with platform and development teams.
  • Manage and triage service requests, incident queues, and change controls within ITSM workflows.
  • Maintain operational dashboards and Grafana/Cloud Watch alerts, ensuring uptime and SLO compliance.
  • Execute post-incident root cause analyses (RCAs) and document permanent fixes in runbooks.
  • Support deployment automation and Git Ops processes (Argo CD, Git Hub Actions, Helm).
  • Validate compliance of services with security, reliability, and cost optimisation standards.
  • Collaborate with Platform engineers to automate recurring tasks and improve operational efficiency.
  • Ensure backup verification, log retention, and audit readiness for all managed components.
Required Skills & Experience
  • 3–5 years of production support or site reliability experience in cloud-native environments.
  • Solid understanding of AWS EKS, RDS, Cloud Watch, IAM, S3, and Lambda.
  • Experience with Kubernetes, Helm, Git Ops, and CI/CD pipelines.
  • Competence in Typescript, Python, Bash, or Go scripting for automation.
  • Familiarity with Grafana, Prometheus, Loki, and incident management practices (ITIL).
  • Strong communication skills and ability to collaborate across platforms, security, and development teams.
#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary