Kubernetes Engineer
Listed on 2026-03-02
-
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability
Washington
D.C, New York, Sarasota, Miami, Toronto, ON
Rumble is a high-growth video platform and cloud services provider that is creating an independent infrastructure. Our mission is to restore the internet to its roots by making it free and open once again.
About the RoleRumble Cloud is seeking a Kubernetes Engineer to support our team in rolling out and operating our next-generation Kubernetes platform. This role will focus on our new CAPI/CAPO-based Kubernetes solution, which is designed to be compatible with our existing Open Stack Magnum API and will be deployed across our public cloud. You will help run the day-to-day operations of the Kubernetes service, assist with migrations and onboarding from our current Magnum-based offering, and act as an escalation point for complex customer issues that go beyond front-line support.
This is a hands‑on engineering role for someone who enjoys debugging difficult problems, improving reliability, and working closely with both platform engineers and customer‑facing teams.
- Operate and maintain Rumble Cloud’sKubernetes platform, including our new CAPI/CAPO-based installation and its integration with the Open Stack Magnum API.
- Assist in therollout, migration, and upgradeprocesses as customers transition from the existing Magnum-based solution to the new platform.
- Serve as an escalation point for customer support, troubleshooting complex Kubernetes and cluster lifecycle issues.
- Monitor the health, performance, and capacity of Kubernetes clusters and underlying infrastructure; assist in incident response and root cause analysis.
- Implement and maintainday-2 operations processes, including backup/restore, scaling, patching, and cluster upgrades.
- Collaborate with the Kubernetes Architect to implement standards, best practices, and reference patterns for cluster configuration and operations.
- Help manage and integratecontainer registries, identity, networking, and storage components required for the Kubernetes platform.
- Automate repetitive operational tasks using scripting and infrastructure-as-codetools.
- Contribute to and maintainrunbooks, documentation, and knowledge base articlesfor both internal teams and customer-facing support.
- Provide feedback from operations and customer interactions to influence platform improvements and product roadmap.
- Extensive experience in Linux systems engineering, cloud engineering, or platform/SRE roles, including hands‑on work with Kubernetes in production environments.
- Practical experience withinstalling, operating, and troubleshooting Kubernetesin production environments.
- Experience working with Kubernetes onOpenStackor other cloud providers.
- Familiarity with Cluster API (CAPI) and/or Cluster API Provider Open Stack (CAPO), or strong willingness to ramp up quickly.
- Strong understanding of Linux, containers, and container runtimes (e.g., Docker, containerd).
- Experience with Kubernetes networking, storage, and ingressconcepts (CNI, CSI, load balancers, ingress controllers).
- Hands‑on experience with scripting and automation(e.g., Bash, Python, or Go) for operational tasks.
- Familiarity withconfiguration management and IaC toolssuch as Ansible and Terraform.
- Experience with monitoring and loggingin Kubernetes environments (e.g., Prometheus, Grafana, ELK/Graylog, or similar).
- Strong troubleshooting skills, clear written documentation, and effective communication with both technical and non‑technical stakeholders.
- Experience with
OpenStack Magnumand its Kubernetes integrations. - Experience operating Kubernetes on other public cloud platforms (EKS, AKS, GKE).
- Knowledge ofCeph, S3-compatible storage, and their use with Kubernetes.
- Understanding of application architecture son Kubernetes, including microservices and 12‑factor applications.
- Exposure toCI/CD tool chains(e.g., Git, Git Lab/Jenkins, Artifactory, etc.) for containerized workloads.
- Experience with security hardeningand best practices for Kubernetes (RBAC, Pod Security, network policies, image scanning).
- Demonstrated experience operating production Kubernetes cluster sand working as part of a cross‑functional…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).