Rumble is a high-growth video platform and cloud services provider that is creating an independent infrastructure. Our mission is to restore the internet to its roots by making it free and open once again.
About the RoleRumble Cloud is seeking a Kubernetes Engineer to support our team in rolling out and operating our next-generation Kubernetes platform. This role will focus on our new CAPI/CAPO-based Kubernetes solution, which is designed to be compatible with our existing Open Stack Magnum API and will be deployed across our public cloud. You will help run the day-to-day operations of the Kubernetes service, assist with migrations and onboarding from our current Magnum-based offering, and act as an escalation point for complex customer issues that go beyond front-line support.
This is a hands‑on engineering role for someone who enjoys debugging difficult problems, improving reliability, and working closely with both platform engineers and customer‑facing teams.
- Operate and maintain Rumble Cloud’s Kubernetes platform, including our new CAPI/CAPO-based installation and its integration with the Open Stack Magnum API.
- Assist in the rollout, migration, and upgrade processes as customers transition from the existing Magnum-based solution to the new platform.
- Serve as an escalation point for customer support, troubleshooting complex Kubernetes and cluster lifecycle issues.
- Monitor the health, performance, and capacity of Kubernetes clusters and underlying infrastructure; assist in incident response and root cause analysis.
- Implement and maintain day‑2 operations processes, including backup/restore, scaling, patching, and cluster upgrades.
- Collaborate with the Kubernetes Architect to implement standards, best practices, and reference patterns for cluster configuration and operations.
- Help manage and integrate container registries, identity, networking, and storage components required for the Kubernetes platform.
- Automate repetitive operational tasks using scripting and infrastructure‑as‑code tools.
- Contribute to and maintain runbooks, documentation, and knowledge base articles for both internal teams and customer‑facing support.
- Provide feedback from operations and customer interactions to influence platform improvements and product roadmap.
- Extensive experience in Linux systems engineering, cloud engineering, or platform/SRE roles, including hands‑on work with Kubernetes in production environments.
- Practical experience with installing, operating, and troubleshooting Kubernetes in production environments.
- Experience working with Kubernetes on Open Stack or other cloud providers.
- Familiarity with Cluster API (CAPI) and/or Cluster API Provider Open Stack (CAPO), or strong willingness to ramp up quickly.
- Strong understanding of Linux, containers, and container runtimes (e.g., Docker, containerd).
- Experience with Kubernetes networking, storage, and ingress concepts (CNI, CSI, load balancers, ingress controllers).
- Hands‑on experience with scripting and automation (e.g., Bash, Python, or Go) for operational tasks.
- Familiarity with configuration management and IaC tools such as Ansible and Terraform.
- Experience with monitoring and logging in Kubernetes environments (e.g., Prometheus, Grafana, ELK/Graylog, or similar).
- Strong troubleshooting skills, clear written documentation, and effective communication with both technical and non‑technical stakeholders.
- Experience with Open Stack Magnum and its Kubernetes integrations.
- Experience operating Kubernetes on other public cloud platforms (EKS, AKS, GKE).
- Knowledge of Ceph, S3‑compatible storage, and their use with Kubernetes.
- Understanding of application architectures on Kubernetes, including microservices and 12‑factor applications.
- Exposure to CI/CD tool chains (e.g., Git, Git Lab/Jenkins, Artifactory, etc.) for containerized workloads.
- Experience with security hardening and best practices for Kubernetes (RBAC, Pod Security, network policies, image scanning).
- Demonstrated experience operating production Kubernetes clusters and working as part of a cross‑functional cloud/platform team.
- Ability…
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: