Description
In this role, you'll join a team providing 24/7 operational support for our Kubernetes (GKE/AKS) environment, with a focus on optimizing its health and performance. Your expertise in Kubernetes (GKE/AKS) will be crucial, as you'll oversee the management and security of our containerized applications, including efficient resource allocation and adherence to best practices for container deployments. Your secondary responsibilities include supporting the availability and performance of the entire Google Cloud Platform (GCP) environment at the platform level, proactively identifying and resolving potential issues.
Professional certifications related to Kubernetes are beneficial, such as Certified Kubernetes Administrator (CKA) and Certified Kubernetes Security Specialist (CKS). Other certifications related to the GCP platform are also beneficial, along with Certified Terraform Associate.
The role requires familiarity with ITIL processes (incident, change, and problem management) and availability for off-hours support.
Drive root cause analysis on recurring incidents to help prevent issues in the future.
Provide operational consultancy for future-state technologies.
Stay up to date with emerging security threats and industry best practices related to container security and cloud-native technologies.
Responsible for DEV-to-PROD support and processes for GCP Cloud containers, PaaS, IaaS, etc., to ensure the quality, performance, and availability of Public Cloud services (GCP).
Critical thinker with strong research and analytics skills.
Mandatory technical skills include:
Kubernetes (GKE/AKS) Specifics:
3+ years of experience supporting container technologies such as Kubernetes, Google Kubernetes Engine (GKE), Azure Kubernetes Service (AKS), Docker, and Podman. Strong to expert knowledge of providing operational support for Kubernetes workloads (GKE/AKS/etc.).
Experience implementing Kubernetes technologies such as network policies, service mesh, certificate manager, ingress controllers, etc.
Strong understanding of Kubernetes resource types (e.g. cluster roles, services, deployments, etc.).
Experience developing Helm Charts.
Familiarity with Cloud PaaS services such as Google Cloud Run, Google GKE Autopilot, and Anthos Service Mesh.
Experience using IaC (Infrastructure-as-Code) tools such as Terraform, ARM, Bicep.
Understanding of Public Key Infrastructure (PKI), including managing public and private key certificates in a Cloud environment for PaaS services and applications.
Strong fundamental knowledge of Operating Systems (RHEL, Ubuntu).
Knowledge of monitoring tools such as Dynatrace, Datadog, etc.
GCP (Google Cloud Platform) Cloud Environment Specifics:
Experience supporting GCP services such as GKE, GCS, Dataflow, BigQuery, Cloud SQL (SQL/PostgreSQL), Redis, Cassandra, Bigtable, Filestore, Persistent Storage, Apigee, Kafka, etc.
Knowledge of OS technologies (Red Hat Linux, Windows).
Experience developing CI/CD pipelines using technologies such as GitHub Actions, Jenkins, etc.
Experience developing compliance policies/scripts using tools such as Google Org Policy, Aquasec, Wiz.
Strong understanding of network security principles, encryption protocols and identity management concepts.
Knowledge of scripting languages and tools such as Python, JavaScript, PowerShell, and Bash.
Experience supporting an Azure Public Cloud environment is not required but would be valuable.
EXPERIENCE & EDUCATION:
Undergraduate degree or Technical Certificate
Graduate degree preferred
7+ years relevant experience
EMPLOYEE / TEAM:
Work effectively as a team, supporting other members of the team in resolving critical service issues
Prioritize and manage own workload in order to deliver quality results and meet timelines
Support a positive work environment that promotes service to the business, quality, innovation, and teamwork, and ensure timely communication of issues/points of interest
Participate in knowledge transfer within the team and business units
Identify and recommend opportunities to enhance productivity, effectiveness and operational efficiency of the business unit and/or team
Who We Are:
TD is one of the world's leading global financial institutions and is the fifth largest bank in North America by branches/stores. Every day, we deliver legendary customer experiences to over 27 million households and businesses in Canada, the United States and around the world. More than 95,000 TD colleagues bring their skills, talent, and creativity to the Bank, those we serve, and the economies we support.
We are guided by our vision to Be the Better Bank and our purpose to enrich the lives of our customers, communities and colleagues.
TD is deeply committed to being a leader in customer experience; that is why we believe that all colleagues, no matter where they work, are customer facing. As we build our business and deliver on our strategy, we are innovating to enhance the customer…