More jobs:
Job Description & How to Apply Below
The Role
As a Dev Ops engineer you will provide L3 support across Edge/on‑prem and cloud troubleshooting to resolve escalated incidents and collaborate with senior engineers and Product to validate fixes and prevent recurrences.
Having a good understanding of our product, its components and their interactions is essential in troubleshooting and problems remediation. Good Linux administration (RHEL primarily, plus Ubuntu) and Open Shift/Kubernetes experience are essential.
You will build lightweight automations (Python, Bash, Ansible) to reduce Operations (Customer Deployment) issues. When not handling L3 work, you'll execute safe cloud deployments and upgrades via existing Git Ops/IaC pipelines (Flux, Ansible, Terraform), offering feedback and minor adjustments. You'll maintain and tune Alert manager rules and improve Grafana dashboards to ensure actionable, low‑noise alerting with Prometheus/Grafana.
You will automate and improve SOPs, and support knowledge sharing through workshops, training, and clear documentation, contribute to infrastructure testing and security/vulnerability remediation with the Dev Ops engineering and Security teams.
What You'll Do
Designs and maintains CI/CD pipelines using Git Lab CI/CD.
Implements Infrastructure as Code (IaC) with tools like Terraform.
Manages basic deployments and assists in CI/CD process improvements.
Writes and executes simple automation scripts (e.g., Ansible playbooks).
Troubleshoots and optimizes Kubernetes cluster operations.
Write and maintain system operations documentation (articles, diagrams, data flows, etc.) for new and existing applications and services.
Keep up-to-date on best practices and new technologies.
Conducts, designs, and executes staging/UAT/production and mass service deployment scenarios.
Collaborates on technical architecture and system design.
Analyzes and collects data: log files, application stack traces, thread dumps, etc.
Reproduces and simulate application incidents to create debug reports and coordinate delivery of application fixes.
Works in off-routine hours occasionally.
Works with customers and travel to international customer or partner locations.
Collaborating With
Operations (Customer Deployment) teams:
Collaborate with the Operations teams for troubleshooting and solving L3 tickets, create automations to reduce and optimize workload.
Dev Ops Cloud and Edge teams:
Work closely with the wider Dev Ops engineering teams, your manager, developers and QA engineers on technical documentation, integration and deployments of releases and changes.
Security Team:
Collaborate with the team to ensure the security of our cloud and edge solutions.
Our Tech Stack
At Everseen, you will have the opportunity to work with cutting-edge technology. Our stack includes:
CI/CD Tools:
Git Lab CI/CD
Cloud Platforms:
Azure (AKS, Registry), GCP (GKE)
Edge Platforms:
Docker, Podman, Kubernetes(k0s) and Openshift
Edge OS: RHEL, Ubuntu
Automation Tools:
Ansible (AWX), Jinja, Terraform
Deployment Tools:
Helm, Flux CD
Observability:
Prometheus, Loki, Grafana alloy, Grafana dashboards, Thanos
Databases:
Elasticsearch, MongoDB
Authentication:
Keycloak
Scripting
Languages:
Python, Bash
Profile And Skills
Experience:
3+ years in Dev Ops/SRE or similar operations‑focused roles with strong automation experience.
Networking:
Experience in DNS, routing, container communication, firewalls, reverse-proxying, load-balancing, edge to cloud communication and troubleshooting.
System Administration:
Good system administration skills are required for deploying and troubleshooting OS level outages and Everseen's containerized Edge application in customer network.
Cloud Expertise:
Proven experience with Azure (or GCP), including fully automated infrastructure and deployment.
CI/CD Pipelines:
Proven experience in implementing and managing CI/CD pipelines (Git Lab CI/CD preferred) and good knowledge of Git and associated workflows (e.g., Gitflow).
Observability:
Proven experience with monitoring, logging, and alerting tools and stacks.
Scripting:
Good scripting skills in Bash and Python.
Containerization:
Good knowledge of…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×