Senior Technical Operations Engineer; Opensearch
Listed on 2026-01-24
-
IT/Tech
Cloud Computing
Own Every Moment at Net App
At Net App, your ideas power innovation. We lead in intelligent data infrastructure—delivering unified storage, integrated data services, and solutions that help organizations unlock the full potential of their data, from AI to multicloud.
Ready to innovate and contribute to our path to $10B? Here, you'll collaborate with passionate teams, tackle real-world challenges, and see your impact in how customers transform and grow. If you're ready to bring curiosity, creativity, and drive to every moment, Net App is where your journey begins.
Job SummaryNet App is looking for a Senior Tech Ops Engineer to join our growing Instaclustr team in North Carolina, US. Net App’s Instaclustr offering provides open source as-a-service company, delivering reliability manage cutting edge open-source technologies (Cassandra, Kafka, Postgre
SQL, Open Search, Cadence, Postgres and Click House) for our customers around the world.
Net App Instaclustr makes it easy for our customers to run powerful open-source applications at the highest levels of scale. We have developed a platform that takes care of the whole lifecycle: provisioning infrastructure, installing applications and, most importantly, keeping the applications running reliably in production. Since being founded in 2013, Instaclustr has grown strongly, with over 300 customers worldwide, and over 19,000 nodes under management.
Our Technical Operations Engineers are the frontline team keeping our large fleet of cloud-hosted open-source clusters up and running. Your work will ensure the security, reliability and performance of world-class systems and databases. You will collaborate with our customer’s technical teams, from globally recognised companies in the gaming, banking and logistics industry sectors, ranging from big multinationals to emerging start-ups.
The RoleIf you have excellent operational knowledge in managing Open search clusters, look no further! As a Site Reliability Engineer (Open search), you are in the frontline team keeping our large fleet of cloud-hosted Open search clusters up and running. Every day, you will diagnose and solve interesting technical problems, providing Open search as a Managed Service in a highly automated environment. Our service is relied on by some of the leading global names in Banking and Financial Services, Telecom, IoT and Tech companies that interact with millions of end users.
Job Requirements- Have good experience in Open Search technology (preferably including issue investigation, upgrades/migration) and/or a desire to upskill and develop to a true expert level. It’s preferable to have experience diagnosing issues/problems with other technologies such as Cassandra, Kafka.
- Have good experience working on one Public Cloud provider such as AWS, Azure or GCP.
- Preferably have past IT Customer service/support experience.
- Good fundamental Computer science / software engineering skills and knowledge, particularly Operating System internals, memory management, and networking.
- Strong knowledge and experience with Linux and be comfortable working from the command line (essential)
- Exceptional ability to communicate clearly and professionally in written and verbal English (essential).
- Work as part of a team and use your initiative to get things done.
- Ability to follow required processes and procedures.
- Investigating/researching Open search issues by reviewing the codebase or Jira project would be a plus.
- Programming skills in Python or Java, and source code control using Git would be a plus.
- Provide expert operational support to our nodes running in the cloud (AWS, Azure and GCP), using technologies such as Linux (Debian), Docker, and languages including Java, Python and bash
- Participate in on-call Level 2 roster
- Liaise with our customers’ engineers in resolving interesting issues related to Open search usage and other supported technologies.
- Undertake complex cluster operations such as migrations, upgrades and maintenance on our fleet.
- Develop and continually improve our suite of internal automation tools, applications, and processes.
Typically requires a minimum of 5 years of…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).