
Senior AI Infrastructure Engineer

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: ArangoDB, Inc.
Full Time position
Listed on 2026-01-24
Job specializations:
  • Software Development
    AI Engineer, Senior Developer, Machine Learning/ ML Engineer, Cloud Engineer - Software

Overview

Senior Software Engineer – AI Infrastructure at Arango

About Arango:
Arango provides a trusted data foundation for the next wave of Enterprise AI with graph-based Contextual AI, transforming enterprise data into a System of Context that enables LLMs to deliver better outcomes at scale and cost efficiency. The Arango AI Data Platform offers a single, integrated environment to build and scale AI-powered applications, unifying graph, vector, document, and key-value data with full-text, geospatial, and vector search — creating the System of Context, the bridge between enterprise data and LLMs.

We’re a global team based in California and Cologne, united by curiosity, collaboration, and a passion for helping developers, data engineers, and technology leaders innovate faster and smarter with AI. We are trusted by NVIDIA, HPE, the London Stock Exchange, the U.S. Air Force, NIH, and Articul8, and we are a member of the NVIDIA Inception Program and the AWS ISV Accelerate Program.

If you’re excited about shaping the future of Contextual AI, come build with us.

Location

Only candidates in Europe will be considered.

About the Role

We are looking for a senior, hands-on software engineer to help maintain, stabilize, and debug our AI infrastructure. This role has high ownership and requires deep technical problem-solving skills in production environments. The most important qualification is not a specific language, but a strong understanding of how complex software systems actually behave in production.

Key Responsibilities
  • Maintain, develop, and stabilize AI infrastructure services
  • Architect and implement foundational services and shared libraries that scale our entire AI infrastructure ecosystem
  • Debug complex, non-obvious production issues across application, process, network, and memory layers
  • Systematically analyze, isolate, and resolve failures in unclear scenarios
  • Improve observability, profiling, and tracing across services
  • Work with distributed microservices and internal platforms
  • Operate and improve systems running on Docker, Kubernetes, and Helm
  • Support CI/CD pipelines (CircleCI), testing, and security-relevant components
  • Work with MLflow, Triton, and distributed Python modules
Core Requirements
  • Senior-level experience (5+ years)
  • Passion for working with cutting-edge technologies in a fast-moving AI environment, including LLM-based workflows and pipelines
  • Comfortable working independently in a fast-paced, evolving environment
  • Strong debugging skills in complex, distributed systems with the ability to identify root causes when failures are ambiguous
  • Solid understanding of how software behaves in production environments
  • Experience designing and operating microservices, distributed systems, and databases
  • Proven experience building and scaling high-availability services
Core Technology Stack
  • Python (primary language) - 5+ years
  • Docker, Kubernetes, Helm - 5+ years
  • CI/CD pipelines and testing frameworks (e.g., CircleCI)
  • Observability tools: metrics, logs, tracing, and profiling
  • Exposure to AI/ML infrastructure, including tools such as MLflow and Triton
Nice to Have
  • Customer-facing experience, including proofs of concept (PoCs) or technical demos
  • Experience with AI/ML infrastructure and orchestration, e.g., MLflow and Triton
  • Cross-language debugging experience
  • Familiarity with Rust
  • Experience working with NoSQL, multi-model, or graph databases
  • Knowledge of Retrieval-Augmented Generation (RAG) and GraphRAG concepts
  • Understanding of graph algorithms and graph-based data modeling
Why Join ArangoDB

Our headquarters is in San Francisco (US) and we have an office in Cologne (Germany), but most of our diverse team works remotely worldwide. So, do you prefer your desk at home or do you want to join us at one of our locations? Your choice.

The global minds of the Arango team come from five continents and more than 20 countries. Diverse backgrounds enable us to see new solutions. We invite people of every culture, national origin, religion, sexual orientation, gender identity, and age to apply for our positions. Arango is committed to a workplace free of discrimination and harassment based on any of these characteristics.

We love this diversity and encourage everyone curious and visionary to join the multi-model movement.

Position Requirements
10+ Years work experience