Principal Cloud Backend Engineer
Listed on 2026-01-14
-
Software Development
Cloud Engineer - Software, Backend Developer, Software Engineer, DevOps
The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale.
Samba Nova Suite™ is the first full‑stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the Samba Nova Suite is a fully integrated platform, delivered on‑premises or in the cloud, combined with state‑of‑the‑art open‑source models that can be easily and securely fine‑tuned using customer data for greater accuracy. Once adapted with customer data, customers retain model ownership in perpetuity, so they can turn generative AI into one of their most valuable assets.
AboutThe Role
We are seeking a highly skilled and experienced Principal or Senior Principal Cloud Backend Engineer to architect and build the core platform that powers our large‑scale AI inference services, with a critical focus on enabling flexible billing and monetization strategies. You will own the design and implementation of the systems that not only ensure reliability and scalability but also directly unlock new revenue streams and business models for our AI services.
This is a high‑impact role where you will solve complex challenges at the intersection of cloud‑native AI infrastructure, metering, and monetization. You will build the foundational systems for usage‑based pricing, subscription plans, and dynamic entitlements that serve as the economic engine for our business. If you are passionate about building platforms that are both technically robust and commercially critical, we want to hear from you.
Key Responsibilities- Platform Architecture & Strategy: Lead the technical vision and architecture for our inference serving and monetization platform. Design systems that are fault‑tolerant, highly available, and can scale to meet growing demand while accurately tracking usage for billing.
- Monetization Platform Design: Architect the core systems for flexible monetization, including:
- Entitlements & Quota Management: Designing a flexible system to define and enforce complex usage plans, rate limits, and access policies.
- Usage Metering & Aggregation: Building a highly reliable and accurate system to meter usage (e.g., tokens, requests) at scale and prepare data for billing.
- Billing Integration: Designing clean abstractions and APIs to seamlessly integrate with external billing and payment providers (e.g., Stripe, Metronome).
- Distributed Systems Design: Architect and implement complex distributed systems involving real‑time rate limiting, quota enforcement, and fair‑share scheduling for a multi‑tenant environment.
- Performance & Cost Optimization: Identify and eliminate bottlenecks in the end‑to‑end system, ensuring low‑latency request handling while maintaining precise financial accuracy.
- Technical Leadership: Serve as a technical leader and mentor. Establish best practices in code quality, testing, and observability for business‑critical financial data pipelines.
- Cross‑Functional
Collaboration:
Work closely with Product Management, Finance, and GTM teams to translate business requirements for new pricing models (e.g., subscriptions, pay‑as‑you‑go, custom enterprise plans) into scalable technical solutions.
- 10+ years of experience in software engineering, with a significant focus on designing and building large‑scale, distributed backend systems in cloud environments.
- 5+ years in a Principal or Lead Engineer role, with a proven track record of architecting, delivering, and operating business‑critical platforms.
- Expert proficiency in one or more of the following:
Go, Rust, and C++. Deep understanding of concurrency, performance optimization, and systems programming. - Deep, hands‑on experience with cloud‑native technologies (Kubernetes, Docker, etc.) and major cloud providers (AWS, GCP, Azure).
- Extensive experience with both SQL and No
SQL databases (e.g., Postgre
SQL, Redis) and designing data models for high‑throughput, low‑latency applications. - Strong…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).