Software Engineer III/Senior, AI Gateway

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: ngrok
Full Time position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    AI Engineer
Salary/Wage Range or Industry Benchmark: 125,000 - 150,000 USD yearly
Job Description & How to Apply Below

About Ngrok Inc.

ngrok is an all-in-one cloud networking platform that secures, transforms, and routes traffic to services running anywhere. Instead of cobbling together nginx, NLBs, VPNs, model routers, and oodles of other tools, developers solve every networking problem with one gateway. Doesn’t matter if they’re sharing localhost or running AI workloads in production.

We're trusted by more than 9 million developers at companies like GitHub, Okta, HashiCorp, and Twilio. What started as a way to put your local app on a public URL has grown into a universal gateway for API delivery, AI inference, device fleets, and site-to-site connectivity. It’s the same ngrok that millions of developers have loved and leaned on every day for years, now with the power to run production traffic at scale.

A Few Things You Should Know
  • Our mascot is a rock
  • We are obsessed with our pets, Viper sunglasses and Bufo (yes, the toad)
  • We have a designated Chief Emoji Officer - they are vital to our success!
  • We like software that’s serious and culture that’s not

Most people skim to “requirements” and bounce. You're actually reading this. That's the kind of thoroughness we respect, or you're just procrastinating. Either way, same same and you should keep reading.

About The AI Gateway Team

Our AI Gateway team builds the systems that define how AI traffic is identified, controlled, and understood as it passes through ngrok.

We own the AI‑specific control plane at the gateway layer: policies, usage tracking, and enforcement that sit directly on live customer traffic. Our systems must behave correctly under real-world conditions—traffic spikes, unexpected model behavior, misconfigured policies, and customers asking, “Why was this blocked?” or “Where did my tokens go?”

What You’ll Actually Do
  • Build and evolve the AI Gateway:
    You’ll work on the AI‑aware gateway components that classify and handle AI traffic in real time. This code runs directly in the request path and must be fast, safe, and predictable.
  • Own AI traffic policy enforcement:
    You’ll design and implement AI Gateway Traffic Policy Objects—rate limits, usage caps, and access rules specific to AI workloads. These policies exist to prevent runaway costs, misuse, and accidental exposure without breaking legitimate traffic.
  • Track AI usage and token consumption:
    You’ll build and maintain systems that accurately measure AI usage—requests, tokens, and related metadata—so customers can understand how their AI systems behave and what they’re consuming.
  • Make AI behavior observable and explainable:
    You’ll expose clear, trustworthy signals around AI traffic: what was allowed or blocked, which policies applied, and how usage accumulated. When customers ask “what happened?”, the gateway should already know.
  • Design abstractions that hide complexity:
    You’ll work with product and design to build AI‑specific gateway primitives that feel intentional and safe, without leaking provider quirks or infrastructure details into customer workflows.
  • Ship systems customers trust in production:
    You’ll collaborate closely with Gateway, Customer Data, and Platform teams to ensure AI usage data, policy enforcement, and billing signals line up—so customers can turn these features on with confidence.

You Might Be a Great Fit If…
  • You’re comfortable in a statically typed, compiled language such as Go, Rust, C++, or Java (with bonus points for Go)
  • You’ve worked with AI/LLMs and can appreciate their unique brand of edge‑cases
  • You care about developer experience and thoughtful abstractions
  • You enjoy defining system behavior, not just plumbing
  • You’ve thought about retries, limits, and costs before being asked
  • You like systems that move complexity from the user to the system

Extra Credit If You’ve Worked On
  • AI platforms or inference infrastructure
  • API gateways with product‑level opinions
  • Usage limits, quotas, or billing‑adjacent systems
  • Customer‑facing observability tools

Tech Stack

ngrok runs entirely on AWS. Engineers develop by using remote development tools and/or ssh to connect to remote EC2 environments that run a full Kubernetes cluster of the ngrok stack, closely mirroring production. The codebase is primarily Go and…

Position Requirements
10+ Years work experience