Platform Engineer
Verfasst am 2026-01-20
-
IT/Informationstechnik
Systemingenieur, Cyber-Sicherheit, Cloud Computing, Site Reliability Ingenieur/in
Remote (Europe) | Full-time | Experienced Hires
Zalion is on a mission to eliminate repetitive procurement work through agentic AI. We’re building autonomous agents that operate deep within enterprise procurement — navigating messy data, legacy systems, and complex workflows to deliver real impact.
Join us early and help define how enterprise AI is done right.
🚀 What You’ll be responsible forYou will:
Own our platform foundations end-to-end — from AWS architecture and IaC to CI/CD, observability, and incident readiness.
Build and evolve secure, scalable AWS infrastructure (networking, compute, storage, IAM) optimized for reliability and cost.
Design and maintain CI/CD pipelines on Git Hub that are fast, repeatable, and developer-friendly (clear feedback loops, safe deploys, strong defaults).
Define and operate infrastructure using Terraform — with clean modules, sensible standards, and automated validation.
Improve developer experience through golden paths: templates, self-service environments, paved roads for deployments, and internal tooling that removes friction.
Drive availability, scalability, and resilience : deployment strategies, rollbacks, capacity planning, DR thinking, and performance tuning.
Implement pragmatic security-by-default : least privilege IAM, secrets management, secure supply chain, and guardrails that enable speed without compromising safety.
Establish and refine observability and reliability practices (SLOs/SLIs, monitoring, alerting, postmortems, runbooks) that scale with the team.
Partner closely with product engineering to reduce operational load and keep delivery velocity high as Zalion grows.
AWS (core services; compute, networking, IAM, logging/monitoring, managed data services)
Git Hub (Actions, CI/CD workflows, checks, release automation)
Containers & orchestration (e.g., ECS/Fargate and/or Kubernetes depending on evolution)
Observability tooling (metrics, logs, tracing; e.g., Grafana/Prometheus/Open telemetry and friends)
Security tooling (SAST/DAST, dependency scanning, secrets scanning, policy as code)
✅ What We’re Looking ForStrong experience as a Platform / Dev Ops / Site Reliability Engineer in product teams shipping to production.
Deep practical knowledge of AWS : networking, IAM, security controls, and designing for failure.
Hands-on expertise with Terraform (not just “running apply”): modules, state strategy, DRY patterns, environment separation, and automated reviews.
Solid CI/CD engineering experience with Git Hub : pipeline design, artifact/versioning, deployment safety, and fast feedback loops.
A strong mindset for reliability and operability : you think in failure modes, automation, and measurable outcomes (SLOs).
Security awareness and discipline: you build guardrails that make the secure path the easy path.
A builder mindset : you ship improvements, measure impact (lead time, deploy frequency, MTTR), and iterate.
Comfort with ambiguity and ownership : you proactively identify platform bottlenecks and fix them without waiting for perfect specs.
4+ years experience in relevant roles (startup/scale-up experience is a plus).
Build the platform behind agentic AI systems that run in real enterprise environments
Immediate impact — your work accelerates every engineer and every release
Competitive salary + meaningful equity
High-end equipment
We value clarity and honesty. If something is messy, we’ll say so — and then we’ll fix it. We want an architect who enjoys building platforms that are secure, observable, and boring-in-production (in the best way).
Links to things you’ve built (Git Hub, blog posts, talks, etc.)
2–3 sentences on why this role and Zalion interest you
#J-18808-LjbffrUm nach Stellen zu suchen, sie anzusehen und sich zu bewerben, die Bewerbungen aus Ihrem Standort oder Land akzeptieren, klicken Sie hier, um eine Suche zu starten: