Software Engineer II (Backend + Data Pipelines) Job, Denver area, Colorado, USA, Software Development

Position: Software Engineer II (Backend + Data pipelines)

Join to apply for the Software Engineer II (Backend + Data pipelines) role at Scribd, Inc.

Overview

The ML Data Engineering team powers metadata extraction, enrichment, and content understanding across all Scribd brands. We process hundreds of millions of documents and billions of images, and we deliver high-quality metadata to enable content discovery and trust for millions of users worldwide. Our systems operate at massive scale, supporting diverse datasets such as user-generated content (UGC), ebooks, audiobooks, and more. We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely with applied research and product teams to deploy scalable ML and LLM-powered solutions in production.

Role Overview

We’re seeking a Software Engineer II with strong backend development experience and a passion for solving complex data challenges. In this role, you’ll design, build, and optimize distributed systems that extract, enrich, and process metadata for a wide range of content. You’ll work closely with ML engineers, product managers, and cross-functional partners to integrate machine learning models and LLM-based services into production pipelines and deliver impactful, high-performance solutions.

This role offers the opportunity to work on cutting-edge generative AI and metadata enrichment problems at a truly global scale.

Tech Stack

Python, Scala, Ruby on Rails, Airflow, Databricks, Spark, HTTP APIs, AWS (Lambda, ECS, SQS, ElastiCache, SageMaker, CloudWatch), Datadog, and Terraform.

Key Responsibilities

Design and build scalable systems to extract, enrich, and process metadata from millions of documents, images, and audio content.
Leverage LLMs to integrate capabilities like summarization, classification, extraction, and enrichment into metadata pipelines.
Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions.
Optimize and refactor existing systems for performance, scalability, and reliability.
Ensure data accuracy, integrity, and quality through automated validation and monitoring.
Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase.
Manage and maintain data pipelines, security, and infrastructure.

Requirements

4+ years of professional software engineering experience
Proficiency in Python, Scala, Ruby, or similar languages
Experience designing and building distributed systems at scale
Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda
Experience with infrastructure-as-code tools like Terraform (or similar)
Experience working with a public cloud provider (AWS, Azure, or Google Cloud)
Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads
Proven ability to test, profile, and optimize systems for performance, scalability, and reliability
Bachelor’s degree in Computer Science or equivalent professional experience
Bonus:
Experience working with LLMs or integrating ML models into production systems

Working at Scribd, you will be part of a company that values flexibility and intentional in-person collaboration. Scribd Flex lets employees choose their daily work style, with occasional in-person attendance required for all employees, regardless of location.

Compensation

At Scribd, your base pay is one part of your total compensation package and is determined within a range based on location and level. Salaries for the United States vary by region; ranges are provided for California and non-California locations, with additional equity and benefits described in the package.

Working at Scribd, Inc.

Are you currently based in a location where Scribd is able to employ you? Primary residences must be in or near specified cities in the United States, Canada, or Mexico, with reasonable commuting distance.

Benefits, Perks, And Wellbeing At Scribd

Benefits/perks may vary by location
Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees
12 weeks paid parental leave
Short-term/long-term disability plans
401k/RSP matching
Onboarding stipend for home…

