Site Reliability Engineer
Listed on 2026-01-04
IT/Tech
Cloud Computing, Systems Engineer
Job Summary
FreeWheel is seeking a Junior DevOps Engineer / SRE 2 to join the FreeWheel Ops team based in Denver, CO or Chicago, IL. As a member of the Global Operations team, you will be responsible for ensuring the reliability, scalability, and performance of FreeWheel systems. Working closely with engineers and other operations sub-teams, you will manage infrastructure, optimize system reliability, automate daily operations, and resolve technical issues that impact upstream and downstream platforms.
Job Description
Qualifications
- 1-3 years of experience as an SRE, DevOps, or Operations Engineer.
- Programming Skills: Proficient in at least one programming language, such as Python, Go, Java, or Scala, with the ability to write efficient scripts and automation tools.
- Experience with an automation tool or framework such as Ansible, Terraform, Kubernetes, or Docker for automating system deployment and maintenance.
- Experience with cloud platforms (e.g., AWS, OCI, GCP, Azure) is a plus.
- Experience with Terraform and infrastructure as code (IaC) principles is a plus.
- System Monitoring and Log Management: Familiar with monitoring and log management tools such as Prometheus, Grafana, the ELK Stack, or similar tools.
- Team Collaboration and Communication: Excellent communication skills with the ability to convey technical information clearly and concisely to both technical and non-technical stakeholders.
- Proactive learner eager to grow in operations and governance.
- Education: Bachelor’s degree or higher in Computer Science, Software Engineering, or a related field.
Responsibilities
- System Monitoring and Optimization: Design and implement monitoring and alerting systems to ensure the stability, reliability, and performance of data platforms. Join on-call shifts to respond to and resolve issues quickly.
- Automation and Tool Development: Develop and maintain automation tools and scripts for deployment, monitoring, backup, and disaster recovery.
- Performance Optimization: Analyze and optimize data storage, query performance, and data flows to ensure efficient processing of large-scale datasets, reduce latency, and improve processing speed.
- Incident Response and Troubleshooting: Respond quickly to platform failures, perform troubleshooting, and coordinate cross-team efforts to resolve issues and ensure high availability and reliability of data.
- Capacity Planning and Scaling: Work with engineering teams to analyze and forecast capacity requirements, ensuring the system can handle traffic growth, and scale infrastructure accordingly. Support FreeWheel-powered live events.
- Cloud Access Management & Governance: Maintain consistent cloud standards and support enforcement of governance and compliance practices across cloud environments.
- Documentation and Knowledge Sharing: Document the architecture, configurations, and operational procedures for platforms, ensuring knowledge is shared across the team, and provide relevant training.
- Security and Compliance: Ensure platforms meet security standards and compliance requirements to prevent breaches or misuse.
- Cross-Team Collaboration: Collaborate with the engineering, product, and project management teams to support product design and implementation and to solve reliability-related issues.
We offer SRE positions in three areas: SRE2, SRE2-Data, and SRE2-Cloud ENG. While each area has a slightly different day-to-day focus depending on the development teams it supports, the core responsibilities and requirements remain consistent. For candidates who would like to focus on the SRE2-Cloud ENG area, the responsibilities and qualifications will focus more on cloud environment governance and Infrastructure as Code (IaC).
- Understand our Operating Principles; make them the guidelines for how you do your job.
- Own the customer experience - think and act in ways that put our customers first, give them seamless digital options at every touchpoint, and make them promoters of our products and services.
- Know your stuff - be enthusiastic learners, users and advocates of our game-changing technology, products and services, especially our digital tools and experiences.
- Win as a team - make big things…