×
Register Here to Apply for Jobs or Post Jobs. X

Senior Site Reliability Developer; SRE

Job in Toronto, Ontario, C6A, Canada
Listing for: PowerToFly
Full Time position
Listed on 2026-02-28
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, SRE/Site Reliability, IT Support
Salary/Wage Range or Industry Benchmark: 107000 CAD Yearly CAD 107000.00 YEAR
Job Description & How to Apply Below
Position: Senior Site Reliability Developer (SRE)
Job

Requisition

Position Overview
We are seeking a highly motivated and experienced Senior Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for the Autodesk Platform Services and Emerging Technologies organization. The team delivers high-value, exabyte-scale and cloud data platform components powering desktop, mobile, and web products. This enables our product teams to build cohesive in-product data experiences, our partners to integrate and expand our data, and our end-users to work with their data across all Autodesk products.

This pivotal role focuses on ensuring the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure. Reporting to the Engineering Manager, you will be leading design and development of resilient and scalable architecture and innovative solutions for the platform. You will independently manage and deliver end-to-end solutions while engaging with key stakeholders and partners.

Responsibilities

Lead architecture, solution design, development and maintenance of cloud infrastructure for microservices architecture

Independently manage requirement analysis, solution design, implementation, and release planning

Ensure strict adherence to security, trust, compliance guidelines, and standards

Streamline CI/CD processes, improve system reliability, and ensure infrastructure scalability and security

Automate infrastructure deployment, scaling, and management using modern Dev Ops tools and practices

Implement and maintain configuration management and infrastructure as code (IaC) using Terraform

Lead Disaster Recovery (DR) strategies, failover exercises, gamedays, and periodic maintenance activities

Contribute to remediation of critical vulnerabilities (CVEs)

Promote and document security and best practices across all pillars of Dev Ops/SRE throughout system design

Provide real-time operational support and collaborate across functions to resolve system, infrastructure, and CI/CD issues

Participate in on-call rotations, providing critical 24x7 support for production systems

Minimum Qualifications

Bachelor’s degree or higher in Computer Science, Engineering, or a related field

5+ years of progressive experience in Site Reliability Engineering, Dev Ops, or a similar field

Proficiency with managing AWS resources and understanding of networking and security protocols

Expertise in infrastructure as code (IaC) and cloud automation tools such as Terraform, Serverless, and Cloud Formation

Expertise in defining and building CI/CD processes with tools like Jenkins, Git Hub, and Artifactory

Experience with container-based technologies like Docker, Kubernetes and AWS ECS

Experience with monitoring and logging tools such as Dynatrace, Grafana, Data Dog, ELK Stack, and Cloud Watch

Experience in Linux Systems Administration, scripting, and troubleshooting in a production environment

Strong experience with UNIX/Linux systems and programming languages such as Python, Go, Bash, Groovy, and Node.js

Technology Stack:
Java/Spring Boot, AWS (ECS Fargate, Elastic Cache, Lambda, Kinesis, Dynamo

DB, VPC, IAM policies, API Gateway, NLB/ALB, Route 53, Cloud Watch, Kibana, Open Search), Kafka, Flink, Jenkins, Git Hub, Jira, Google Apigee, Service Now, and Splunk

Preferred Qualifications

Knowledge of applying AI and ML solutions for engineering processes and/or Dev Ops automation

Knowledge of standardized observability frameworks such as Open Telemetry

Relevant certifications (e.g., AWS Certified Dev Ops Engineer, AWS Site Reliability Engineer)

Broad knowledge of AWS, Redis, server programming, databases, and cloud architectures

Broad knowledge of data streaming pipelines like Kinesis, Firehose, and Kafka

Knowledge on core Java and Spring Boot concepts in JVM optimization

Knowledge on build tools, e.g. Gradle

Strong interpersonal and communication skills to effectively collaborate in an Agile/Scrum-oriented environment

Self-directed team player and independent contributor, demonstrating accountability and end-to-end ownership

Learn More
About Autodesk
Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings…
Position Requirements
10+ Years work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary