
Autodesk
Design and make software for architecture, engineering, construction, and entertainment industries.
Senior Software Reliability Developer
Senior SRE building and operating cloud-native, high‑availability services for Autodesk Platform Services
Job Highlights
About the Role
• Design, configure, and evolve cloud infrastructure to improve availability, resiliency, performance, and cost efficiency as systems scale. • Operate and optimize production workloads on EC2, ECS, and serverless AWS services. • Drive continuous improvements in CI/CD pipelines, deployment strategies, and rollback mechanisms. • Define and track SLIs, SLOs, and error budgets, balancing reliability with delivery velocity. • Ensure systems remain compliant with security and regulatory standards through proactive updates and controls. • Participate in technical design discussions and architectural decision‑making. • Build tools and automation to improve operational efficiency, monitoring, alerting, and observability. • Create dashboards and metrics that enable data‑driven decision‑making and transparency. • Troubleshoot complex production issues, drive root‑cause analysis, and implement long‑term fixes. • Participate in a rotational on‑call schedule to ensure timely service recovery and production health. • Collaborate closely with developers in a Scrum‑based Agile environment.
Key Responsibilities
- ▸cloud infra
- ▸aws ops
- ▸ci/cd
- ▸automation
- ▸metrics
- ▸security
What You Bring
Autodesk is looking for a passionate and experienced Senior Software Reliability Developer to join our Autodesk Platform Services (APS) – Foundational Services team. In this hybrid role, the team meets in the Toronto office one day per week to collaborate and build strong working relationships. Reporting to the Software Engineering Manager, you will be part of an Agile, cross‑functional team designing and operating highly available, cloud‑native services to improve platform reliability and scalability. The ideal candidate is a collaborative, results‑driven developer who enjoys solving challenging problems, is continuously curious, and has a strong desire to learn and take on new technical challenges. They are comfortable presenting working software, discussing progress, and engaging with stakeholders. Autodesk’s culture guides how we work, treat each other, connect with customers and partners, and show up in the world. As an Autodesker you can do meaningful work that helps build a better world designed and made for all. • Own service reliability outcomes, including service reviews, fire drills, and high‑availability assessments. • Bachelor’s degree in Computer Science or a related technical field. • 6+ years of software engineering experience, including 3+ years in a Site Reliability Engineering role with SLO ownership. • Hands‑on experience with observability, monitoring, and logging tools such as Dynatrace, Splunk, OpenTelemetry, CloudWatch, Grafana, or Prometheus. • Strong understanding of SRE principles, best practices, and system architectures. • Proven experience building, deploying, and operating services on AWS. • Experience with continuous delivery methodologies and CI/CD tools. • Solid knowledge of resiliency patterns and cloud security fundamentals. • Experience troubleshooting production issues in collaboration with users and cross‑functional teams. • Infrastructure as Code (IaC) experience, preferably using Terraform. • Familiarity with security and compliance standards such as SOC 2.
Requirements
- ▸bachelor's
- ▸sre
- ▸aws
- ▸terraform
- ▸ci/cd
- ▸observability
Benefits
For Canada‑BC based roles, the starting base salary is expected between $107,000 and $157,300, with offers based on experience and location and may include bonuses, stock grants, and a comprehensive benefits package.
Work Environment
Hybrid