Who We Are; What We Do; Where We’re Going
Magnet Forensics is a global leader in the development of digital investigative software that acquires, analyzes, and shares evidence from computers, smartphones, tablets, and IoT-related devices. We are continually innovating so our customers can deploy advanced and effective tools to protect their companies, communities, and countries.
Serving thousands of customers globally, our solutions are playing a crucial role in modernizing digital investigations, helping investigators fight crime, protect assets, and guard national security.
With employees based around the world, Magnet Forensics has been expanding its global presence. As a part of Magnet Forensics, you can expect to make a difference in the world, no matter what role you play. You’ll be supported through learning and development, and by an incredible team with exceptional talent and integrity.
If you think you would be the right person to join our team working towards this goal, we would love to hear from you!
About the Role
We’re looking for a Technical Manager to lead and grow our Site Reliability Engineering (SRE) function. You’ll establish our first central SRE team - building the team, defining the practice, and shaping how we ensure performance, reliability, and operational excellence across our production platform.
This is a hands-on leadership role: you’ll bring deep technical expertise, empathy for both people and systems, and a strong instinct for scaling practices sustainably. Your work will have a direct impact on our engineering velocity, platform resilience, and customer satisfaction.
Please note: This is a hybrid role, combining remote and in-office work, for candidates within commuting distance of our Waterloo or Ottawa offices. We have a flexible working arrangement.
What You’ll Do
- Build and lead a high-performing SRE team from the ground up
- Define and evolve SRE practices, standards, and tooling - acting as a thought leader across the organization
- Collaborate with engineering teams to improve observability, incident response, reliability, and platform performance
- Stay hands-on: dive into infrastructure, debug issues, write and review code, and guide architectural decisions
- Foster a strong culture of accountability, learning, and psychological safety
- Drive automation and continuous improvement across deployment pipelines, monitoring, and ops workflows
- Partner with Engineering, Product, and Security to align SRE efforts with company priorities
What We’re Looking For
- Proven experience leading SRE or Production Engineering teams at SaaS companies
- Strong technical background in cloud infrastructure (preferably AWS), distributed systems, and DevOps/SRE practices
- Proficient coder in Python or another modern high-level programming language, with experience writing production-quality tools and automation
- Passion for mentorship and team development - you care deeply about your people
- Demonstrated success establishing or scaling reliability engineering practices
- Proficiency with modern observability stacks (e.g. Prometheus, Grafana, Datadog, OpenTelemetry) and infrastructure-as-code (e.g. Terraform, AWS CDK)
- A bias toward automation, repeatability, and operational excellence
- Bonus: experience navigating compliance or high-availability requirements
Why Join Us?
- Build a central SRE function from the ground up
- Play an impactful leadership role with real technical depth
- Join a collaborative, high-trust culture where your voice is heard
- Shape how a fast-moving company scales reliably and sustainably