Senior Manager, Infrastructure
Wealthsimple
Runtime Platform sits at the core of this mission. This team owns:
- Infrastructure foundations: the Kubernetes clusters, compute, networking, and cloud infrastructure that everything runs on
- Observability: the instrumentation, dashboards, alerting, and tooling that help teams understand their systems in production
- Production operations: how we run at scale, respond to incidents, and continuously improve our reliability postureWe build for the long term, investing in foundational capabilities that compound over time, not just quick fixes.
The role:
- We're looking for a Senior Engineering Manager to own Runtime Platform. You'll report to the VP of Platform Engineering and partner closely with peer leads and Principal Engineers.
- You own the strategy and execution for Runtime Platform. You set the technical direction, build and develop the team, and are accountable for outcomes. You'll be hands-on enough to have credibility with your engineers, and strategic enough to influence cross-functional roadmaps.
- You'll spend significant time on stakeholder management: understanding what product engineering teams need, translating those needs into platform capabilities, and building trust through consistent delivery. You'll represent Runtime Platform in incident reviews, architecture discussions, and planning cycles.
- You'll inherit a team tackling the core challenges of reliability engineering: building systems that fail gracefully, making production legible to the teams that depend on it, and continuously improving how we ship and operate at scale. You prioritize ruthlessly, make bets that compound over time, and own the results.
What do you bring:
- Large-scale infrastructure experience. You've operated platforms serving high-traffic, high-stakes workloads. You understand distributed systems, Kubernetes, cloud infrastructure (AWS preferred), and what it takes to keep them running. You've dealt with capacity planning, cost optimization, and disaster recovery.
- Strong engineering management fundamentals. You've led teams of engineers and managers. You know how to hire, develop, and retain strong people. You've navigated performance challenges and built healthy team cultures.
- Production mindset. You've been on-call. You've led incident response. You understand observability, SLOs, and the discipline required to run reliable systems.
- Regulatory awareness. You've operated in environments with compliance requirements like SOC2 and PCI. You understand the change management discipline required in regulated industries and can balance velocity with auditability.
- Stakeholder fluency. You can translate technical work into business impact. You build trust with non-platform teams. You know when to say no and how to say yes in ways that don't overcommit your team.
