Software Engineer, Compute Infrastructure
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Software Engineer, Compute Infrastructure based in Canada.
This role sits at the core of a modern cloud platform responsible for powering large-scale, production-grade applications across multiple regions and infrastructure providers. You will help design and evolve the compute layer that enables developers to deploy and scale services seamlessly without managing underlying infrastructure complexity. The environment is highly systems-focused, working across Kubernetes, distributed systems, and low-level cloud primitives to ensure performance, reliability, and scalability. You will contribute to building and operating infrastructure that spans hyperscalers, custom hardware, and emerging execution environments such as micro-VMs. This is a deeply technical role requiring strong ownership over platform reliability and performance at scale. You will work closely with other infrastructure engineers to investigate complex system behaviors, optimize resource utilization, and improve orchestration mechanisms. The role is ideal for engineers who thrive in solving ambiguous, high-impact infrastructure challenges in fast-growing distributed systems.
Accountabilities:
- Own and evolve core compute infrastructure across multiple cloud providers, regions, and data centers, ensuring scalability and reliability at global scale
- Design and build platform capabilities that improve service performance, availability, deployment flexibility, and fault tolerance across distributed systems
- Investigate and resolve complex infrastructure issues spanning Kubernetes clusters, control planes, data planes, and underlying kernel-level systems
- Improve system efficiency and performance through profiling, benchmarking, experimentation, and continuous tuning of infrastructure components
- Develop and maintain infrastructure automation, including cluster provisioning, configuration, testing, upgrades, and lifecycle management
- Contribute to the design and implementation of orchestration systems, controllers, and scheduling logic using systems programming languages such as Go or Rust
- Participate in on-call rotations and incident response, improving observability, reliability, and operational maturity of the platform
- Collaborate with engineering teams across the organization to ensure a stable, secure, and predictable compute environment
- 7+ years of experience building and operating large-scale distributed systems or cloud infrastructure platforms
- Deep expertise with Kubernetes or similar container orchestration systems in production environments
- Strong programming skills in Go, Rust, or similar systems-level languages used for infrastructure development
- Proven experience designing, debugging, and operating complex distributed systems at scale
- Strong understanding of infrastructure tradeoffs involving performance, reliability, scalability, and cost efficiency
- Experience executing high-risk infrastructure changes or upgrades with minimal downtime
- Hands-on experience with observability, incident response, and production on-call responsibilities
- Preferred: experience with virtualization technologies such as Firecracker, gVisor, or Kata Containers
- Preferred: familiarity with Linux internals, eBPF, kernel tuning, or low-level system optimization
- Preferred: experience improving container startup performance, resource isolation, or multi-tenant security
- Competitive compensation with equity participation
- 100% employer-paid medical insurance and heavily subsidized dental and vision coverage for employees and dependents
- 4 weeks of paid vacation annually
- 14 weeks of fully paid parental leave for all parents
- Long-term disability coverage and life insurance
- Flexible spending accounts (FSA) and health savings accounts (HSA) options
- Monthly wellness, lifestyle, and mental health stipend
- Home office stipend and monthly internet and phone reimbursement
- Commuter benefits for onsite employees
- Continuous learning and professional development support
Requirements:
Benefits:
Software pay context
Based on 7,897 disclosed Software salaries on RoleSuite, the role pays a median of $158K/year, with most offers between $123K and $200K (10th–90th percentile: $102K–$235K).
See the full Software salary breakdown →