Senior Backend Software Engineer (Observability)
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Senior Backend Software Engineer (Observability) based in United Kingdom.
This role sits at the core of a next-generation cloud infrastructure platform built for the AI era, where observability is essential to ensuring reliability, performance, and scalability at massive scale. You will join a highly technical engineering environment focused on building and evolving a unified observability ecosystem that spans logs, metrics, traces, alerting, and advanced troubleshooting capabilities. The work directly impacts how engineers across distributed systems understand, monitor, and optimize complex production environments. You will contribute to high-throughput backend services handling telemetry ingestion, distributed storage, and query processing, helping shape the foundation of modern AI cloud operations. This is a deeply technical, systems-oriented role where backend engineering meets large-scale distributed architecture and operational intelligence. You will collaborate with experienced engineers in a fast-paced, innovation-driven environment where ownership and engineering excellence are central.
Accountabilities:
- Design, build, and scale backend services powering a large-scale observability platform, including telemetry ingestion, storage systems, query engines, and alerting pipelines.
- Develop and optimize distributed systems that process logs, metrics, and traces at high volume with a strong focus on reliability and performance.
- Contribute to the evolution of a unified observability ecosystem supporting developers across cloud infrastructure, Kubernetes, CI/CD, and production workloads.
- Troubleshoot and resolve complex production issues across distributed environments, ensuring system stability and operational excellence.
- Collaborate with cross-functional engineering teams to improve system architecture, scalability, and developer experience.
- Explore and implement innovative approaches to operational intelligence, including AI-assisted troubleshooting and observability enhancements.
- Participate in design discussions, code reviews, and technical planning to ensure high engineering standards and system robustness.
- 5+ years of professional software engineering experience in backend or distributed systems development.
- Strong proficiency in Go (Golang) or willingness to quickly adopt it in a production environment.
- Proven experience designing and operating distributed systems with high scalability and reliability requirements.
- Strong understanding of system performance, fault tolerance, and production-grade software engineering practices.
- Experience debugging and resolving complex issues in large-scale production environments.
- Excellent collaboration and communication skills, with a strong team-oriented mindset.
- Bonus: Experience with observability tools or frameworks such as Prometheus, Grafana, Loki, Jaeger, OpenTelemetry, Mimir, Tempo, VictoriaMetrics, or similar ecosystems.
- Bonus: Experience working with ClickHouse in production environments.
- Competitive compensation package
- Career growth and continuous learning opportunities in advanced cloud and AI infrastructure
- Flexible working model with strong autonomy and ownership
- Collaborative, engineering-driven culture focused on innovation and impact
- Opportunity to work on cutting-edge AI and observability challenges at scale
- International teams and exposure to global engineering practices
- High-impact role shaping next-generation cloud observability systems
Requirements:
Benefits:
Backend pay context
Based on 256 disclosed Backend salaries on RoleSuite, the role pays a median of $166K/year, with most offers between $87K and $198K (10th–90th percentile: $87K–$245K).
See the full Backend salary breakdown →