Senior DevOps Engineer

Jobgether · US

This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Senior DevOps Engineer based in the United States.

This role sits at the heart of platform reliability and delivery, ensuring that engineering teams can ship safely, quickly, and at scale across complex cloud-native environments. You will be responsible for operating and improving Kubernetes-based infrastructure, CI/CD pipelines, and observability systems that power mission-critical applications. The environment is highly hands-on, combining incident response, automation, and continuous improvement across deployment and runtime systems. You will work closely with engineering teams to strengthen release processes, reduce operational friction, and improve system resilience. This position requires a strong DevOps mindset with deep technical fluency across cloud, automation, and monitoring tools. It is ideal for someone who thrives in fast-moving environments where reliability and efficiency are equally important.

Accountabilities:

You will be responsible for ensuring the stability, scalability, and efficiency of platform operations while enabling engineering teams to deliver software reliably and safely. This includes:

  • Operating and improving platform tooling to support reliable software delivery, including ticket triage, issue resolution, and service request handling
  • Maintaining and evolving self-service workflows, including documentation, templates, and deployment guardrails
  • Managing Kubernetes environments, including Helm deployments, namespace management, rollout troubleshooting, and incident response support
  • Supporting and enhancing CI/CD pipelines (primarily GitLab CI), including job configuration, deployment strategies, and quality gates
  • Monitoring and improving observability systems using tools such as Prometheus, Alertmanager, Thanos, and OpenTelemetry
  • Maintaining dashboards, alerts, and SLO/SLA indicators while reducing noise and improving signal quality
  • Supporting service instrumentation across metrics, logs, and traces using OpenTelemetry
  • Participating in on-call rotations, incident response, and post-incident documentation and improvements
  • Driving automation and cost optimization efforts, including resource right-sizing and operational efficiency improvements
  • Contributing to documentation, runbooks, onboarding guides, and operational playbooks
  • Requirements:

    The ideal candidate is an experienced DevOps or SRE professional with strong automation skills, deep cloud-native expertise, and a focus on operational excellence in production environments.

    • 8+ years of experience in DevOps, SRE, or platform engineering roles
    • Strong hands-on experience with Kubernetes and related ecosystem tools (Helm, Docker, ingress controllers, etc.)
    • Solid experience with CI/CD systems, preferably GitLab CI, including pipeline design and deployment strategies
    • Strong scripting ability in Bash or Python (Go is a plus) for automation and tooling
    • Practical experience with AWS services such as IAM, EC2/EKS, S3, CloudWatch, and Secrets Manager
    • Deep understanding of observability concepts including metrics, logs, tracing, and alerting systems
    • Experience with Prometheus, Alertmanager, Thanos, and OpenTelemetry
    • Comfortable working in ticket-driven environments (Jira, ServiceNow) and following change management processes
    • Strong communication skills and ability to collaborate with engineering and product teams
    • Bonus: Terraform experience for infrastructure as code and AWS/Kubernetes provisioning
    • Bonus: API integration experience (Python, Java, or Go) for internal tooling
    • Bonus: Strong Linux and container runtime debugging knowledge
    • Bonus: Exposure to regulated industries such as finance or insurance environments
    • Benefits:

      • Competitive compensation package aligned with experience
      • Fully remote role within the United States
      • Opportunity to work on large-scale, cloud-native infrastructure systems
      • High-impact role focused on reliability, automation, and platform engineering excellence
      • Exposure to modern DevOps tooling including Kubernetes, CI/CD, and observability stacks
      • Collaborative engineering culture focused on continuous improvement and innovation
      • Opportunity to work in fast-paced environments solving complex technical challenges

DevOps pay context

Based on 1,180 disclosed DevOps salaries on RoleSuite, the role pays a median of $142K/year, with most offers between $115K and $173K (10th–90th percentile: $101K–$210K).

See the full DevOps salary breakdown →
Apply →