Jobs / Honeycomb.io

Senior Site Reliability Engineer

Honeycomb.io · United Kingdom · Remote
United Kingdom127,670-150,200 GBP/yearlyRemote
Remuneration
127,670-150,200 GBP/yearly
Location
United Kingdom · Remote
Visa sponsorship
No visa sponsorship
Please note we cannot currently sponsor or support visa transfers at this time.

Job summary

Honeycomb is seeking a Senior Site Reliability Engineer to help scale backend systems for high-volume customers and improve reliability. This role involves working with various technologies like AWS, Kubernetes, and Kafka, and participating in an on-call rotation. The ideal candidate will contribute to a healthy cross-Atlantic engineering culture and navigate tradeoffs between reliability and other organizational goals.

Benefits

Equity with employee-friendly stock programUnlimited PTOHome office stipendCo-working stipendInternet stipendFull benefits coverage for employeesAdditional coverage for dependentsUp to 16 weeks of paid parental leaveAnnual development allowance

Qualifications

  • Strong experience in AWS
  • Strong experience in Kubernetes
  • Experience performing cost analysis and reduction
  • Solid Helm experience
  • Solid Terraform experience
  • Solid CI/CD experience
  • Project management skills
  • Software engineering experience
  • Experience with Golang (plus)
  • Experience with performance engineering (plus)
  • Experience with Kafka or other high-volume distributed systems
  • Excellent written communication skills
  • Excellent spoken communication skills
  • Ability to tailor communication for audience
  • Ability to give direct feedback
  • Curiosity to learn how people and systems work
  • Willingness to make people and systems partners in initiatives
  • Familiarity with observability concepts (SLOs, instrumentation)
  • Familiarity with data-driven decision making
  • Comfort operating in ambiguity

Responsibilities

  • Scale backend systems to support high-volume customers
  • Build organizational trust through transparent communication
  • Give and receive direct and kind feedback
  • Work with backend teams to optimize infrastructure utilization
  • Train and be trained as an Incident Commander
  • Develop a healthy cross-Atlantic engineering culture
  • Participate in the team’s on-call rotation (EU side of follow-the-sun)
  • Navigate tradeoffs between reliability and organizational goals
  • Act as an external ambassador through blog posts, conference talks, and presentations (optional)

Skills

AmbassadorAWSGoHelmHiveHoneycombKafkaKubernetesSlackTerraform

Work schedule

On-call rotation

Relocation

No