Jobs / Honeycomb.io

Senior Site Reliability Engineer

Apply Now

Honeycomb.io · United Kingdom · Remote

United Kingdom127,670-150,200 GBP/yearlyRemote

Apply Now

Remuneration

127,670-150,200 GBP/yearly

Location

United Kingdom · Remote

Visa sponsorship

No visa sponsorship

Please note we cannot currently sponsor or support visa transfers at this time.

Job summary

Honeycomb is seeking a Senior Site Reliability Engineer to help scale backend systems for high-volume customers and improve reliability. This role involves working with various technologies like AWS, Kubernetes, and Kafka, and participating in an on-call rotation. The ideal candidate will contribute to a healthy cross-Atlantic engineering culture and navigate tradeoffs between reliability and other organizational goals.

Benefits

Equity with employee-friendly stock programUnlimited PTOHome office stipendCo-working stipendInternet stipendFull benefits coverage for employeesAdditional coverage for dependentsUp to 16 weeks of paid parental leaveAnnual development allowance

Qualifications

Strong experience in AWS
Strong experience in Kubernetes
Experience performing cost analysis and reduction
Solid Helm experience
Solid Terraform experience
Solid CI/CD experience
Project management skills
Software engineering experience
Experience with Golang (plus)
Experience with performance engineering (plus)
Experience with Kafka or other high-volume distributed systems
Excellent written communication skills
Excellent spoken communication skills
Ability to tailor communication for audience
Ability to give direct feedback
Curiosity to learn how people and systems work
Willingness to make people and systems partners in initiatives
Familiarity with observability concepts (SLOs, instrumentation)
Familiarity with data-driven decision making
Comfort operating in ambiguity

Responsibilities

Scale backend systems to support high-volume customers
Build organizational trust through transparent communication
Give and receive direct and kind feedback
Work with backend teams to optimize infrastructure utilization
Train and be trained as an Incident Commander
Develop a healthy cross-Atlantic engineering culture
Participate in the team’s on-call rotation (EU side of follow-the-sun)
Navigate tradeoffs between reliability and organizational goals
Act as an external ambassador through blog posts, conference talks, and presentations (optional)

Skills

AmbassadorAWSGoHelmHiveHoneycombKafkaKubernetesSlackTerraform

Work schedule

On-call rotation

Relocation

Apply Now