Jobs / Grafana Labs

Staff Software Engineer - Databases SRE | Germany | Remote

Grafana Labs · Home Office, Deutschland · Remote
Home Office, DeutschlandExp: 8+ yrs109,709-131,651 EUR/yearlyRemote
Remuneration
109,709-131,651 EUR/yearly
Location
Home Office, Deutschland · Remote
Central European Summer Time (UTC+2)
Visa sponsorship
Not specified

Job summary

Grafana Labs is seeking a Staff Software Engineer - SRE to enhance the reliability of their Cloud databases for high-SLA customers. The role involves close collaboration with product engineering squads, designing automation for reliability practices, and leading incident response efforts.

Benefits

EquityBonus30 days annual leaveGrafana Shutdown Days

Qualifications

  • 8+ years engineering experience, 4+ in SRE/CRE/production engineering
  • Strong Kubernetes experience in AWS, GCP, or Azure
  • Familiarity with infrastructure-as-code tooling
  • Strong technical leadership experience
  • Experience operating multi-tenant systems in production
  • Experience designing and implementing SLOs
  • Experience with one or more programming languages
  • Experience with Linux operating systems internals
  • Excellent problem-solving and troubleshooting skills
  • Ability to reason about performance, scaling, and failure modes
  • Comfortable working within an engineering team

Responsibilities

  • Partner closely with product engineering squads
  • Own production reliability for high-SLA and complex customer environments
  • Design and implement automation to scale reliability practices
  • Ensure customers meet SLO targets
  • Define and evolve per-tenant SLOs and reliability models
  • Proactively reduce SLO burn to prevent repeat incidents
  • Serve as primary escalation point and on-call for incidents
  • Lead customer-impacting incident response and post-incident reviews
  • Contribute to design docs and code reviews
  • Influence feature design for scalability and operability
  • Build automation to eliminate toil
  • Improve alert quality and reduce noisy escalations

Skills

AWSAzureGCPGoGrafanaGrafana CloudHelmJavaKubernetesLinuxLokiMimirPythonTempoTerraform

Relocation

No