Staff Software Engineer - Databases SRE | Germany | Remote
Grafana Labs · Home Office, Germany · Remote
Home Office, Germany · Exp: 8+ yrs · 109,709-131,651 EUR/yearly · Remote
Remuneration
109,709-131,651 EUR/yearly
Location
Home Office, Germany · Remote
Central European Summer Time (UTC+2)
Visa sponsorship
Not specified
Job summary
Grafana Labs is seeking a Staff Software Engineer - SRE to improve the reliability of its Cloud databases for high-SLA customers. The role involves close collaboration with product engineering squads, designing automation to scale reliability practices, and leading incident response.
Benefits
- Equity
- Bonus
- 30 days annual leave
- Grafana Shutdown Days
Qualifications
- 8+ years of engineering experience, including 4+ years in SRE/CRE/production engineering
- Strong Kubernetes experience in AWS, GCP, or Azure
- Familiarity with infrastructure-as-code tooling
- Strong technical leadership experience
- Experience operating multi-tenant systems in production
- Experience designing and implementing SLOs
- Experience with one or more programming languages
- Experience with Linux operating system internals
- Excellent problem-solving and troubleshooting skills
- Ability to reason about performance, scaling, and failure modes
- Comfortable working within an engineering team
Responsibilities
- Partner closely with product engineering squads
- Own production reliability for high-SLA and complex customer environments
- Design and implement automation to scale reliability practices
- Ensure customers meet SLO targets
- Define and evolve per-tenant SLOs and reliability models
- Proactively reduce SLO burn to prevent repeat incidents
- Serve as primary escalation point and on-call for incidents
- Lead customer-impacting incident response and post-incident reviews
- Contribute to design docs and code reviews
- Influence feature design for scalability and operability
- Build automation to eliminate toil
- Improve alert quality and reduce noisy escalations
Skills
AWS, Azure, GCP, Go, Grafana, Grafana Cloud, Helm, Java, Kubernetes, Linux, Loki, Mimir, Python, Tempo, Terraform
Relocation
No