Director, Site Reliability Engineering

Vertafore · Denver, CO, United States

Denver, CO, United States · Exp: 15+ yrs · 175,000-220,000 USD/yearly · Remote

Remuneration

175,000-220,000 USD/yearly

Location

Denver, CO, United States

Visa sponsorship

No visa sponsorship

The selected candidate must be legally authorized to work in the United States.

Job summary

The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response, automation, and CI/CD practices for assigned product families. The Director will manage multiple teams and collaborate with Product Development, Architecture, Cloud Operations, and Information Security to ensure operational excellence and bridge the gap between development and operations.

Benefits

Medical plan
Vision plan
Dental plan
Life insurance
AD&D insurance
Short Term Disability
Long Term Disability
Pension Plan
Employer Match
Maternity Leave
Paternity Leave
Parental Leave
Employee and Family Assistance Program (EFAP)
Education Assistance
Employee Referral program
Internal Recognition program
PPO options
High-deductible options
Health Savings Account (HSA)
Flexible Spending Accounts (FSA)

Qualifications

Bachelor’s degree in Computer Science, Information Systems, or related field
15+ years in Software Engineering, SRE, DevOps, or reliability roles
8+ years in leadership
Proven ability to leverage software engineering principles and practices to solve reliability and operational challenges
Expertise in CI/CD, observability, and incident response
Strong AWS knowledge
Experience with container orchestration
Proven ability to lead reliability programs across multiple SaaS products
Experience architecting applications or infrastructure for high-growth cloud platforms
Experience in B2B SaaS environments involving large-scale distributed systems
Proven leadership in communicating and influencing at team, peer, and leadership levels
Demonstrated experience driving operational excellence through metrics and KPIs
Background supporting financial services, healthcare, or regulated industries (preferred)

Responsibilities

Lead reliability, performance, and observability initiatives for Vertafore products
Own SLIs/SLOs, incident response, automation, and CI/CD practices for assigned product families
Manage multiple teams
Collaborate with Product Development, Architecture, Cloud Operations, Information Security, and SRE leaders for operational excellence
Bridge the gap between development and operations using a software engineering mindset for system administration
Own the lifecycle of services from inception and design through deployment, operation, and refinement
Define and enforce SLIs/SLOs for Vertafore flagship products
Drive observability strategy across application and infrastructure layers
Oversee CI/CD pipelines for product deployments
Monitor and cap manual, repetitive operational work at 50% using automation and AI tools
Manage error budgets to balance feature release velocity with platform stability
Define and participate in 24x7 on-call rotations for assigned products
Ensure rapid incident resolution and blameless postmortems
Partner with Cloud Operations on capacity planning, OS patching, and load balancing
Align reliability goals with product roadmaps and customer SLAs
Manage managers and engineers
Mentor teams on automation, observability, and reliability best practices

Skills

Ansible
AWS
GitLab
Jenkins

Degrees

Bachelor’s degree in Computer Science
Bachelor’s degree in Information Systems
Bachelor’s degree in related field

Industry

Insurance
B2B SaaS
Financial services
Healthcare
Regulated industries

Relocation
