Director, Site Reliability Engineering

Vertafore · Denver, CO, United States
Denver, CO, United States · Exp: 15+ yrs · 175,000-220,000 USD/yearly · Remote
Remuneration
175,000-220,000 USD/yearly
Location
Denver, CO, United States
Visa sponsorship
No visa sponsorship
The selected candidate must be legally authorized to work in the United States.

Job summary

The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response, automation, and CI/CD practices for assigned product families. The Director will manage multiple teams and collaborate with Product Development, Architecture, Cloud Operations, and Information Security to ensure operational excellence and bridge the gap between development and operations.

Benefits

  • Medical plan
  • Vision plan
  • Dental plan
  • Life insurance
  • AD&D insurance
  • Short Term Disability
  • Long Term Disability
  • Pension Plan
  • Employer Match
  • Maternity Leave
  • Paternity Leave
  • Parental Leave
  • Employee and Family Assistance Program (EFAP)
  • Education Assistance
  • Employee Referral program
  • Internal Recognition program
  • PPO options
  • High-deductible options
  • Health Savings Account (HSA)
  • Flexible Spending Accounts (FSA)

Qualifications

  • Bachelor’s degree in Computer Science, Information Systems, or related field
  • 15+ years in Software Engineering, SRE, DevOps, or reliability roles
  • 8+ years in leadership
  • Proven ability to leverage software engineering principles and practices to solve reliability and operational challenges
  • Expertise in CI/CD, observability, and incident response
  • Strong AWS knowledge
  • Experience with container orchestration
  • Proven ability to lead reliability programs across multiple SaaS products
  • Experience architecting applications or infrastructure for high-growth cloud platforms
  • Experience in B2B SaaS environments involving large-scale distributed systems
  • Proven leadership in communicating and influencing at team, peer, and leadership levels
  • Demonstrated experience driving operational excellence through metrics and KPIs
  • Background supporting financial services, healthcare, or regulated industries (preferred)

Responsibilities

  • Lead reliability, performance, and observability initiatives for Vertafore products
  • Own SLIs/SLOs, incident response, automation, and CI/CD practices for assigned product families
  • Manage multiple teams
  • Collaborate with Product Development, Architecture, Cloud Operations, Information Security, and SRE leaders for operational excellence
  • Bridge the gap between development and operations using a software engineering mindset for system administration
  • Own the lifecycle of services from inception and design through deployment, operation, and refinement
  • Define and enforce SLIs/SLOs for Vertafore flagship products
  • Drive observability strategy across application and infrastructure layers
  • Oversee CI/CD pipelines for product deployments
  • Monitor and cap manual, repetitive operational work at 50% using automation and AI tools
  • Manage error budgets to balance feature release velocity with platform stability
  • Define and participate in 24x7 on-call rotations for assigned products
  • Ensure rapid incident resolution and blameless postmortems
  • Partner with Cloud Operations on capacity planning, OS patching, and load balancing
  • Align reliability goals with product roadmaps and customer SLAs
  • Manage managers and engineers
  • Mentor teams on automation, observability, and reliability best practices

Skills

  • Ansible
  • AWS
  • GitLab
  • Jenkins

Degrees

  • Bachelor’s degree in Computer Science
  • Bachelor’s degree in Information Systems
  • Bachelor’s degree in related field

Industry

  • Insurance
  • B2B SaaS
  • Financial services
  • Healthcare
  • Regulated industries

Relocation

No