Jobs / IDEXX Laboratories

Senior Site Reliability Engineer

IDEXX Laboratories · Westbrook, ME, United States
Westbrook, ME, United StatesExp: 7+ yrs100,000-125,000 USD/yearlyOnsite
Remuneration
100,000-125,000 USD/yearly
Location
Westbrook, ME, United States
Visa sponsorship
Not specified

Job summary

IDEXX Laboratories is seeking a Senior Site Reliability Engineer to join their Site Reliability Engineering Team. This role involves modernizing infrastructure, strengthening system resilience, and scaling a global platform, leveraging AI tools to accelerate delivery and improve system quality. The position is a high-impact individual contributor role with end-to-end ownership of deployment and release systems, including CI/CD architecture and infrastructure modernization.

Benefits

Annual cash bonusHealth benefitsDental benefitsVision benefits401k matchingFinancial supportPet insuranceMental health resourcesVolunteer paid days offEmployee stock programFoundation donation matching

Qualifications

  • 7+ years of experience in DevOps, SRE, Platform Engineering, or similar roles focused on CI/CD, cloud infrastructure, and system reliability
  • Strong experience with AWS Serverless architectures
  • Strong experience with Terraform and CloudFormation
  • Strong experience with CI/CD pipelines, preferably GitHub Actions
  • Strong experience with Azure Entra ID, OAuth2, OpenID Connect
  • Strong experience with Maven build tooling
  • Strong experience with Git-based version control workflows, preferably GitHub
  • Proven ability to design and optimize deployment pipelines
  • Proven ability to troubleshoot complex distributed systems
  • Proven ability to make data-driven decisions
  • Proven ability to translate business requirements into scalable technical solutions
  • Strong communication skills
  • Strong collaboration skills
  • Strong organizational skills
  • Understanding of system design patterns for reliability and scalability
  • Experience with Kotlin or Java development (nice to have)
  • Experience with NoSQL databases (e.g., DynamoDB) and relational databases (e.g., PostgreSQL) (nice to have)
  • Experience working in Agile or Scrum environments (nice to have)
  • Familiarity with artifact management tools such as JFrog Artifactory (nice to have)
  • Experience defining and managing SLAs, SLOs, and SLIs (nice to have)

Responsibilities

  • Own the design and evolution of CI/CD pipeline architecture, governance, and standards
  • Modernize and automate deployment pipelines for Kotlin-based AWS Lambda services using GitHub Actions
  • Standardize infrastructure and deployment processes across services
  • Reduce manual deployment effort through automation
  • Leverage AI tools to improve productivity and system quality
  • Design, build, and evolve scalable, resilient AWS cloud infrastructure
  • Lead implementation of disaster recovery, high availability, and fault-tolerant designs
  • Automate infrastructure provisioning and lifecycle management
  • Build and maintain end-to-end observability (metrics, logging, tracing, alerting)
  • Establish effective alerting to reduce noise and ensure high-signal incident detection
  • Proactively identify and address system risks before they impact customers
  • Lead incident response in shared on-call rotation (triage, mitigation, communication)
  • Drive root cause analysis and blameless postmortems to prevent recurrence
  • Own and govern the release process, including deployment gates and approvals
  • Review and approve deployment plans to ensure quality and stability
  • Optimize the build and release lifecycle for speed, consistency, and reliability
  • Manage cross-repository dependencies and versioning strategies
  • Lead remediation of security vulnerabilities, collaborating with the Security team
  • Establish processes to proactively prevent new security risks
  • Embed secure development and deployment practices into pipelines

Skills

ArtifactoryAWSAzureCloudFormationCloudFrontDynamoDBEventBridgeGitGitHubGitHub ActionsJavaKotlinAWS LambdaMakeMavenOpenTelemetryPostgreSQLPythonS3SNSSQSTerraformTypeScript

Languages

KotlinJavaPythonTypeScript

Relocation

No