Jobs / BMO Financial Group

DevOps Engineer/Site Reliability Engineer

BMO Financial Group · Toronto, ON, Canada
Toronto, ON, CanadaExp: 5+ yrs75,900-141,900 CAD/yearlyHybrid
Remuneration
75,900-141,900 CAD/yearly
Location
Toronto, ON, Canada
Visa sponsorship
Not specified

Job summary

BMO Financial Group is seeking a highly skilled DevOps / Site Reliability Engineer to join their technology team in Toronto. The role involves designing, building, and operating resilient infrastructure for critical business applications, with a focus on automation and continuous improvement.

Qualifications

  • 5+ years of experience in DevOps, SRE, or related roles in hybrid environments
  • Strong experience with AWS services and Infrastructure as Code
  • Hands-on experience with configuration management tools
  • Strong experience with observability platforms
  • Proficiency in scripting and development using Python, Bash, and/or JavaScript
  • Experience implementing automation-first solutions across infrastructure and application layers
  • Solid understanding of security and compliance practices within regulated industries
  • Experience with Git-based workflows
  • Working knowledge of ServiceNow and ITSM processes
  • Strong experience with RHEL systems administration and clustering technologies
  • Proven ability to support and operate large-scale, mission-critical systems

Responsibilities

  • Partner with development, operations, and security teams to design and deliver secure, scalable, and resilient infrastructure solutions
  • Build and maintain automation frameworks for deployment, scaling, and observability
  • Design and implement CI/CD pipelines, release strategies, and recovery mechanisms
  • Continuously improve system performance, availability, reliability, and security posture
  • Provide end-to-end ownership of mission-critical platforms, including production support and root-cause analysis
  • Proactively monitor systems using observability tools to identify and address performance and reliability improvements
  • Lead or contribute to incident response, triage, and resolution with a focus on rapid recovery and prevention
  • Support deployment activities and manage implementation issues through to resolution
  • Drive adoption of modern engineering practices, tools, and processes to enhance delivery and operational efficiency
  • Analyze complex technical issues and recommend solutions aligned with business impact
  • Ensure compliance with enterprise standards and regulatory requirements
  • Participate in an on-call rotation to support production systems if needed

Skills

AnsibleAWSBashAWS CDKCloudWatchDynatraceElasticsearchGitGitHubJavaScriptPythonRHELServiceNowTypeScript

Travel

Occasional travel may be required

Industry

Financial services

Relocation

No