Jobs / BMO Financial Group
DevOps Engineer/Site Reliability Engineer
BMO Financial Group · Toronto, ON, Canada
Toronto, ON, CanadaExp: 5+ yrs75,900-141,900 CAD/yearlyHybrid
Remuneration
75,900-141,900 CAD/yearly
Location
Toronto, ON, Canada
Visa sponsorship
Not specified
Job summary
BMO Financial Group is seeking a highly skilled DevOps / Site Reliability Engineer to join their technology team in Toronto. The role involves designing, building, and operating resilient infrastructure for critical business applications, with a focus on automation and continuous improvement.
Qualifications
- 5+ years of experience in DevOps, SRE, or related roles in hybrid environments
- Strong experience with AWS services and Infrastructure as Code
- Hands-on experience with configuration management tools
- Strong experience with observability platforms
- Proficiency in scripting and development using Python, Bash, and/or JavaScript
- Experience implementing automation-first solutions across infrastructure and application layers
- Solid understanding of security and compliance practices within regulated industries
- Experience with Git-based workflows
- Working knowledge of ServiceNow and ITSM processes
- Strong experience with RHEL systems administration and clustering technologies
- Proven ability to support and operate large-scale, mission-critical systems
Responsibilities
- Partner with development, operations, and security teams to design and deliver secure, scalable, and resilient infrastructure solutions
- Build and maintain automation frameworks for deployment, scaling, and observability
- Design and implement CI/CD pipelines, release strategies, and recovery mechanisms
- Continuously improve system performance, availability, reliability, and security posture
- Provide end-to-end ownership of mission-critical platforms, including production support and root-cause analysis
- Proactively monitor systems using observability tools to identify and address performance and reliability improvements
- Lead or contribute to incident response, triage, and resolution with a focus on rapid recovery and prevention
- Support deployment activities and manage implementation issues through to resolution
- Drive adoption of modern engineering practices, tools, and processes to enhance delivery and operational efficiency
- Analyze complex technical issues and recommend solutions aligned with business impact
- Ensure compliance with enterprise standards and regulatory requirements
- Participate in an on-call rotation to support production systems if needed
Skills
AnsibleAWSBashAWS CDKCloudWatchDynatraceElasticsearchGitGitHubJavaScriptPythonRHELServiceNowTypeScript
Travel
Occasional travel may be required
Industry
Financial services
Relocation
No