
Cloud Site Reliability Engineer

Imagine Communications · Waterloo, ON, Canada
Waterloo, ON, Canada · Exp: 6+ yrs · 115,000-125,000 CAD/yearly · Onsite
Remuneration: 115,000-125,000 CAD/yearly
Location: Waterloo, ON, Canada
Visa sponsorship: Not specified

Job summary

The Site Reliability Engineer (SRE) will apply deep expertise in DevOps practices, automation, infrastructure orchestration, configuration management, and continuous integration to support the delivery and operation of mission‑critical applications. This role will focus primarily on the development, deployment, and reliability of the xGPlatform and its associated peripheral services. The SRE will play a key role in advancing Imagine Communications toward a robust, multitenant, multi‑cloud product strategy. The ideal candidate brings a strong background in, and passion for, software development, DevOps, and cloud technologies. This individual will design and build scalable systems using a diverse technology stack that includes AWS, Azure, Node.js, C#, and modern deployment automation and tooling. In addition to building reliable services, the SRE will help engineering teams work more efficiently and effectively.

Benefits

  • Medical insurance
  • Dental insurance
  • Vision insurance
  • Life insurance
  • Travel insurance
  • Employee Assistance Program
  • Wellness programs
  • LifeSpeak
  • Vitality
  • Paid volunteer time

Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or a related technical field.
  • 6+ years of professional experience in Site Reliability Engineering, DevOps, Cloud Engineering, or Software Development roles supporting production systems.
  • Strong understanding of cloud architecture principles, including scalability, resiliency, high availability, security, and cost optimization.
  • Hands‑on experience designing, deploying, and operating applications and infrastructure in AWS and/or Azure.
  • Proficiency with infrastructure‑as‑code and cloud‑native technologies (e.g., Terraform, Ansible, Docker, Kubernetes, Prometheus, messaging or caching systems).
  • Extensive experience with monitoring, logging, and observability tools and practices.
  • Proven ability to troubleshoot and resolve complex production issues, including ownership of Tier‑3 incidents and root cause analysis.
  • Experience integrating systems using Web APIs, messaging, or event‑driven architectures.
  • Working knowledge of SQL and NoSQL databases, including schema design, querying, and operational considerations.
  • Experience working in Agile and DevOps environments.
  • Strong communication and collaboration skills, with the ability to work effectively across engineering, architecture, and business teams.
  • Experience operating and supporting mission‑critical, customer‑facing, or managed service platforms.
  • Experience leading or contributing to incident response, post‑incident reviews, and reliability improvements.
  • Familiarity with SRE practices such as service health indicators and reliability objectives.
  • Experience identifying and reducing operational toil through automation and process improvement.
  • Experience contributing to platform architecture decisions or reusable cloud deployment patterns.
  • Hands‑on experience with infrastructure and delivery tools such as Terraform, Ansible, or Azure DevOps.
  • Experience with scripting/programming languages such as Go, Node.js, PowerShell, Python, or Shell scripting.
  • Exposure to cost management, capacity planning, and performance optimization in cloud environments.
  • Familiarity with cloud security and compliance standards such as SOC 2.

Responsibilities

  • Apply deep expertise in DevOps practices, automation, infrastructure orchestration, configuration management, and continuous integration to support the delivery and operation of mission‑critical applications.
  • Focus primarily on the development, deployment, and reliability of the xGPlatform and its associated peripheral services.
  • Play a key role in advancing Imagine Communications toward a robust, multitenant, multi‑cloud product strategy.
  • Design and build scalable systems using a diverse technology stack that includes AWS, Azure, Node.js, C#, and modern deployment automation and tooling.
  • Empower engineering teams to work more efficiently and effectively.
  • Design, build, deploy, and operate applications and infrastructure across AWS, Azure, and other cloud service providers.
  • Manage and maintain development, staging, and production environments using infrastructure‑as‑code and automation best practices.
  • Design and implement systems and tooling that improve the reliability, scalability, security, and supportability of Imagine’s Managed Services offerings.
  • Promote DevOps and cloud best practices within the team to improve quality, reduce operational risk, increase security, drive efficiency and reuse, and optimize costs.
  • Collaborate with product, architecture, and business stakeholders to understand user needs and translate them into reliable, scalable technical solutions.
  • Integrate and orchestrate diverse cloud services and internal systems using Web APIs and event‑driven architectures.
  • Architect, document, and review system designs with a strong focus on security, resiliency, and operational excellence.
  • Build and integrate cloud‑based services and automation to improve workforce productivity and reduce manual operational effort.
  • Partner with architecture and development teams to design reusable deployment patterns and establish governance and observability models.
  • Apply cloud compliance, security, and reliability standards to application and platform design.
  • Lead the investigation, troubleshooting, and resolution of Tier‑3 production incidents and escalations, contributing to root cause analysis and continuous improvement.

Skills

Ansible, AWS, Azure, Azure DevOps, Bash, C#, Docker, Go, Kubernetes, Node.js, PowerShell, Prometheus, Python, Terraform, Git

Certifications

  • AWS Certified Solutions Architect
  • DevOps Engineer

Degrees

  • Bachelor’s degree in Computer Science
  • Bachelor’s degree in Engineering
  • Bachelor’s degree in a related technical field

Languages

Node.js, C#, Go, PowerShell, Python, Shell scripting

Relocation

No