Infrastructure Engineer - Contract - Hybrid
Pelmorex · Oakville, ON, Canada
Oakville, ON, Canada · Contract · Exp: 3+ yrs · Hybrid
Remuneration
Not specified
Location
Oakville, ON, Canada
Visa sponsorship
No visa sponsorship
Please note this is a contract role until February 2027 and we are only considering candidates who are eligible to work in Canada and can work in a Hybrid model at our Oakville, ON, Canada location.
Job summary
Pelmorex is seeking an Infrastructure Engineer - Meteorological Development for a contract role until February 2027. This role combines infrastructure engineering, platform reliability, and software development to deliver highly available, scalable, and high-performance systems. The engineer will work in a modern cloud-native environment, designing and operating distributed systems on Kubernetes and AWS ECS, and ensuring system reliability through Linux-based operations and observability tooling.
Benefits
- Flexible Work Environment
- Summer Hours
- Paid Vacation Days Off (in addition to Personal Days)
Qualifications
- Degree or diploma in Computer Science, Engineering, Mathematics, or equivalent practical experience.
- At least 3 years of experience designing, implementing, and managing containerized environments using Docker and Kubernetes across platforms such as CloudStack, Proxmox, EKS, and GKE.
- Strong knowledge of cloud platforms and infrastructure technologies including AWS, GCP, CloudStack, and Proxmox.
- Experience with core networking concepts including TCP/IP, NAT, DNS, load balancing, and firewalls.
- Strong experience building and supporting production-grade distributed systems in Linux environments.
- Experience developing production applications in one or more of the following languages: Go, Python, C/C++, or Rust.
- Strong scripting and infrastructure automation experience using technologies such as Python, Bash, Terraform, Helm, Terragrunt, Groovy, or JavaScript.
- Understanding of data structures, algorithms, and performance optimization concepts.
- Experience designing and supporting highly available, scalable systems handling large volumes of data.
- Experience with relational and NoSQL databases, including caching technologies such as Redis or AWS ElastiCache.
- Experience with CI/CD practices and tools such as Jenkins or similar platforms.
- Experience building and consuming REST-based APIs and services.
- Familiarity with security best practices and secure infrastructure design principles.
- Experience with monitoring and observability tools such as Prometheus, Grafana, Elasticsearch, Kibana, Jaeger, and Kiali.
- Understanding of cloud cost optimization, including balancing performance, reliability, scalability, and operational efficiency.
- Strong problem-solving and communication skills with the ability to gather requirements, whiteboard solutions, and collaborate effectively across teams.
- Experience maintaining operational runbooks, automation playbooks, and reliable cloud infrastructure procedures for production systems.
- Experience with AI/ML infrastructure or platforms is considered an asset.
- Keen interest in data, data mining, and problem solving.
- Experience with spatiotemporal data sets and scientific data formats including netCDF, HDF, GRIB, and BUFR is a plus.
Responsibilities
- Design and operate distributed systems on Kubernetes (AWS EKS & GCP GKE) with Istio service mesh and AWS ECS using Docker containers, infrastructure as code (Terraform & Terragrunt), and GitOps/CI/CD pipelines (on Jenkins).
- Work across AWS and/or GCP services.
- Manage application delivery with Helm.
- Support stateful and caching layers such as Redis (AWS ElastiCache).
- Ensure system reliability through Linux-based operations and observability tooling including Prometheus, Grafana, Kibana, Elasticsearch, Jaeger, and Kiali.
- Implement and maintain playbooks and runbooks using Bash scripts and Ansible.
- Work closely with technical leads, developers, operations teams, and infrastructure administrators.
- Modernize and standardize infrastructure through automation, observability, and cloud/platform engineering best practices.
- Ensure platform stability, scalability, security, and operational excellence across critical meteorological systems and data pipelines.
- Provide support to Pelmorex’s meteorological infrastructure and platforms that power The Weather Network, MeteoMedia, ElTiempo.es, and subsidiaries.
- Design and implement core meteorological infrastructure services running on Kubernetes (AWS EKS) with Istio service mesh, EC2 instances, and ECS clusters, deployed through a Jenkins CI/CD pipeline using Helm, SSM, and Bash deployment scripts.
- Improve monitoring systems built on Prometheus, Grafana, and Kibana.
- Use AI-assisted development tools and workflows to improve productivity, accelerate delivery, enhance code quality, support troubleshooting, and streamline operational tasks while maintaining quality and accountability.
- Provide technical advice and produce effort estimates on required work; execute work based on priorities.
- Work with cross-functional teams to ensure all systems function as intended and without impediment to existing products or processes.
- Develop and/or integrate monitoring systems.
- Work with the Technology Operations team to build end-to-end data monitoring into software design, from data acquisition to delivery.
- Use knowledge of databases and database technology to create distributed, multi-tier applications capable of servicing large volumes of data.
- Provide support to operational teams for production systems and ensure services stay available and stable.
- Maintain operational runbooks, automation playbooks, and reliable cloud infrastructure procedures for production systems.
Skills
Ansible, AWS, Bash, C++, Docker, ECS, EKS, Elasticsearch, GCP, GKE, Go, Grafana, Helm, Istio, Jaeger, JavaScript, Jenkins, Kibana, Kubernetes, Linux, Prometheus, Proxmox, Python, Redis, REST, Rust, S3, SNS, SQS, Terraform, Terragrunt
Degrees
- Degree or diploma in Computer Science
- Degree or diploma in Engineering
- Degree or diploma in Mathematics
Contract length
Until February 2027
Relocation
No