Jobs / Lazer

Senior Infrastructure/ DevOps Engineer Fintech

Lazer · Canada
CanadaExp: 5+ yrsRemote
Remuneration
Not specified
Location
Canada
Visa sponsorship
Not specified

Job summary

Lazer is seeking a Senior Infrastructure/DevOps Engineer with a minimum of 5 years of experience in DevOps, Infrastructure, or SRE roles. The role involves implementing and adapting infrastructure using IaC tools, designing CI/CD pipelines, and managing core cloud services with a strong focus on security and automation. The ideal candidate will be proficient in Docker, Kubernetes, and cloud providers like AWS or GCP, and capable of writing clean, maintainable code in Go, Python, or Node.js.

Benefits

Competitive compensationUnlimited PTOFull healthcare benefitsDental benefitsVision benefits401k for US employeesWork from anywhereWork/life balanceRegular team retreats

Qualifications

  • Minimum of 5 years dedicated experience in DevOps, Infrastructure, or SRE roles
  • Expertise with Docker, Kubernetes (k8s), and Terraform/Pulumi
  • Deep, proven expertise in either AWS or GCP infrastructure, with the ability to quickly grasp and transition to other cloud providers
  • Strong ability to write clean, maintainable code for automation in Go, Python, or Node.js
  • Demonstrable experience implementing and maintaining modern cloud security controls and meeting key compliance standards (SOC 2, PIPEDA, HIPAA, and/or GDPR)
  • Proven ability to quickly onboard, diagnose problems, and propose and implement solutions with minimal oversight
  • Experienced in a consultant or freelancer capacity, with the ability to understand and communicate effectively with both technical and non-technical stakeholders
  • Expertise in at least one major container platform: EKS, GKE, ECS, Fargate, or Cloud Run
  • Experience with S3 (AWS) or Cloud Storage (GCP)
  • Experience with RDS (AWS) or CloudSQL (GCP)
  • Experience deploying event-driven components using AWS Lambda, GCP Cloud Functions, or equivalents
  • Experience with CDNs and message queues
  • Production AI/agent experience, including running LLM or agent systems
  • Experience with AI observability and cost control, including tracing multi-step agent runs, token cost, latency, output quality, budgets, rate limiting, and caching (Langfuse, LangSmith, Arize, or similar)
  • Experience with infrastructure for AI systems: model gateways and provider routing with failover (LiteLLM, Bedrock, Vertex), durable execution for long-running multi-step workflows (Temporal, Step Functions, Inngest), eval and regression pipelines for prompt or model changes, and retrieval, vector-store, and context plumbing (including MCP)
  • Experience with vector databases and GPU/TPU compute
  • Domain experience in fintech or crypto/web3 environments
  • Experience with crypto/web3 infrastructure: running nodes (Ethereum, Solana, or others), indexing solutions (The Graph, custom indexers), or RPC infrastructure
  • Experience with payment processing, ledger architecture, or financial transaction systems, and meeting compliance requirements in regulated environments
  • Experience with high-volume, mission-critical systems: real-time data flows, websocket feeds, payment rails, or distributed architectures handling millions of transactions

Responsibilities

  • Quickly implement and adapt infrastructure using Terraform, Pulumi, or other major IaC tools
  • Deeply understand how to design, build, and optimize secure, multi-stage Dockerfiles
  • Design, build, and manage robust CI/CD pipelines to automate testing, building, and deployment across environments
  • Provision and manage foundational cloud services
  • Apply networking concepts including load balancers, VPNs for secure connectivity, and private VPCs for isolation
  • Utilize subnetting, routing, VPC peering, and NAT gateways to build secure systems
  • Protect PII; apply encryption, secrets management, network firewalls, and web application firewalls (AWS WAF, GCP Cloud Armor) following security best practices
  • Write high-quality automation and tooling in Go, Python, Node.js, or Bash for client-specific operational challenges
  • Ensure robust monitoring and high system uptime

Skills

AWSBashCloud ArmorCloud FunctionsCloud RunDatadogDockerECSEKSFargateGCPGKEGoKubernetesAWS LambdaNode.jsPrometheusPulumiPythonS3TerraformCloud SQL

Certifications

AWS cloud certificationsGCP cloud certifications

Industry

FintechCrypto/Web3

Relocation

No