Jobs / Sander Geophysics
System Administrator, DevOps and Infrastructure NEW
Sander Geophysics · Ottawa, ON, Canada
Ottawa, ON, CanadaRemote
Remuneration
Not specified
Location
Ottawa, ON, Canada
Visa sponsorship
Not specified
Job summary
The System Administrator, DevOps and Infrastructure performs day-to-day administration of SGL’s infrastructure through maintaining, upgrading, managing and troubleshooting software, hardware and networks.
Benefits
Comprehensive health benefitsDental benefitsVision benefitsParamedical benefitsLife insurancePaid vacationPersonal leaveMedical leaveOn-site gymOn-site parking
Qualifications
- Postsecondary Degree or Diploma in computer science, related field, or combined education and experience
- Experience with Linux system administration and on-premises infrastructure management
- Proficiency with containerization and orchestration technologies such as Docker and Kubernetes
- Familiarity with CI/CD tooling, version control systems, GitOps workflows, and DevOps practices
- Experience with infrastructure-as-code tools such as Terraform/OpenTofu
- Experience with observability and monitoring tools (Prometheus, Grafana, Loki, Vector)
- Knowledge of networking concepts, directory services (LDAP, Active Directory, OIDC/SAML), database systems (MySQL/MariaDB, PostgreSQL), and distributed storage (Ceph)
- Experience with virtualization platforms such as VMware vSphere and Proxmox VE
- Proficiency in Go, with additional experience in Python, TypeScript, and shell scripting
- Experience building and maintaining custom container images and CI/CD pipeline templates
- Experience with identity and access management, including OAuth2/OIDC, LDAP proxying, and SSO integrations
- Knowledge of secrets management solutions such as HashiCorp Vault
- Ability to obtain secret security clearance
- Ability to produce thorough and effective technical documentation
- Strong interpersonal, teamwork, and communications skills
- Ability to solve problems quickly and automate processes
- Ability to establish priorities and multitask in a fast-paced environment
Responsibilities
- Analyze, assess, and document on-premises physical server infrastructure requirements
- Oversee hardware and software installations, system monitoring, and preventive maintenance
- Design and implement log management and observability pipelines using Rsyslog, Loki, Vector, and Grafana
- Develop information systems policies and procedures for IT operations, including backup workflows, disk utilization, and network share management across Windows and Linux environments
- Assess and monitor infrastructure health metrics by developing system monitoring agents and integrating with directory services
- Evaluate, deploy, and manage containerized services using Docker
- Develop and maintain CI/CD pipelines in GitLab
- Analyze requirements and design bare-metal Kubernetes clusters, including distributed storage (Ceph), networking, and load balancing solutions
- Assess existing workloads and provide recommendations for migration and consolidation onto container orchestration platforms
- Implement monitoring, auto-scaling, and failover mechanisms for high-availability and business continuity
- Author infrastructure-as-code using Terraform/OpenTofu and Talos Linux for provisioning and managing Kubernetes cluster nodes
- Develop and maintain identity and access management integrations
- Build centralized IT management platforms/internal tooling (e.g., inventory, archiving, media management systems) as containerized microservices deployed on Kubernetes
- Design and operate access control reconciliation loops that enforce ACL state across systems based on authoritative directory sources
- Develop automation for employee on-boarding and off-boarding workflows
- Manage virtualization platforms including VMware vSphere and Proxmox VE for workload virtualization, VM provisioning, and resource allocation
- Create custom container images for enterprise mail and groupware platforms, integrating CalDAV/CardDAV, LDAP authentication, and mail server monitoring
- Maintain records and documentation
- Provide project status reports
- Comply with company safety precautions and policies, including reporting hazards and incidents
Skills
BashCephDockerGitLabGoGrafanaIAMKubernetesLinuxLokiMariaDBMySQLOpenTofuPostgreSQLPrometheusProxmoxPythonTerraformTypeScriptVaultVMwarevSphereWindows
Degrees
Postsecondary Degree or Diploma in computer science
Languages
GoPythonTypeScriptShell scripting
Security clearance
Secret security clearance
Relocation
No