Infrastructure Engineer

The Innovation Game · Cambridge, ENG, United Kingdom
Cambridge, ENG, United Kingdom · 43,000-77,000 GBP/yearly · Remote
Remuneration
43,000-77,000 GBP/yearly
Location
Cambridge, ENG, United Kingdom
Visa sponsorship
Not specified

Job summary

The Innovation Game (TIG) is seeking an Infrastructure Engineer to build, maintain, and improve its infrastructure. The role takes ownership of deployment systems, monitoring, reliability, and operational resilience, keeping the infrastructure secure, performant, and scalable. The ideal candidate enjoys solving engineering challenges, automating tasks, improving system reliability, and responding effectively to incidents in a startup environment.

Qualifications

  • Experience with Rust.
  • Strong Linux administration skills, particularly Ubuntu and command-line environments.
  • Experience operating and maintaining production systems.
  • Experience managing deployment infrastructure, remote servers, databases, reverse proxies, DNS, and infrastructure services such as Cloudflare.
  • Strong scripting and automation skills.
  • Experience working with Docker and docker-compose.
  • Ability to troubleshoot infrastructure issues.
  • Ability to communicate clearly on technical matters.
  • Verbal and written English language fluency.
  • Comfortable working independently and taking ownership in a fast-moving startup environment.

Responsibilities

  • Develop and improve monitoring, alerting, and observability across infrastructure.
  • Act as a first responder to production incidents, attacks, outages, and unexpected infrastructure failures.
  • Manage and maintain deployment infrastructure, remote servers, databases, networking configuration, and cloud services.
  • Maintain and improve operational documentation, runbooks, and internal infrastructure processes.
  • Automate operational workflows and develop scripts to improve reliability and engineering productivity.
  • Contribute to infrastructure architecture decisions and establish best practices around security, deployment, and operational resilience.
  • Identify and implement infrastructure optimizations, including caching strategies, deployment improvements, and performance tuning.

Skills

Cloudflare, Docker, Grafana, Linux, OpenTelemetry, Prometheus, Rust, Ubuntu, Docker Compose

Languages

English

Relocation

No