Infrastructure Engineer
The Innovation Game · Cambridge, ENG, United Kingdom
Cambridge, ENG, United Kingdom · 43,000-77,000 GBP/yearly · Remote
Remuneration
43,000-77,000 GBP/yearly
Location
Cambridge, ENG, United Kingdom
Visa sponsorship
Not specified
Job summary
The Innovation Game (TIG) is seeking an Infrastructure Engineer to build, maintain, and improve its infrastructure. The role involves taking ownership of deployment systems, monitoring, reliability, and operational resilience, and keeping the infrastructure secure, performant, and scalable. The ideal candidate enjoys solving engineering challenges, automating tasks, improving system reliability, and responding effectively to incidents in a startup environment.
Qualifications
- Experience with Rust.
- Strong Linux administration skills, particularly Ubuntu and command-line environments.
- Experience operating and maintaining production systems.
- Experience managing deployment infrastructure, remote servers, databases, reverse proxies, DNS, and infrastructure services such as Cloudflare.
- Strong scripting and automation skills.
- Experience working with Docker and docker-compose.
- Ability to troubleshoot infrastructure issues.
- Ability to communicate clearly on technical matters.
- Verbal and written English language fluency.
- Comfortable working independently and taking ownership in a fast-moving startup environment.
Responsibilities
- Develop and improve monitoring, alerting, and observability across infrastructure.
- Act as a first responder to production incidents, attacks, outages, and unexpected infrastructure failures.
- Manage and maintain deployment infrastructure, remote servers, databases, networking configuration, and cloud services.
- Maintain and improve operational documentation, runbooks, and internal infrastructure processes.
- Automate operational workflows and develop scripts to improve reliability and engineering productivity.
- Contribute to infrastructure architecture decisions and establish best practices around security, deployment, and operational resilience.
- Identify and implement infrastructure optimizations, including caching strategies, deployment improvements, and performance tuning.
Skills
Cloudflare, Docker, Grafana, Linux, OpenTelemetry, Prometheus, Rust, Ubuntu, Docker Compose
Languages
English
Relocation
No