Jobs / MPS Limited

Site Reliability Engineer

MPS Limited · London, ENG, United Kingdom
London, ENG, United KingdomExp: 5+ yrsRemote
Remuneration
Not specified
Location
London, ENG, United Kingdom
Visa sponsorship
Not specified

Job summary

Join MPS as a Site Reliability Engineer to manage and support cloud and on-premise platforms, ensuring high reliability, security, and scalability. This role involves migrating the Newquay platform to AWS, utilizing automation and monitoring for continuous improvement across infrastructure and services. You will work closely with internal teams and partners to maintain systems to the highest standards.

Benefits

Access to EOT (Employee Ownership Trust) tax-free bonus25 days annual leaveEnhanced parental leaveEnhanced sick leaveRetention recognition perksBirthday or work anniversary day offCinema perks including premieres and special screeningsRental Housing Deposit SupportHybrid and remote working arrangementsVolunteering day off

Qualifications

  • Five or more years of professional experience installing, configuring, and troubleshooting Linux-based environments.
  • Solid experience in the administration and performance tuning of application stacks (e.g., PHP 7, Apache, Python, NGINX, Percona / MySQL).
  • Experience working across the full SDLC/STLC.
  • Proficiency with automation software (e.g., Puppet, cfengine, Chef).
  • Proficiency with Infrastructure as Code (such as Terraform, CloudFormation).
  • Experience with AWS (specifically EKS, S3, SQS, VPC, TGW, and networking).
  • Experience with Digital cinema or VOD technologies and environments.
  • Skills across multiple technologies with particular emphasis on Linux.
  • Solid scripting skills, including source control.
  • Skilled in the implementation and management of virtualized environments with VMware, Nutanix, or AWS.
  • Experience with Linux, Percona / MySQL / MariaDB (specifically around Galera clustering).
  • Solid networking knowledge (OSI network layers, TCP/IP).
  • Applicable knowledge of technologies used by teams (PHP, Python, C++, Node.js, Selenium, Spectron, Electron, SQL, API Integration).
  • Experience with Docker, EKS, AWS, Terraform.
  • Experience with CI/CD Pipelines (preferably GitLab).

Responsibilities

  • Manage and monitor all Linux infrastructure associated with the Newquay platform.
  • Maintain the existing Percona cluster during migration.
  • Install, configure, test, and maintain operating systems, applications software, and systems management tools.
  • Ensure all devices are appropriately configured and patched.
  • Ensure all devices utilize managed builds via Puppet or alternative tools.
  • Monitor and test application performance for potential bottlenecks.
  • Identify possible solutions and work with developers to implement fixes.
  • Maintain security, backup, and redundancy strategies.
  • Provide 2nd and 3rd level support.
  • Liaise with vendors and other IT personnel for problem resolution.
  • Design, develop, and maintain cloud infrastructure using infrastructure as code tools.
  • Monitor the cloud environment for optimal cost and performance metrics.
  • Work with software development teams to build and maintain continuous integration and deployment processes.
  • Own and maintain the AWS and EKS infrastructure.
  • Ensure the Newquay infrastructure is working and efficient.
  • Ensure the on-call team is aware of how to support and troubleshoot Newquay.
  • Provide regular reports on programs and projects to the EMT and SMT.
  • Ensure product owners are key in the decision-making process.

Skills

AmbassadorAWSChefCloudFormationC++DockerEKSGitLabLinuxMySQLNGINXNode.jsNutanixPHPPuppetPythonS3SQSTerraformVMwareGitLab CI

Industry

Theatrical content distributionElectronic content deliveryHard drive replication

Relocation

No