Platform Engineer
The Fidelis Partnership · London, ENG, United Kingdom
London, ENG, United Kingdom · Full time · Exp: 7+ yrs · Onsite
Remuneration
Not specified
Location
London, ENG, United Kingdom
Visa sponsorship
Not specified
Job summary
The Fidelis Partnership is seeking a Platform Engineer to join their Analytics Product Engineering team in London. This role involves designing, building, and operating platform capabilities for high-performance, distributed computing products, focusing on infrastructure automation, CI/CD, observability, and performance optimization. The engineer will collaborate with various teams to define and deliver the platform roadmap in a regulated environment.
Qualifications
- At least 7 years of experience in platform engineering, DevOps or Site Reliability Engineering supporting distributed, high-performance compute systems, ideally with Microsoft HPC in a Windows Server environment.
- Strong experience with hybrid, bare metal and virtualized infrastructure environments.
- Strong understanding of networking, security and access control principles and components, including Microsoft Entra (Active Directory).
- Extensive applied experience of observability practices including logging, monitoring and alerting (e.g. SolarWinds).
- Practical experience of commissioning and maintaining Microsoft SQL Server installations.
- Experience designing, deploying and maintaining containerisation and orchestration solutions (e.g. Docker, Kubernetes) is a significant plus.
- Experience using Infrastructure as Code tools such as Terraform or Bicep to define and manage environments is a significant plus.
- Experience building CI/CD pipelines (e.g. Azure DevOps) is a plus.
- Solid experience of provisioning and managing storage, backup/restore and Disaster Recovery solutions.
- Strong communication and stakeholder engagement skills.
Responsibilities
- Define and execute the platform engineering strategy and roadmap aligned to analytics product needs.
- Design and operate distributed runtime environments including compute orchestration and workload scheduling.
- Implement Infrastructure as Code (IaC) and automated environment provisioning.
- Build and maintain CI/CD pipelines supporting distributed systems and shared components.
- Implement observability through logging, metrics and alerting, improving platform reliability and debuggability.
- Monitor and optimise system performance, throughput and resource utilisation.
- Ensure platform security, access control and compliance with internal standards.
- Own and maintain operational documentation, runbooks and support procedures to ensure continuity of service.
- Establish knowledge-sharing and cross-training practices to reduce single points of failure.
- Ensure key platform processes are documented, repeatable and transferable across the team.
- Support resilience through shared ownership of critical platform components, releases and incident response.
- Collaborate with engineering squads and stakeholders to drive adoption of platform capabilities and standards.
Skills
Azure, Azure DevOps, Bicep, Docker, Kubernetes, Terraform, Windows, Windows Server
Industry
Insurance, Reinsurance
Relocation
No