Jobs / Barclays

Trade Floor Site Reliability Engineer

Barclays · London, ENG, United Kingdom
London, ENG, United KingdomOnsite
Remuneration
Not specified
Location
London, ENG, United Kingdom
Visa sponsorship
Not specified

Job summary

Join Barclays as a Trade Floor Site Reliability Engineer, providing real-time support to Credit EMEA traders and sales teams. This role focuses on maintaining stable and performant critical trading platforms, especially with the expansion of electronic and algo trading. You will work on the London trading floor, ensuring seamless client service by monitoring, maintaining, and resolving complex technical issues within the bank's critical technology infrastructure.

Qualifications

  • Experience in systems engineering, including Linux and Windows.
  • Experience with networking, Kubernetes, and cloud infrastructure.
  • Proficiency in automation tools for system reliability at scale.
  • Proficiency in implementing monitoring, alerting, and observability for critical trading platforms.
  • Ability to automate manual activities.
  • Ability to manage incidents effectively, troubleshoot issues swiftly, communicate proactively, and perform root cause analysis.
  • Prior experience in supporting Credit or IB asset classes (e.g., Rates, Equities, FX).
  • Experience working with PaaS products.
  • Experience with virtualization, containerization, or orchestration of compute/network/storage.
  • In-depth technical knowledge and experience in the assigned area of expertise.
  • Thorough understanding of underlying principles and concepts within the area of expertise.

Responsibilities

  • Provide real-time support to Credit EMEA traders and sales teams.
  • Ensure critical trading platforms are stable and performant.
  • Maintain seamless client service for electronic and algo trading.
  • Implement monitoring, alerting, and observability for critical trading platforms.
  • Automate manual activities.
  • Manage incidents effectively and troubleshoot issues swiftly.
  • Proactively communicate and perform root cause analysis to prevent future incidents.
  • Monitor and maintain critical technology infrastructure.
  • Resolve complex technical issues with minimal disruption to operations.
  • Provide technical support for the service management function.
  • Develop support models and service offerings to improve customer service.
  • Execute preventative maintenance tasks on hardware and software.
  • Utilize monitoring tools and metrics to identify and prevent potential issues.
  • Maintain a knowledge base with detailed documentation of resolved cases.
  • Analyze system logs, error messages, and user reports to identify root causes.
  • Resolve hardware, software, and network issues by fixing or replacing components, reinstalling software, or applying configuration changes.
  • Perform automation, monitoring enhancements, capacity management, resiliency, business continuity management, front office specific support, and stakeholder management.
  • Identify and remediate or raise potential service impacting risks and issues.
  • Proactively assess support activities and implement automations for stability and efficiency.
  • Actively tune monitoring tools, thresholds, and alerting.

Skills

KubernetesLinuxMakeWindows

Relocation

No