Jobs / NTT DATA

Site Reliability Engineering (SRE) / Observability Technical Lead

NTT DATA · London, ENG, United Kingdom
London, ENG, United KingdomExp: 5+ yrsRemote
Remuneration
Not specified
Location
London, ENG, United Kingdom
Visa sponsorship
Not specified

Job summary

Seeking an experienced Site Reliability Engineer (SRE) / Observability Technical Lead to drive observability and reliability projects.

Benefits

Tailored benefits for wellbeingContinuous growth opportunitiesFlexible work options

Qualifications

  • Expertise with APM tools like New Relic, Datadog, AppDynamics, or Dynatrace.
  • Experience with OpenTelemetry for distributed tracing.
  • Proficiency in Infrastructure as Code using Terraform.
  • Understanding of cloud platforms like AWS, GCP, or Azure.
  • Experience with automation tools like Ansible, Chef, or Puppet.
  • Knowledge of CI/CD pipelines and tools like GitHub Actions, Jenkins, or Azure DevOps.
  • Experience managing Kubernetes and containerized environments.
  • Familiarity with log aggregation platforms like ELK Stack or Splunk.
  • Strong leadership and collaboration skills.

Responsibilities

  • Lead development and management of observability and reliability frameworks.
  • Design and implement monitoring and observability solutions.
  • Manage Infrastructure as Code (IaC) initiatives using Terraform.
  • Drive automation strategies for monitoring and logging pipelines.
  • Develop and maintain observability roadmaps.
  • Collaborate with product management and sales teams for technical support.
  • Lead teams to enhance CI/CD pipelines and deployment reliability.
  • Engage with vendors to evaluate and integrate observability solutions.
  • Mentor junior engineers and analysts.

Skills

AnsibleAppDynamicsAWSAzureAzure DevOpsChefDatadogDockerDynatraceGCPGitHubGitHub ActionsHelmJenkinsKubernetesNew RelicOpenTelemetryPuppetSplunkTerraform

Relocation

No