Jobs / NTT DATA
Site Reliability Engineering (SRE) / Observability Technical Lead
NTT DATA · London, ENG, United Kingdom
London, ENG, United KingdomExp: 5+ yrsRemote
Remuneration
Not specified
Location
London, ENG, United Kingdom
Visa sponsorship
Not specified
Job summary
Seeking an experienced Site Reliability Engineer (SRE) / Observability Technical Lead to drive observability and reliability projects.
Benefits
Tailored benefits for wellbeingContinuous growth opportunitiesFlexible work options
Qualifications
- Expertise with APM tools like New Relic, Datadog, AppDynamics, or Dynatrace.
- Experience with OpenTelemetry for distributed tracing.
- Proficiency in Infrastructure as Code using Terraform.
- Understanding of cloud platforms like AWS, GCP, or Azure.
- Experience with automation tools like Ansible, Chef, or Puppet.
- Knowledge of CI/CD pipelines and tools like GitHub Actions, Jenkins, or Azure DevOps.
- Experience managing Kubernetes and containerized environments.
- Familiarity with log aggregation platforms like ELK Stack or Splunk.
- Strong leadership and collaboration skills.
Responsibilities
- Lead development and management of observability and reliability frameworks.
- Design and implement monitoring and observability solutions.
- Manage Infrastructure as Code (IaC) initiatives using Terraform.
- Drive automation strategies for monitoring and logging pipelines.
- Develop and maintain observability roadmaps.
- Collaborate with product management and sales teams for technical support.
- Lead teams to enhance CI/CD pipelines and deployment reliability.
- Engage with vendors to evaluate and integrate observability solutions.
- Mentor junior engineers and analysts.
Skills
AnsibleAppDynamicsAWSAzureAzure DevOpsChefDatadogDockerDynatraceGCPGitHubGitHub ActionsHelmJenkinsKubernetesNew RelicOpenTelemetryPuppetSplunkTerraform
Relocation
No