Jobs / Int***
Senior Observability Engineer
Int*** · PA, United States
Visa sponsorship details are locked. Unlock company name and apply link with .
PA, United StatesExp: 5-7 yrsOnsite
Remuneration
Not specified
Location
PA, United States
Visa sponsorship
Sponsors visa
Job summary
Seeking a Senior Observability Engineer to administer and maintain observability tools like Splunk, AppDynamics, and Zenoss, ensuring optimal performance and reliability of IT systems.
Qualifications
- Minimum of 5–7 years in Observability/Monitoring/Site reliability engineering
- Proven experience in implementing, managing and maintaining observability tools
- Proficiency in Splunk and AppDynamics
- Proficiency in Zenoss
- Strong in MELT, Metrics, Events, Logs and Traces
- Hands-on troubleshooting and support
- Experience with OpenTelemetry instrumentation patterns
- Maintain platform reliability, upgrades, patching, and security hardening
- Exposure to Kubernetes observability
- Strong knowledge of IT infrastructure, applications, and networking
- Experience with scripting and automation tools
- Familiarity with cloud environments
- Excellent problem-solving and analytical skills
- Strong communication and collaboration abilities
- Ability to work independently and in a team-oriented environment
- Experience with other monitoring and observability tools
- Knowledge of DevOps practices and CI/CD pipelines
- Hands-on Infrastructure-as-Code and Git-based workflows
Responsibilities
- Administer and configure Splunk, AppDynamics, OTEL and Zenoss platforms
- Perform regular updates, patches, and upgrades to observability tools
- Continuously monitor the health and performance of the Splunk, APPD and Zenoss systems
- Ensure data integrity and availability within the observability platforms
- Provide support to internal users, assisting with troubleshooting and resolving issues
- Develop and deliver training sessions for users
- Create and manage dashboards, reports, and alerts
- Work with stakeholders to define monitoring requirements
- Manage onboarding and alert creation
- Optimize system performance by tuning configurations
- Maintain comprehensive documentation of configurations, processes, and procedures
- Develop and enforce best practices for monitoring and observability
- Collaborate with IT and DevOps teams
- Participate in incident response efforts
Skills
AnsibleAppDynamicsAWSAzureBashGitGrafanaKubernetesOpenTelemetryPrometheusPythonSplunkTerraform
Degrees
Bachelor's degree in Computer ScienceInformation TechnologyRelated field
Relocation
No