Jobs / JD.com
Site Reliability Engineer
JD.com · Berlin, BE, Deutschland
Berlin, BE, DeutschlandExp: 3+ yrsOnsite
Remuneration
Not specified
Location
Berlin, BE, Deutschland
Visa sponsorship
Not specified
Job summary
Ensure the stability of eCommerce mobile and web applications in European countries. Responsibilities include monitoring, incident management, automating deployments, scaling, reliability testing, and incident post-mortems. Collaborate with global engineering and commercial teams to maintain a seamless and reliable user experience.
Qualifications
- Bachelor's degree in Computer Science, Software Engineering, or a related field.
- 3+ years of experience in DevOps, site reliability engineering (SRE), system stability assurance, or operations and maintenance development.
- Experience with common public cloud platform products (e.g., cloud hosting, cloud storage, object storage, CDN).
- Proficiency in containerization technologies such as Docker and Kubernetes.
- Familiarity with Linux operating systems and common commands.
- Proficiency in scripting languages such as Shell, Python, or Go.
- Knowledge of mainstream monitoring tools such as Prometheus, Grafana, and Zabbix.
- Experience in Java microservices architecture development or operations.
- Expertise in Java memory tuning and performance optimization.
- Experience with common middleware including MySQL, Kafka, ElasticSearch, and Redis.
- Effective communication in English.
- eCommerce or retail industry experience (preferred).
- Intermediate or above proficiency in the Chinese language (preferred).
- 2+ years of experience designing, analyzing, and troubleshooting large-scale distributed systems (preferred).
Responsibilities
- Monitor system performance data and alerts, identify abnormal indicators and risks, and notify colleagues for resolution.
- Perform emergency system recovery and execute contingency plans to minimize business losses.
- Address and resolve system issues and queries from local stakeholders.
- Collaborate with team members in other regions to ensure the stability of the European e-commerce platform system.
- Summarize system issues and solutions to continuously improve system stability.
Skills
BashDockerElasticsearchGoGrafanaJavaKafkaKubernetesLinuxMySQLPrometheusPythonRedis
Relocation
No