Platform Engineer

End Clothing · Newcastle upon Tyne, ENG, United Kingdom
Newcastle upon Tyne, ENG, United Kingdom · Onsite
Remuneration
Competitive salary
Location
Newcastle upon Tyne, ENG, United Kingdom
Visa sponsorship
No visa sponsorship
Employment is conditional upon having the legal right to work in the UK for the role offered.

Job summary

The Platform Engineer will be responsible for the reliability, security, performance, and day-to-day operational excellence of the Shopify Plus platform and its critical integration ecosystem. This role involves leading observability, incident response, stability, problem management, and operational governance to ensure safe and predictable trading. The engineer will also take ownership of vendor-delivered CI/CD pipelines, maintaining release controls, access, configuration, and continuous improvement within Shopify's SaaS constraints.

Benefits

  • 30 days holiday (including bank holidays)
  • Flexible working
  • Holiday trading (buy or sell 3 days)
  • Your birthday off
  • Access to Employee Assistance Programme
  • Healthcare Cashback Plan
  • Moments that matter gifts (weddings and babies)
  • A pension that both you and the company contribute to
  • Generous staff discount
  • Opportunities for professional development and career progression

Qualifications

  • Platform/SRE/operations experience supporting high-traffic ecommerce, with strong incident management and operational discipline
  • Experience maintaining CI/CD pipelines delivered by others (e.g., GitHub Actions/Azure DevOps): permissions, secrets, environment configs, release controls/quality gates, and troubleshooting
  • Strong observability and reliability engineering experience (New Relic/Sentry/Datadog/etc.): defining SLIs/SLOs, improving alert quality, incident analysis, RCA/problem management, and measurable MTTR/incident reduction
  • Strong integration operations experience across iPaaS/ERP flows: queues/backpressure, rate-limiting, idempotency, retries/DLQs, replay/backfill, reconciliation, and data integrity monitoring
  • Practical understanding of operating within Shopify Plus SaaS constraints (limited edge/WAF control) and implementing controls in the layers you do own (apps, services, integrations, tooling, processes, vendor escalation)
  • Experience operating/supporting D365 environments (Dev/Sandbox, UAT, Prod): environment management, release coordination, upgrade rehearsals/testing in lower environments, and go/no-go support
  • Strong governance capability: access reviews (joiner/mover/leaver), least privilege, audit evidence, and secure operational practices across Shopify/Patchworks/D365 and monitoring tools
  • Experience with D365 licensing and capacity/storage management, plus cost observability and optimisation
  • Strong stakeholder management under pressure: clear communications, structured updates, escalation with evidence packs, and calm coordination during peak/trading events
  • Strong Shopify Plus operational knowledge: Admin API/GraphQL + webhooks, rate-limit strategies, bulk operations, and navigating Shopify Plus support/escalation processes
  • Patchworks (or similar iPaaS) experience: mapping/versioning, error handling patterns, replay/reprocessing, and operational reporting
  • Hands-on D365 platform administration: environment refreshes, release validation, upgrade rehearsals, role/security model familiarity, and capacity/storage optimisation
  • Infrastructure-as-Code familiarity (Terraform or equivalent) for the integration/runtime layer (cloud resources, queues, secrets, monitoring as code)
  • Experience with SLO tooling and incident/problem management practices (post-incident reviews, stability backlogs, automation/runbook maturity)
  • Experience implementing synthetic monitoring and RUM for ecommerce journeys, including peak-readiness monitoring uplift

Responsibilities

  • Own the reliability, security, performance, and day-to-day operational excellence of the Shopify Plus platform and its critical integration ecosystem (Shopify, Patchworks, D365)
  • Lead observability, incident response, stability and problem management, and the operational governance that keeps trading safe and predictable
  • Take BAU ownership of vendor-delivered CI/CD pipelines—maintaining release controls, access and configuration, and continuous improvement—while operating effectively within Shopify's SaaS constraints (Shopify-managed hosting/CDN/WAF, with clear escalation paths where required)
  • Maintain vendor-delivered pipelines for themes/apps/integration services; manage access, configurations, release controls, and quality gates; ensure pipelines remain reliable, secure, and well-documented
  • Own the change calendar, release readiness checks, environment parity checks, and exception processes during freeze/peak windows
  • Understand Shopify-managed protections and operate the process to engage Shopify Plus Support (e.g., bot protection schedules) when required
  • Define and maintain dashboards/alerts across storefront experience (RUM/synthetics where available) and integration services; ensure actionable alerting, clear ownership, and ongoing tuning
  • Define ownership boundaries, SLIs/SLOs (availability, latency, data freshness), alert thresholds, and regular reporting for platform and integration services
  • Lead triage, stakeholder communications, mitigation, RCA and preventative actions; maintain a stability backlog and drive repeat-incident themes to closure to improve MTTR and reduce recurrence
  • Take operational ownership of Shopify APIs/webhooks and Patchworks/D365 flows: rate-limit-safe patterns, idempotent processing, retries/DLQs, replay tooling, reconciliation, and data correctness checks
  • Implement operational controls for key flows (orders, inventory, pricing, fulfilment) including mismatch detection, audit trails, and repeatable replay/backfill procedures
  • Oversee Dev/Sandbox, UAT and Production environments (access, refresh cadence, config hygiene, and release readiness)
  • Manage D365 version upgrades by testing in lower environments first; coordinate regression validation and go/no-go decisions
  • Manage user/licence reviews and access governance; monitor and optimise storage/capacity; track and report cost/consumption drivers and optimisation actions
  • Manage secrets, least privilege, auditability, dependency hygiene, and secure configuration for apps/services; coordinate security reviews and remediation plans
  • Develop trading-event readiness plans (monitoring uplift, incident playbooks, change risk controls/freeze coordination, validation checklists) and business continuity/degraded-mode procedures for integration failures
  • Manage Shopify Plus / Patchworks / D365 support escalations with evidence packs (logs, timelines, impact) and track actions to closure
  • Run periodic access reviews across Shopify apps, Patchworks, D365, and monitoring tooling; maintain joiner/mover/leaver processes and audit evidence
  • Maintain runbooks, operational standards, and “how-to-operate” documentation; ensure onboarding for Engineering/Support is clear and current

Skills

Azure, Azure DevOps, Datadog, GitHub, GitHub Actions, GraphQL, New Relic, Sentry, Terraform, Windows

Work schedule

40 Hours per week - Monday to Friday

Relocation

No