MLOps Engineer (LLM/GenAI)
HSBC · Sheffield, ENG, United Kingdom
Sheffield, ENG, United Kingdom · Remote
Remuneration
Not specified
Location
Sheffield, ENG, United Kingdom
Visa sponsorship
Not specified
Job summary
HSBC is seeking an MLOps Engineer (LLM/GenAI) to engineer production-grade infrastructure for modern AI, focusing on hosting LLMs and optimising inference performance.
Qualifications
- Extensive experience in building AI platforms and fine-tuning pipelines
- Strong Python and CUDA engineering skills
- Deep inference optimisation expertise
- Production hosting experience with Docker/Kubernetes and cloud platforms
- End-to-end fine-tuning expertise
Responsibilities
- Design, build, and operate scalable model hosting platforms for LLMs, embeddings, and STT/TTS
- Optimise inference for latency, throughput, and cost
- Evaluate and integrate inference frameworks to maximise performance
- Monitor inference health and troubleshoot issues
- Build end-to-end fine-tuning pipelines and integrate models
Skills
AWS, Azure, Docker, GCP, Kubernetes, Python
Relocation
No