Jobs / The Hartford

Principal AI Engineer - Agent Ops / SRE

The Hartford · Columbus, OH, United States
Columbus, OH, United StatesFull timeExp: 10+ yrs168,400-220,000 USD/yearlyRemote
Remuneration
short-term or annual bonuses, long-term incentives, and on-the-spot recognition
Location
Columbus, OH, United States
Visa sponsorship
No visa sponsorship
Candidates must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position

Job summary

The Hartford is seeking a Principal AI Engineer - Agent Ops/SRE to join their applied AI COE Team. This role involves supporting data scientists and AI solution engineers in building, deploying, and maintaining AI-COE products, ensuring reliability and efficiency. The position focuses on driving efficiency across the AI delivery lifecycle (AgentOps) and applying strong software and systems engineering practices to scale and operate AI systems (SRE).

Benefits

Short-term or annual bonusesLong-term incentivesOn-the-spot recognition

Qualifications

  • Bachelor's degree in Computer Science, Computer Engineering, or a technical field.
  • 10+ years building and shipping software and/or platform solutions for enterprises.
  • Programming experience with Python is required.
  • 3+ years of experience with IAC (Terraform).
  • 5+ years of experience owning production CICD, GitOps and release management gating.
  • 3+ years of experience in implementing observability, performance & reliability solutions: SLO, P99-95 latency, alert tuning, & dashboards.
  • Experience with AI observability/monitoring tools such as Dynatrace, Splunk, Arize & OpenTelemetry/OpenInference is required.
  • Proven experience with Google's Gemini Enterprise Agent platform is a plus.
  • Experience with GKE/Docker/Registry is a plus.
  • Proven experience in working with other cloud providers such as AWS cloud is a plus.
  • Experience with Automated Testing, Automated Deployments, Agile methodologies, Unit Testing, and Integration Testing tools.
  • Conversational UX/UI design (multi-turn chatbots) and Human-Agent-Interaction (HAI) is a plus.
  • Experience with IR, vector embedding, and Hybrid/Semantic search technologies.
  • Experience with LLM orchestration frameworks like Langchain, LlamaIndex, LangSmith, LangGraph, Google Agent Development Kit, is a plus.
  • Experience with Generative AI Guardrails, responsible AI, adversarial attack mitigation, and red teaming is a plus.
  • Foundational understanding of Natural Language Processing and Deep Learning.
  • Excellent problem-solving skills and the ability to work in a collaborative team environment.
  • Excellent communication skills.

Responsibilities

  • Serve as technical liaison between AI COE and Platform Engineering & Enterprise SRE teams.
  • Ensure AI systems meet requirements for performance, latency, throughput, resiliency, recovery, observability and reliability.
  • Partner with AI engineers, Applied AI Scientist, and AI Architects to design, build and maintain scalable, fault tolerant AI systems as per SLO.
  • Partner with Platform engineering team to design and implement CICD, GITOps, and IAC (Terraform) modules.
  • Ensure use of AgentOps NSA, standards, reference architecture and tooling.
  • Partner with enterprise release management and AI Governance team to build & deploy AI solutions using their platform tooling.
  • Support the entire AI lifecycle as per the standard work template.
  • Build standardized deployment templates, reference architecture, automation scripts, terraform modules, CICD pipelines, and operational runbooks for AI workloads.
  • Design and build IDP (Harness) catalogs, templates & pipelines partnering with enterprise platform engineering team.
  • Manage production systems to ensure enterprise SLOs are met.
  • Manage incident response for production systems, including triaging, escalating, RCA and implementing corrective actions.

Skills

AWSDockerDynatraceGKEHarnessOpenTelemetryPythonSplunkTerraformGit

Degrees

Bachelor's degree in Computer ScienceBachelor's degree in Computer EngineeringBachelor's degree in a technical field

Industry

Insurance

Relocation

No