Jobs / Group1001

DWX - Site Reliability & Automation Engineer

Group1001 · Zionsville, IN, United States
Zionsville, IN, United StatesExp: 7+ yrs180,000-230,000 USD/yearlyRemote
Remuneration
180,000-230,000 USD/yearly
Location
Zionsville, IN, United States
Visa sponsorship
Not specified

Job summary

Group 1001 is seeking a DWX Site Reliability & Automation Engineer to focus on automation, efficiency, and AI enablement. This role involves identifying high-leverage opportunities for automation, building end-to-end automated workflows, and safely deploying AI tools across the DWX team. The engineer will also drive reliability as a discipline by implementing SRE practices and partnering with various teams to enhance effectiveness.

Benefits

Comprehensive health insuranceDental insuranceVision insuranceBasic Life InsuranceSupplemental Life InsuranceShort-Term DisabilityLong-Term DisabilityEmployee Assistance ProgramWellness programs401K plan with matching contributions

Qualifications

  • 7+ years in SRE, platform engineering, or DevOps roles.
  • Experience in regulated environments (L&A, insurance, or financial services preferred).
  • Proven ability to build and run automation at scale.
  • Expert command of Infrastructure-as-code (Terraform, Bicep, ARM or equivalent).
  • Expert command of CI/CD pipelines (Azure DevOps, GitHub Actions) with experience in testing, gating, and progressive delivery.
  • Proficiency in scripting and development (PowerShell, Python, and at least one general-purpose language for production code).
  • Strong understanding of Git, branching strategies, code review discipline, and repository hygiene.
  • Deep knowledge of cloud platforms (Azure deeply; AWS or GCP a plus).
  • Expertise in observability tools (Azure Monitor, Log Analytics, KQL, Application Insights or comparable).
  • Experience with API and integration work (Graph API, REST, webhooks, event-driven patterns).
  • Hands-on experience deploying LLM-powered workflows in enterprise environments.
  • Understanding of agentic systems, MCP, retrieval-augmented patterns, and AI safety practices.
  • Experience with AI governance (prompt safety, data classification, output validation, audit trails).
  • Demonstrated ability to drive digital transformation from ticket-driven to product-driven operations, and reactive to proactive.
  • Ability to build business cases for technical solutions.
  • Preferred: Experience implementing SRE practice from scratch.
  • Preferred: Background in platform engineering (building internal developer platforms, golden paths, paved roads).
  • Preferred: AI/ML engineering experience beyond prompt-level work (fine-tuning, evaluation, RAG architectures, agent frameworks).
  • Preferred: Certifications such as Azure Solutions Architect Expert, Azure DevOps Engineer Expert, HashiCorp Terraform Associate, or equivalent.
  • Preferred: Experience with ServiceNow scripting and flow designer.

Responsibilities

  • Identify high-leverage automation opportunities by observing work, mining ticket data, and finding patterns.
  • Quantify toil, define error budgets, and establish SLOs.
  • Engineer end-to-end automated workflows using IaC (Terraform, Bicep), CI/CD (Azure DevOps, GitHub Actions), configuration-as-code, and orchestration platforms (Logic Apps, Power Automate, ServiceNow flows, custom services).
  • Ensure code is production-grade, version-controlled, peer-reviewed, observable, and resilient.
  • Deploy AI tools (Copilot, Claude, internal LLM platforms, agentic systems) securely and compliantly.
  • Define patterns for AI-assisted troubleshooting, AI-augmented runbooks, prompt libraries, agent workflows, and guardrails for Cybersecurity, Legal, and Compliance.
  • Implement SRE practices including meaningful SLIs and SLOs, proactive observability, and post-incident learning.
  • Apply chaos and resilience thinking to the digital workplace.
  • Partner with the Senior Support Manager on problem management to convert recurring incidents into automation backlog.
  • Collaborate with Solutions Engineers, Operations Engineers, Microsoft Engineer, Cybersecurity, Networks, Architecture, Product Management, and Business Technology to amplify their effectiveness.

Skills

AWSAzureAzure DevOpsAzure MonitorBicepGCPGitGitHubGitHub ActionsPowerShellPythonRESTServiceNowTerraform

Certifications

Azure Solutions Architect ExpertAzure DevOps Engineer ExpertHashiCorp Terraform Associate

Industry

InsuranceFinancial services

Relocation

No