DevOps Engineer

Benzinga · United States · Remote
Remuneration: Not specified
Location: United States · Remote
Visa sponsorship: Not specified

Job summary

Benzinga is seeking a DevOps Engineer who thinks deeply about systems and their behavior. The role involves defining, building, and maintaining robust, observable, and scalable infrastructure, and collaborating with development teams on seamless integration and deployment. The engineer will focus on automating tasks and applying development practices such as test-driven development and continuous integration to create a reliable, high-performing platform.

Qualifications

  • Strong communication skills.
  • Self-motivated with strong organizational skills.
  • Experience with AWS/GCP, Kubernetes, Terraform, CI/CD, OpenSearch/Elasticsearch, Postgres, MySQL, Kafka, BigQuery, Python, NodeJS, Go, Java, Prometheus, Grafana, Coralogix, Varnish, Nginx, Kong.
  • Ability to reason about software, algorithms, and performance.
  • Experience thinking about systems, including edge cases, failure modes, behaviors, and implementations.
  • Experience with distributed systems and understanding of modern web stacks.
  • Proficiency with a *nix shell.
  • Ability to discover and patch SQLi, XSS, CSRF, SSRF, authentication and authorization flaws, and other web-based security vulnerabilities.
  • Knowledge of common authentication technologies including OAuth, SAML, CAs, OTP/TOTP.

Responsibilities

  • Radiate knowledge about service infrastructure and reliability to the development team.
  • Identify and resolve system scalability issues.
  • Plan the growth of Benzinga's infrastructure.
  • Document actions to create repeatable processes and automation.
  • Improve the deployment process.
  • Define, provision, and manage production infrastructure using Kubernetes, cloud-native serverless, and Terraform.
  • Proactively identify and reduce security risks, aligning with SOC2 auditing.
  • Develop security training and guidance for internal development teams.
  • Design, build, and maintain core infrastructure for scaling Benzinga.
  • Participate in on-call rotation for availability incidents and support.
  • Debug production issues across all services and stack levels.
  • Configure monitoring and alerting for symptoms, not outages.
  • Manage daily maintenance and evolution of Prometheus monitoring and alerting infrastructure.
  • Bundle Prometheus monitoring as an out-of-the-box solution.
  • Build and maintain the public monitoring gateway.
  • Migrate performance monitoring to Prometheus.
  • Improve performance monitoring coverage.
  • Create automated alerts for regressions.

Skills

AWS, Bash, BigQuery, Elasticsearch, GCP, Go, Grafana, Java, Kafka, Kong, Kubernetes, Linux, Make, MySQL, NGINX, Node.js, OpenSearch, PostgreSQL, Prometheus, Python, Terraform

Languages

Python, Node.js, Go, Java

Work schedule

On-call rotation

Relocation

No