Jobs / Collins Aerospace
Principal Site Reliability Engineer - ARINCDirect (Remote)
Collins Aerospace · Richmond, VA, United States · Remote
Richmond, VA, United StatesExp: 8+ yrs107,500-204,500 USD/yearlyRemote
Remuneration
107,500-204,500 USD/yearly
Location
Richmond, VA, United States · Remote
Eastern Daylight Time (UTC-4)
Visa sponsorship
No visa sponsorship
Must be authorized to work in the U.S. without the company’s immigration sponsorship now or in the future. The company will not offer immigration sponsorship for this position. The company will not seek an export authorization for this role.
Job summary
The ARINCDirect team within Collins Aerospace is seeking a Site Reliability Engineer (SRE) to automate and improve infrastructure reliability. This role focuses on infrastructure automation, release engineering, and continuous delivery, working with the Platform and Operations team. The SRE will be responsible for service availability, performance, monitoring, incident response, and capacity planning for ARINCDirect's commercial products.
Benefits
Medical insuranceDental insuranceVision insuranceThree weeks of vacation401(k) plan with employer matchingEmployer retirement contributionLifetime Income Strategy optionTuition reimbursement programStudent Loan Repayment ProgramLife insuranceDisability coveragePet insuranceHome and auto insuranceAdditional life and accident insuranceCritical illness insuranceGroup legalID theft protectionBirth leave benefitsAdoption leave benefitsParental leave benefits
Qualifications
- Degree in Science, Technology, Engineering, or Mathematics (STEM) with a minimum of 8 years of relevant experience, or an Advanced Degree with a minimum of 5 years of experience, or 12 years of relevant experience without a degree.
- Authorized to work in the U.S. without sponsorship.
- Experience as an SRE, Platform Engineer, or similar role in a Linux or UNIX environment with large, complex infrastructures using Docker and Kubernetes.
- Experience automating configuration and infrastructure with tools like Saltstack, Ansible, Terraform, or other declarative languages.
- Experience with hardware, including servers, network switches, and cabling.
- Preferred: Experience managing infrastructure using GitOps with continuous delivery (CD) pipelines.
- Preferred: Proficiency in Python, Linux Shell (bash, awk, sed).
- Preferred: Experience with PostgreSQL, RDBMS, and SQL.
- Preferred: Familiarity with Cloud infrastructure, ideally AWS.
- Preferred: Understanding of SRE principles, including building observability solutions and exposing metrics for SLOs and KPIs.
- Preferred: Understanding of IT infrastructure services such as DNS, DHCP, LDAP, NFS.
- Preferred: Understanding of network segmentation, routing, and VPNs.
Responsibilities
- Automate and improve infrastructure reliability to ensure resilience and reproducibility.
- Manage service availability, performance, monitoring, incident response, and capacity planning.
- Create, improve, and manage environments for data-driven resource allocation, problem identification, and capacity planning.
- Maintain physical infrastructure using Linux.
- Facilitate the adoption of Kubernetes and declarative infrastructure.
- Influence technology decisions and direction to support the ARINCDirect platform.
- Collaborate with SREs and other teams to design dependable and scalable solutions.
- Identify, implement, and champion process improvements for productivity, collaboration, and delivery efficiency.
- Participate in shared on-call rotation.
Skills
AnsibleAWSBashDockerKubernetesLinuxPostgreSQLPythonSaltStackTerraform
Degrees
ScienceTechnologyEngineeringMathematics
Work schedule
On-call rotation
Industry
AvionicsAerospace
Security clearance
None/Not Required
Relocation
No