Site Reliability Engineer, Platform (AWS experience mandatory)

Seattle (WA)
Posted 3 months ago

A strong candidate will

  • Be proficient in best practices for using AWS IaaS and PaaS offerings
  • Evaluate, build and maintain automation efforts for deploying and managing production services
  • Have expertise in security best practices for highly regulated industries
  • Be passionate about measurement, observability and early detection of issues using leading and lagging metrics
  • Provide critical, business-level analysis of serious customer-facing problems across the technology stack
  • Have strong interpersonal skills and act as a liaison between engineers and internal and external stakeholders
  • Evangelize and drive streamlining of processes for building and operating highly available, scalable services

Tools & Technologies

We use a wide and varied set of technologies that reflects the complexity of modern cloud development:

  • AWS and a wide range of its services (S3, RDS, ECS, Redshift)
  • Ansible & HashiCorp tools (Packer, Vault, Terraform, Consul, etc.), ELK, Datadog
  • VPN solutions of many flavors and vendors (OpenVPN, IPsec, etc.)
  • Python, Java, Ruby, React, Bash and a smattering of Go, plus RabbitMQ, SNS, and SQS for messaging
  • Polyglot storage (MySQL, MSSQL, PostgreSQL)
  • Large-scale data exchange platforms, including the Mirth Connect Interoperability Engine for HL7 v2/v3 and FHIR

Minimum Qualifications

  • 3+ years of experience building and managing AWS/GCP/Azure cloud service operations
  • Direct experience with hardening of environments in security-first software development and/or system operations
  • Solid expertise in Linux server administration
  • Proficiency with networking, preferably with foreign VPN-connected networks
  • Deep knowledge of enabling system and application level monitoring/alerting and metric analysis
  • Experience with at-scale integration with remote networks and foreign data flows
  • Experience developing DevOps tooling and supporting developer workflows and automation
  • Experience resolving real-world performance issues, and with tuning, scaling, and securing large-scale distributed cloud systems
  • Excellent written and verbal communication and reporting skills for coordinating across teams, with customers, and with technical vendors
  • Willing to be part of a 24/7 on-call rotation
  • Ability to demonstrate composed urgency in stressful situations

Nice to have:

  • Prior experience in regulated industries with high data quality and governance standards
  • Software testing (SDET) experience and a passion for analyzing failure

Must have authorization to work in the US.

Job Type: Full-time

Job Features

Job Type: Contract