Back to Jobs

[Remote] Lead Site Reliability Engineer

Remote, USA Full-time Posted 2026-06-16

Note: The job is a remote job and is open to candidates in USA. Gifthealth is revolutionizing healthcare by simplifying the management of prescriptions and health services. The Lead Site Reliability Engineer (SRE) is responsible for building reliable, scalable software systems and implementing DevOps practices to enhance application performance and resilience.


Responsibilities

  • Designs, builds, and maintains reliable, scalable software systems supporting Ruby on Rails applications
  • Embs reliability, performance, and operational best practices into application code and development workflows
  • Owns DevOps practices including CI/CD reliability, deployment strategies, and release safety
  • Leads incident response, debugging, and root cause analysis across application and platform layers
  • Implements and evolves observability (logging, metrics, tracing) within application and service code
  • Partners with engineering teams on architecture, capacity planning, and technical standards

Skills

  • Bachelor's degree in computer science, engineering, or related field OR equivalent professional experience in software engineering, SRE, or DevOps roles
  • 5+ years of experience in software engineering, SRE, or DevOps roles
  • Hands-on experience building and operating Ruby on Rails applications in production
  • Experience in owning production incidents and application-level reliability
  • Knowledge of Ruby on Rails application architecture and production operations; software reliability engineering principles (SLOs, SLIs, error budgets); and modern DevOps and CI/CD practices
  • Strong software engineering skills (Ruby and/or comparable backend languages)
  • Debugging and performance optimization of production applications skills
  • CI/CD pipelines, deployment automation, and release tooling skills
  • Monitoring and observability tooling (Datadog, New Relic, Prometheus, etc.) skills
  • Ability to write production-quality code that improves system reliability
  • Ability to collaborate with product and engineering teams to influence design decisions
  • Ability to troubleshoot complex, cross-system failures
  • Cloud platform certifications (AWS, GCP, Azure)
  • SRE or DevOps-focused certifications
  • Experience in high-growth or scaling engineering organizations
  • Experience working in regulated or customer-impact–sensitive environments
  • Knowledge of security and compliance considerations in production systems
  • Infrastructure as Code (Terraform or similar) skills
  • Containerization and orchestration (Docker) skills
  • Ability to mentor engineers on operational ownership and reliability practices
  • Ability to balance speed of delivery with long-term system health

Company Overview

  • GiftHealth is a healthcare tech startup that streamlines pharmacy experience with free delivery and competitive medication pricing. It was founded in 2020, and is headquartered in Columbus, Ohio, USA, with a workforce of 501-1000 employees. Its website is https://www.gifthealth.com.

  •   Apply To This Job

    Similar Jobs