[Remote] Site Reliability Engineer
Note: This is a remote position open to candidates in the USA. BayOne Solutions is seeking a Site Reliability Engineer to improve stability and reliability for its clients. The role combines systems engineering and data science to develop tools and strategies for incident management.
Responsibilities
- We are looking for a Site Reliability Engineer with experience in incident response
- In this role, you will help the client understand where we can improve stability and reliability
- There will be a focus on the intersection of systems engineering and data science, building the tooling and culture necessary to transform raw incident logs into actionable reliability strategies
Skills
- 4+ years in SRE, DevOps, or Systems Engineering roles managing production environments at scale
- Strong experience with SQL and data analysis
- Expertise in one or more programming languages such as Golang, Java, Python, or C++
- Deep understanding of alerting systems, distributed tracing, structured logging, and metrics collection
- Experience with container orchestration (Kubernetes) and cloud infrastructure (GCP)