Please, let Swan Bitcoin know you found this job
on RemoteYeah.
This helps us grow 🌱.
Description:
The Site Reliability Engineer (SRE) at Swan will work closely with the development team, CTO, and cloud/infra engineers to develop and operate a robust and scalable platform supporting Swan’s business lines.
Responsibilities include establishing the SRE program, guiding engineers in observability, logging, monitoring, and capacity planning, creating consistent and well-monitored systems, helping with high-priority development projects, working closely with the security team, and providing mentorship for other SREs.
The role involves setting up onboarding processes for new on-call engineers, determining appropriate response times for alerts, scoping work discovered in on-call response, and collaborating with the security team on incident response.
Skills and experience required include familiarity with Datadog, Postgres databases, AWS RDS, HA architectures in AWS, DNS, SSL, AWS networking, Docker, ECS, security principles in the cloud, and the AWS Well Architected Framework.
The ideal candidate should be cool under pressure, able to manage incidents involving multiple systems, communicate effectively, and occasionally take pager alerts during working hours and weekends.
Requirements:
Experience with Datadog or similar tools for setting up monitors, alerting systems, anomaly management, and forecasting.
Medium to advanced level understanding of Postgres databases, including optimizing SQL queries and knowledge of AWS RDS.
Excellent understanding of HA architectures in AWS.
Mid-level knowledge of DNS, SSL, AWS networking, Docker, and ECS.
Working knowledge of security principles in the cloud and familiarity with the AWS Well Architected Framework.
Ability to manage incidents under pressure, communicate effectively, and occasionally take pager alerts during working hours and weekends.
Benefits:
Opportunity to work with a passionate and fully distributed startup team at Swan.
Chance to be involved in high-priority development projects and establish the SRE program.
Mentorship opportunities for other SREs and on-call team members.
Collaboration with the security team on incident response and security concerns.
Exposure to a flat organizational structure that encourages leadership and product involvement.
Involvement in the Bitcoin community through various activities like writing, podcasts, conferences, and open-source projects.
Apply now
Please, let Swan Bitcoin know you found this job
on RemoteYeah
.
This helps us grow 🌱.