This job post is closed and the position is probably filled. Please do not apply.
🤖 Automatically closed by a robot after apply link
was detected as broken.
Description:
Replicant is seeking a skilled Site Reliability Engineer to enhance the infrastructure and systems that support the company’s growth.
The role involves ensuring the smooth operation and high availability of Replicant's production systems.
Responsibilities include monitoring system performance, identifying bottlenecks, and implementing optimizations to improve reliability and efficiency.
The engineer will develop and maintain tools and automation to prevent and quickly resolve incidents.
Collaboration with engineering teams is essential to improve the reliability and scalability of applications and infrastructure.
Participation in on-call rotation is required to address production issues and ensure service uptime.
The engineer will contribute to infrastructure design and implementation, focusing on scalability, security, and cost-effectiveness.
Staying updated on industry best practices and emerging technologies in SRE and DevOps is expected.
Requirements:
Candidates must have proven experience in managing and troubleshooting complex, distributed systems in a production environment.
A strong understanding of cloud platforms, preferably GCP, and containerization technologies like Kubernetes is required.
Proficiency in scripting languages and automation tools, such as Python, Bash, and Terraform, is necessary.
Experience with monitoring and observability systems, including Datadog and Prometheus, is essential.
Excellent problem-solving skills and a proactive approach to identifying and mitigating potential issues are required.
Strong communication and collaboration skills are necessary to work effectively in a team environment.
A passion for ensuring the reliability and performance of critical systems is expected.
Bonus points for experience with CI/CD pipelines, infrastructure-as-code practices, knowledge of networking concepts, familiarity with security best practices for cloud-based systems, and familiarity with telephony applications.
Benefits:
Replicant offers a remote working environment that respects time zone differences.
Employees receive highly competitive salaries, equity, and for US employees, a 401(k) plan.
Top-of-the-line healthcare benefits, including medical, vision, and dental coverage, are provided.
A health and wellness perk is included to support employee well-being.
An equipment stipend is available to ensure employees have the necessary tools for their work.
The company has a flexible vacation policy to promote work-life balance.
Employees can enjoy amazing team trips and offsites, fostering a strong team culture.
After 4.5 years of service, Replicants are eligible for a 5-week sabbatical.