Remote Senior Site Reliability Engineer ( Remote - US)
Posted
Apply now
Please, let Jobgether know you found this job
on RemoteYeah.
This helps us grow π±.
Description:
Jobgether is seeking a Senior Site Reliability Engineer (SRE) to join a company in the United States.
The role involves scaling, securing, and improving the organization's cloud infrastructure.
The primary focus is to ensure the reliability and scalability of systems through proactive solutions and automation of infrastructure management.
Responsibilities include working closely with engineering and platform teams to enhance service reliability, managing Kubernetes clusters, and optimizing cloud resources.
The SRE will lead incident response, conduct post-incident reviews, and refine best practices for system performance and security.
Key accountabilities include owning system reliability initiatives, participating in on-call rotations, designing and managing Kubernetes clusters, architecting AWS infrastructure, automating infrastructure provisioning, enhancing observability, and conducting post-incident reviews.
Requirements:
Candidates must have a minimum of 5 years of experience in SRE, DevOps, or Infrastructure Engineering, showcasing strong ownership and problem-solving skills.
Proficiency in Kubernetes, Helm, and networking security practices is required.
In-depth experience with AWS services such as RDS, Aurora, VPC, EKS, EC2, and IAM is essential.
Expertise in PostgreSQL administration, including performance tuning and high availability management within AWS, is necessary.
Familiarity with CI/CD tools like GitHub Actions and ArgoCD, focusing on automation and security best practices, is expected.
A strong understanding and experience in Infrastructure as Code (IaC) using Crossplane and Terraform is required.
Experience in observability and monitoring with Datadog is necessary.
Proficiency in Python and Bash scripting for system automation and management is required.
Strong communication skills and the ability to collaborate effectively across engineering teams and document processes in Confluence are essential.
Benefits:
The position offers a competitive base salary and equity options.
Comprehensive health, dental, and vision coverage is provided for employees and their families.
Life insurance and mental wellness coverage are included.
Employees enjoy Flex Time Off (unlimited) in addition to company-paid holidays.
Paid family leave, medical leave, and bereavement leave policies are available.
Retirement saving plans are offered to help employees plan for the future.
A home office setup allowance is provided to customize the work environment.
An annual professional development stipend is available to support employee growth.
Flexible remote work options are offered, allowing for global team collaboration.
Apply now
Please, let Jobgether know you found this job
on RemoteYeah
.
This helps us grow π±.