Remote Senior DevOps Engineer (US Remote CT / ET timezone)
This job post is closed and the position is probably filled. Please do not apply.
Description:
SEON is seeking a Senior DevOps Engineer focused on Site Reliability Engineering (SRE) to join their team remotely in the US (CT/ET timezone).
The role involves maintaining and improving the reliability, scalability, and performance of the cloud infrastructure.
Responsibilities include implementing SRE best practices, developing monitoring and alerting systems, managing incident response, and conducting post-incident reviews.
The engineer will continuously monitor and optimize cloud infrastructure performance, automate routine tasks, and analyze system capacity for future growth.
The position requires defining, measuring, and monitoring SLOs and SLIs, collaborating with engineering and product teams, and maintaining documentation for architecture and troubleshooting.
On-call support is required to ensure continuous application and infrastructure availability, and the engineer is also responsible for maintaining security and compliance through regular audits.
Staying current with new technologies and industry trends is essential for evaluating their impact on infrastructure and reliability practices.
Requirements:
Candidates must have 8+ years of experience as a DevOps Engineer or in a similar software engineering role, with a focus on SRE principles and practices.
A strong ability to troubleshoot complex issues related to system resources or applications is required.
A proactive approach to identifying and resolving issues independently is essential, along with a strong problem-solving attitude.
Proficiency with Kubernetes and AWS EKS is preferred.
Expertise in Infrastructure as Code (Terraform) is necessary.
Extensive experience with high-performance, scalable, multi-region AWS infrastructure is required.
Strong experience with monitoring and logging tools such as Prometheus, Grafana, Elasticsearch, and Kibana is needed.
Proficiency with incident management tools like PagerDuty or Opsgenie is required for effective on-call management.
Familiarity with CI/CD pipelines and tools such as GitHub Actions or TeamCity is necessary.
Excellent communication and collaboration skills are essential for working effectively with cross-functional teams.
Benefits:
Employees will participate in an Employee Stock Ownership Plan (ESOP).
The position offers flexible working hours.
A generous holiday allowance is provided.
There are significant opportunities for learning and development.
Private health insurance is available, including coverage for dependents and mental health support.
Complimentary weekly language courses are offered.
Enhanced parental leave is part of the benefits package.