Please, let PayPay know you found this job
on RemoteYeah.
This helps us grow 🌱.
Description:
As the senior Site Reliability Engineer (SRE), you will lead a small team focused on maintaining a robust observability pipeline across EKS clusters.
You will establish a culture of reliability throughout the organization and manage growth sustainably while delivering customer and engineer satisfaction.
Key responsibilities include leading the engineering team in the architecture, implementation, and optimization of the platform built on Victoria Metrics, OpenTelemetry, Quickwit, and ClickHouse.
You will develop and maintain a deep understanding of network and cloud infrastructure to enable effective troubleshooting and incident response.
Collaboration with other engineering teams is essential, providing guidance on reliability, performance, and efficiency.
You will automate incident response to proactively address issues before they impact customers and mentor the technical skills of the engineering team.
Driving continuous improvement of observability tools and practices to enhance visibility and reliability across the organization is a key part of the role.
You will communicate with stakeholders to explain complex technical concepts.
Requirements:
A minimum of 5 years of experience as a Site Reliability Engineer or Tech Lead is required.
You must have at least 5 years of experience in AWS and EKS.
Several years of experience in designing, implementing, and operating large-scale observability with Victoria Metrics is necessary.
A senior Infrastructure Engineer level of understanding of cloud architecture, particularly on AWS, and AWS/EKS network infrastructure is required.
Proficiency in programming one or multiple languages such as Python, Go, or Rust is essential.
Strong problem-solving and troubleshooting skills to quickly identify and resolve complex issues are necessary.
A passion for continuously improving observability practices and driving innovation is expected.
Benefits:
The position offers a full-time employment status with the flexibility to work from anywhere in Japan.
You will enjoy super flex time with no core hours, typically working from 10:00 am to 6:45 pm.
Holidays include every Saturday, Sunday, national holidays in Japan, New Year's break, and company-designated special days.
Paid leave includes annual leave (up to 14 days in the first year) and personal leave (5 days each year).
The salary is paid annually in 12 installments, based on skills, experience, and abilities, with annual reviews and a special incentive based on company performance.
Additional benefits include social insurance, a 401K plan, translation/interpretation support, and VISA sponsorship with relocation support.
Apply now
Please, let PayPay know you found this job
on RemoteYeah
.
This helps us grow 🌱.