Remote Site Reliability Engineer (SRE)

Please, let PayPay know you found this job on RemoteYeah. This helps us grow 🌱.

Description:

  • As the senior Site Reliability Engineer (SRE), you will lead a small team focused on maintaining a robust observability pipeline across EKS clusters.
  • You will establish a culture of reliability throughout the organization and manage growth sustainably while delivering customer and engineer satisfaction.
  • Key responsibilities include leading the engineering team in the architecture, implementation, and optimization of the observability platform built on VictoriaMetrics, OpenTelemetry, Quickwit, and ClickHouse.
  • You will develop and maintain a deep understanding of network and cloud infrastructure to enable effective troubleshooting and incident response.
  • Collaboration with other engineering teams is essential, providing guidance on reliability, performance, and efficiency.
  • You will automate incident response to address issues before they impact customers, and you will mentor the engineering team to develop its technical skills.
  • Driving continuous improvement of observability tools and practices to enhance visibility and reliability across the organization is a key part of the role.
  • You will communicate with stakeholders to explain complex technical concepts.

Requirements:

  • A minimum of 5 years of experience as a Site Reliability Engineer or Tech Lead is required.
  • You must have at least 5 years of experience in AWS and EKS.
  • Several years of experience designing, implementing, and operating large-scale observability systems with VictoriaMetrics is necessary.
  • An understanding of cloud architecture at the level of a senior Infrastructure Engineer, particularly on AWS and including AWS/EKS network infrastructure, is required.
  • Proficiency in one or more programming languages such as Python, Go, or Rust is essential.
  • Strong problem-solving and troubleshooting skills to quickly identify and resolve complex issues are necessary.
  • A passion for continuously improving observability practices and driving innovation is expected.

Benefits:

  • The position is full-time, with the flexibility to work from anywhere in Japan.
  • You will enjoy super flex time with no core hours, typically working from 10:00 am to 6:45 pm.
  • Holidays include every Saturday, Sunday, national holidays in Japan, New Year's break, and company-designated special days.
  • Paid leave includes annual leave (up to 14 days in the first year) and personal leave (5 days each year).
  • The annual salary is paid in 12 installments and is based on skills, experience, and abilities, with annual reviews and a special incentive based on company performance.
  • Additional benefits include social insurance, a 401K plan, translation/interpretation support, and visa sponsorship with relocation support.