This job post is closed and the position is probably filled. Please do not apply.
π€ Automatically closed by a robot after apply link
was detected as broken.
Description:
The Observability team at Reddit is seeking a Staff Software Engineer who excels at the intersection of infrastructure and software development.
The team manages a suite of tools for engineers to comprehend their creations, primarily utilizing open-source solutions at scale such as Prometheus, Thanos, Grafana, Vector, and more.
Responsibilities include working on monitoring, logging, and distributed tracing systems, addressing challenges of scale and performance engineering.
The role involves collaborating with a team of software engineers to enhance Reddit's infrastructure platform, improve observability components, and contribute to the technical and strategic direction of eventing at Reddit.
Automation of critical aspects of the event-driven development process and sharing on-call responsibilities are part of the day-to-day tasks.
The position offers the opportunity to directly impact hundreds of millions of users globally and contribute to shaping the future of Reddit.
Requirements:
7+ years of experience in developing internet-scale software, preferably in an infrastructure context.
Familiarity with distributed systems development, with a bonus for knowledge of tools like Prometheus, Thanos, Grafana, Vector, Clickhouse, Otel, and Loki.
Experience in developing on Kubernetes or similar distributed systems, with a significant advantage in Kubernetes controller or operator development.
Strong troubleshooting skills in systems and software, along with experience in engineering large systems, tracking work, and being proactive on projects.
Excellent communication skills to collaborate effectively with a service-oriented team and company.