This job post is closed and the position is probably filled. Please do not apply.
🤖 Automatically closed by a robot after apply link
was detected as broken.
Description:
The Observability team at Reddit is seeking a Staff Software Engineer who excels at the intersection of infrastructure and software development.
The team manages a suite of tools focused on enabling engineers to comprehend their creations, primarily utilizing open-source solutions at scale.
Responsibilities include working on monitoring, logging, and distributed tracing systems, dealing with billions of data points, and addressing unique challenges of scale.
The role involves performance engineering, product innovation, and enhancing reliability and scalability of logging systems.
The team is currently developing a tracing product based on OTEL, Clickhouse, and Grafana for internal use at Reddit.
The position offers the opportunity to work on challenging infrastructure and software engineering problems that directly impact millions of users worldwide.
Requirements:
Minimum 7 years of experience in developing internet-scale software, preferably in an infrastructure context.
Familiarity with distributed systems development, with a bonus for experience with tools like Prometheus, Thanos, Grafana, Vector, Clickhouse, Otel, and Loki.
Proficiency in developing on top of Kubernetes or similar distributed systems, with Kubernetes controller or operator development experience being a significant advantage.
Strong troubleshooting skills in both systems and software.
Experience in engineering large systems, project management, and self-driven project execution.
Excellent communication skills to collaborate effectively with a service-oriented team and company.