Remote Staff Software Engineer - Compute Reliability and Efficiency
Posted
This job is closed
This job post is closed and the position is probably filled. Please do not apply.
π€ Automatically closed by a robot after apply link
was detected as broken.
Description:
The Staff Software Engineer position at Reddit focuses on lower-level (Linux and Kubernetes) systems engineering within the Compute Reliability and Efficiency team.
The role involves working on intra-cluster engineering problems related to performance, efficiency, and stability.
Responsibilities include tasks such as detection of node-level performance characteristics, schedulers for resource packing, Kubernetes integrations, and cluster upgrades.
The position requires collaborating with a team of software engineers to maintain Reddit's infrastructure platform and improve its availability, scalability, latency, and efficiency.
The Staff Software Engineer will be involved in performance and reliability analysis on Reddit's Linux-based Kubernetes fleet and contribute to the technical and strategic direction of the compute platform.
Requirements:
7+ years of experience in infrastructure domain with a focus on lower-level systems like Linux.
Proficiency in Go (Preferred), Rust, or Python programming languages.
Understanding of kernel primitives, CPU scheduling, userspace concerns, and packet processing.
Experience with Kubernetes or similar distributed systems development.
Strong troubleshooting skills from higher-level orchestration to lower-level runtime concerns.
Ability to design large systems, scope work, and collaborate effectively with other engineers.
Excellent communication skills to work with a service-oriented team and company.