Software Engineer III, Compute
Job Description
Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about.
With 100,000+ active communities and approximately 101M+ daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit redditinc. com. The Compute team is looking to hire a Software Engineer III that thrives at the intersection of infrastructure and software development.
This team’s challenges break into 2 domains, which we consider platform engineering and cluster engineering. Platform Engineering: Higher-level orchestration of both compute capacity and workload primitives to support our multi-cloud, multi-region, deployments.
A subset of current focuses include:Software automation that creates, manages, and destroys clusters in our fleet. APIs and controllers that support multi-cluster deployment and scheduling mechanics. Core SDKs that enable controller development in the larger organization.
Software that codifies out-of-cluster ancillary concerns such as network configurations and managed services. Cluster Engineering: Intra-cluster engineering problems involving balancing performance, efficiency, and stability. A subset of current focuses includeDetection of node-level performance characteristics and making availability decisions based on the data.
Schedulers that support more efficient packing of resources along with reactive rescheduling on the basis of changing compute availability. Kubernetes controllers that offer APIs in the cluster and perform reconciliation to reach a desired state.
Cluster upgrades, both mechanical process concerns and automation. As a member of the Compute team, your work will span these 2 domains, which are rich with challenging infrastructure and software engineering problems. Your work will directly impact hundreds of millions of users around the world.
Join us and help build the future of Reddit!In your day-to-day, you can expect to:Work collaboratively with a team of software engineers to create and maintain the foundational platform for running Reddit’s infrastructure. Deliver software to improve the availability, scalability, latency, and efficiency of Reddit’s Compute Platform.
Contribute feedback to the technical and strategic direction of the compute platform. Automate critical aspects of the development process such as service creation and management, as well as critical infrastructure operations. Share on-call responsibilities with the Compute team.
You have:3+ years of experience developing internet-scale software, preferably in the context of infrastructure. Language proficiency in Go. Experience developing on top of Kubernetes or similar distributed systems.
EWJD3