Software Engineer, Infrastructure
Unreal Gigs
San Francisco, california
Overview
Are you passionate about enabling large-scale compute efforts and enhancing the software infrastructure to support seamless research experiences? As a Software Engineer, Infrastructure, you will play a critical role in scaling our systems and creating robust tools for our team. You will build and manage tools, debug distributed systems, and design improvements to manage secrets, configurations, and stateful components. Your work will ensure our infrastructure is resilient and reliable, allowing other engineers to work more effectively.
Responsibilities
- Tool Development: Build and manage wrapper tools that allow code written for single hosts to scale seamlessly to large GPU clusters.
- Debugging: Debug distributed exceptions and improve our logging and tracing stack to enhance system reliability.
- System Improvements: Design and implement improvements to systems managing...