Senior Site Reliability Engineer
XperiencOps Inc
Pleasanton, california
The Senior Site Reliability Engineer (SRE) plays a vital role in ensuring the reliability, scalability, and performance of our enterprise software platform. This is a senior-level position that requires deep technical expertise, strong problem-solving skills, and the ability to collaborate effectively in a fast-paced, demanding environment. Our customers, the largest enterprises in the world, expect 24/7 platform availability and top-tier performance.
The ideal candidate has strong expertise in AWS cloud technologies, a deep understanding of serverless architectures (AWS Lambda), and a passion for building resilient systems to enhance the customer experience.
Platform Reliability:
- Design, implement, and manage highly available and scalable systems to meet customer expectations for 24/7 uptime.
- Monitor, troubleshoot, and resolve platform incidents using tools such as Sentry, New Relic, and custom monitoring...