Site Reliability Engineer
Radar
New York, new york
About the role
We're looking for Site Reliability Engineers to work on our production infrastructure. Radar is a high-throughput, data intensive application handling 1 billion+ API calls per day. Over the past year, Radar has been used from over 100M devices worldwide. We run a multi-availability zone architecture and a major initiative is to enhance our deployment to be multi-region.
The stack:
We use Terraform via TypeScript (CDKTF) to manage our infrastructure.
We deploy everything to AWS via Docker.
We use MongoDB deployed to Atlas.
We do blue-green and canary deployments via CircleCI CI/CD.
We monitor production with CloudWatch, honeycomb, Pingdom and PagerDuty.
DNS is managed by CloudFlare.
Most engineers are in the on-call rotation.
Our main server languages are TypeScript and Rust.
Our data pipelines are written in Airflow and Scala Spark.
We sponsor OpenStreetMaps,...