Senior DevOps Engineer (US Remote CT / ET timezone)
SEON Technologies
Job Details
Full-time
Full Job Description
Are you a technically skilled, hands-on, passionate DevOps Engineer & SRE? Join us at SEON and become part of a world-class research and development organization passionate about creating incredible experiences for our employees and customers through superior data-driven insights. These great experiences ensure that we can continue to execute our mission of making the internet a safer place to do business!
SEON provides an API-first solution that helps our customers (many of the world’s leading providers of digital experiences for consumer financial services, insurance, online entertainment, etc.) defend their customers against fraud and financial crimes. With over 250 Fraud Fighters across four global offices (Austin, Budapest, London, and Jakarta), our goal remains unwavering: to make the internet safe for businesses and consumers to transact. Our achievements, including a record-breaking Series B funding round and coverage in TechCrunch, have earned us recognition as the world’s fastest-growing fraud prevention company. We take pride in our rapid growth and mission to democratize fraud-fighting while empowering the best online businesses. Join us in our journey to make the internet safer for everyone.
What You’ll Do:
As a Senior DevOps Engineer focused on Site Reliability Engineering (SRE) at SEON, you will play a crucial role in maintaining and improving our cloud infrastructure's reliability, scalability, and performance. You will work closely with cross-functional teams to ensure our systems are robust, scalable, and efficient.
- Ensure the reliability, availability, and performance of our systems by implementing SRE best practices
- Develop and maintain comprehensive monitoring and alerting systems using tools such as Prometheus, Grafana, and the ELK stack
- Manage incident response and root cause analysis for production issues
- Conduct post-incident reviews to learn from failures and drive continuous improvement in the system’s reliability
- Continuously monitor and optimize the performance of cloud infrastructure to ensure efficient resource utilization and cost-effectiveness
- Automate routine tasks and processes to reduce manual intervention and increase efficiency
- Analyze current system capacity and plan for future growth to ensure the infrastructure can scale with increasing demands
- Define, measure, and monitor SLOs and SLIs to ensure that services meet their reliability targets
- Work closely with engineering and product teams to provide feedback and suggestions on new architectures, ensuring they meet reliability and performance standards
- Develop and maintain comprehensive documentation for architecture, infrastructure, and troubleshooting processes
- Provide on-call support to ensure the continuous availability of our applications and infrastructure
- Ensure that systems meet security and compliance requirements, performing regular audits and assessments based on the internal security team’s guidelines
- Stay current with new technologies and industry trends, evaluating their potential impact on our infrastructure and reliability practices
What You Bring:
- 8+ years of experience as a DevOps Engineer or in a similar software engineering role, with a focus on SRE principles and practices
- Ability to quickly troubleshoot complex issues related to system resources or different applications
- A proactive approach to identifying and resolving issues independently with a strong problem-solving attitude
- Proficiency with Kubernetes, AWS EKS preferred
- Expertise with Infrastructure as Code (Terraform)
- Extensive experience with high-performance, scalable, multi-region AWS infrastructure
- Strong experience with monitoring and logging tools such as Prometheus, Grafana, Elasticsearch, and Kibana
- Proficiency with incident management tools such as PagerDuty, Opsgenie, or similar platforms to manage on-call schedules and incident response processes effectively
- Familiarity with CI/CD pipelines and tools (e.g., GitHub Actions, TeamCity)
- Excellent communication and collaboration skills to work effectively with cross-functional teams
What We Offer:
- Employee stock ownership plan (ESOP)
- Flexible hours
- Generous holiday allowance
- Access to significant opportunities for learning and development
- Private health insurance, including dependants (includes employee assistance and mental health support)
- Complimentary weekly language courses
- Enhanced parental leave
What’s Next:
Sounds good? Great, we can’t wait to hear from you! Want to learn more about what it’s like to work at SEON first?
👉 Here you go: https://careers.seon.io/