Senior Site Reliability Engineer
Listed on 2026-01-12
IT/Tech
Cloud Computing, Systems Engineer
DriveWealth is a global B2B financial technology organization dedicated to democratizing access to financial independence around the world. Our mission is realized through an API-based platform, empowering our partners to offer seamless investing and trading experiences to clients worldwide, all from their mobile devices.
Our technology provides partners with a modern, extensible toolkit, enabling traditional investment workflows and innovative techniques like fractional share ownership. DriveWealth has evolved into a global platform offering trading of US equities, mutual funds, ETFs, fixed income, and options.
We seek enthusiastic professionals to contribute diverse perspectives and experiences to our Brokerage-as-a-Service platform. Our culture blends the pace and opportunity of a tech start-up with the impact, stability, and significance of Wall Street. We encourage creativity and experimentation while ensuring institutional-grade execution and regulatory compliance in everything we do. We value diversity and inclusion, celebrating the unique differences of our employees as we scale and grow together.
We’re guided by operating principles grounded in accountability, teamwork, integrity, and solutions built to scale. Join us!
The Role
As a Senior Site Reliability Engineer, you will enhance the reliability and performance of our Brokerage-as-a-Service platform during critical 24/7 operations. This role demands a proactive approach to managing technical challenges and system optimizations that align with our global operational strategies.
What You’ll Do
- Support the SRE team in developing and implementing enhancements to support workflows, focusing on automation and efficiency improvements.
- Handle technical escalations, troubleshoot complex issues, and actively participate in on-call rotations to ensure rapid response and resolution during non-traditional hours.
- Adhere to and administer incident and change management policies.
- Coordinate incident resolution efforts and implement change management protocols to maintain and enhance system reliability, especially during critical system operations at night.
- Work closely with the New York office to ensure smooth operation and alignment of SRE practices across time zones.
- 3+ years in a Senior SRE role or a similar position, demonstrating deep knowledge and expertise in site reliability engineering and operations.
- Working knowledge of REST APIs and an understanding of API integration.
- Python proficiency in scripting for automation and system management, with a track record of developing and implementing automation solutions.
- SQL and database expertise with transactional databases, including querying and troubleshooting.
- Strong analytical skills, with a demonstrated ability to troubleshoot technical issues and perform root cause analysis.
- Availability for flexible work hours and willingness to cover US markets trading sessions, including L2 on-call coverage.
- Knowledge of change management processes and risk management.
- Experience in the brokerage or financial industry
- Proficient with cloud services, particularly AWS, and knowledgeable about cloud architecture best practices, including IAM, EC2, S3, and DynamoDB
- Experience maintaining and supporting containerized systems, with familiarity in orchestration tools
- Knowledge of Infrastructure as Code (IaC) practices and tools such as Terraform or CloudFormation
- Ability to manage and troubleshoot job scheduling tools like Rundeck or Apache Airflow
- Advanced skills in managing containerized environments using Kubernetes and OpenShift
- Practical experience with Confluent Cloud for event streaming architectures
- Experience with Java applications and a basic understanding of using the browser developer console for front-end debugging
Additional Notes:
This role is critical for our continuous operations and requires a commitment to nighttime hours, aligning with the global nature of our financial services. Candidates must be prepared for intense collaboration periods and proactive communication across global teams.
Applicants must be authorized to work for any employer in…