HOT-HOT Disaster Recovery (DR)
Disaster recovery presents unique challenges in trading systems, particularly for Direct Market Access (DMA) environments.
During the DMA implementation, the team overhauled the DR architecture to meet these demands. Unlike program trading, which can tolerate brief interruptions, DMA requires continuous availability: even a single dropped client session is a critical incident.
HOT-HOT DR Implementation
The solution delivers multi-layer resiliency against:
- Connectivity loss (both market feeds and client sessions)
- Single server failure
- Complete data center outage
Architecture Components
1. Duplicated Client Connectivity
- Primary connection pool (local DC) → keeps latency low while protecting sessions against single-server failures
- Reserve pool (secondary DC) → ensures failover capacity
2. Horizontally Scalable Core
- Primary DC deployment
- Real-time state replication to secondary DC
- Automatic failover with split-brain protection
3. Post-Recovery Integrity Checks
- Drop-copy reconciliation with execution venues
- Automated gap/duplication detection and resolution
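The gap/duplication detection step above can be illustrated with a minimal sketch. It assumes each fill carries a venue-assigned execution ID that is unique per venue; the `Fill` record, the `reconcile` function, and the field names are hypothetical and are not taken from the actual system.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Fill:
    """Hypothetical execution record; exec_id is assumed unique per venue."""
    exec_id: str
    order_id: str
    qty: int
    price: float


def reconcile(internal, drop_copy):
    """Compare fills recovered after failover against the venue drop copy.

    Returns execution IDs grouped by discrepancy type:
      gaps       - reported by the venue but missing internally
      duplicates - recorded more than once internally
      unknown    - recorded internally but absent from the drop copy
    """
    internal_counts = Counter(f.exec_id for f in internal)
    internal_ids = set(internal_counts)
    venue_ids = {f.exec_id for f in drop_copy}

    return {
        "gaps": sorted(venue_ids - internal_ids),
        "duplicates": sorted(e for e, n in internal_counts.items() if n > 1),
        "unknown": sorted(internal_ids - venue_ids),
    }


if __name__ == "__main__":
    venue = [
        Fill("E1", "O1", 100, 10.0),
        Fill("E2", "O1", 50, 10.1),
        Fill("E3", "O2", 200, 9.9),
    ]
    recovered = [
        Fill("E1", "O1", 100, 10.0),
        Fill("E2", "O1", 50, 10.1),
        Fill("E2", "O1", 50, 10.1),  # replayed twice during failover
    ]
    print(reconcile(recovered, venue))
```

In this sketch a gap would be resolved by replaying the missing execution from the drop copy, and a duplicate by discarding the repeated record; how the real system resolves each case is not described in the source.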
[Figures: architecture before Findev and after Findev]


Results
- Transparent server failover: client sessions remain uninterrupted during single-server failures
- Rapid DC recovery: full data center failover completes within 1 minute
- Zero data loss: verified through post-recovery drop-copy reconciliation