Voice infrastructure fails differently than most enterprise software. When a CRM goes down, salespeople work from memory for an hour. When a collaboration tool goes down, people send emails. When voice infrastructure goes down, calls stop. For a collections operation mid-campaign, a contact center during peak hours, or a franchise network where AI is handling customer orders — that is an immediate, measurable revenue event.
This is why the redundancy standard for voice infrastructure is not comparable to general enterprise software. Five nines — 99.999% uptime — translates to roughly five minutes of downtime per year. Achieving that number requires a specific architectural approach. Two data centers is not enough. Four is the foundation.
Why Two Data Centers Is Not Sufficient
The instinct for most organizations is that two data centers — primary and failover — provides adequate redundancy. For most enterprise applications, it does.
Voice is different for two reasons.
First, voice calls are stateful and real-time. A SIP session in progress cannot be paused and resumed after a failover event the way a database transaction can be replayed. When a data center fails, active calls on that infrastructure end. The failover architecture determines how quickly new calls can be established on surviving infrastructure — and whether that happens in seconds or minutes.
Second, data center failures are not always binary. Power events, network connectivity issues, hardware failures, and maintenance windows create partial degradation scenarios where some traffic processes normally and some does not. A two-data-center architecture has limited options for rerouting traffic in these scenarios. With four nodes, partial degradation on one node does not affect overall capacity — traffic redistributes across three healthy paths.
What Four-Node Architecture Provides
Four geographically distributed data centers, operating in an active-active or active-standby configuration, give voice infrastructure a resilience profile that two nodes cannot match.
Geographic distribution matters because the failure modes that take down a data center are often regional — weather events, power grid issues, network backbone disruptions. Two data centers in the same metro area or on the same network segment are vulnerable to the same regional events. Four data centers distributed across different geographic zones and network paths eliminate this class of failure.
Active-active configuration means all four nodes are handling live traffic simultaneously, not sitting in standby waiting for a primary to fail. This has two benefits: failures are absorbed by the remaining nodes without a failover event, and the full capacity of the network is available continuously rather than half of it sitting idle as a hot standby.
Maintenance becomes non-disruptive. With four nodes, taking one offline for maintenance leaves three handling full load with headroom. Patching, upgrades, and hardware maintenance happen without scheduling windows that affect call handling.
The SLO That Requires This Architecture
A 99.999% uptime SLO is not a marketing claim that can be backed by a two-node architecture. The math does not support it. Two data centers with typical failover mechanisms — even fast ones — will experience brief outage windows during failover events that accumulate to more than five minutes per year over time.
Our infrastructure runs across four data centers — Calgary, Toronto, Las Vegas, and Culpeper — with a 99.999% uptime SLO. That SLO is achievable precisely because the redundancy architecture does not depend on failover events. Traffic redistributes automatically across the remaining nodes when any one node is degraded or offline. There is no failover window because there is no single point of failure to fail over from.
This architecture was a deliberate design decision, not an upgrade from a simpler foundation. Retrofitting four-node redundancy onto infrastructure designed for two nodes is a rebuild, not an enhancement.
What This Means for Operations That Depend on Voice
For collections operations running campaigns during specific morning and afternoon windows, every minute of downtime during those windows has a direct cost in contacts not made and recoveries not attempted.
For enterprise organizations where Teams Phone is the primary communications infrastructure for hundreds or thousands of employees, voice downtime is an organizational outage.
For franchise networks running AI voice agents to handle customer orders, downtime during peak hours is lost revenue and customer experience damage that does not recover.
In each of these environments, the question is not whether your voice infrastructure will ever have a problem. Every infrastructure has problems. The question is whether the architecture absorbs those problems without surfacing them as outages — or whether your operation stops when a data center has an issue.
The Bottom Line
Four data centers is not a luxury tier for voice infrastructure. It is the architecture required to deliver five-nines uptime in a real-world environment where partial degradation, maintenance, and regional events are routine.
Operations that treat voice reliability as a line item to optimize are making a different bet than operations that treat it as a foundation to protect. The difference shows up when something goes wrong.