From documentdb
Provides best practices for Azure DocumentDB high availability, cross-region disaster recovery, and automatic backup retention. Use when designing production topology, planning failover, or setting up DR.
How this skill is triggered — by the user, by Claude, or both
Slash command
/documentdb:high-availabilityThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Azure DocumentDB's resiliency model has three layers. Pick the right combination for the workload — production-critical workloads should use all three.
Azure DocumentDB's resiliency model has three layers. Pick the right combination for the workload — production-critical workloads should use all three.
| Layer | What it protects against | SLA contribution | Automatic? |
|---|---|---|---|
| In-region HA (standby shard per primary, synchronous replication) | Node / zone failures within a region | 99.99% | ✅ Failover is automatic; connection string is unchanged |
| Cross-region replica (active-passive, asynchronous) | Regional outage; provides read scale-out | + 0.005% → 99.995% combined | ❌ Promotion is customer-triggered (shared-responsibility DR); HA must be re-enabled on the promoted cluster |
| Automatic backups (35 d active / 7 d deleted clusters) | Accidental deletion or corruption | — | ✅ Continuous, no perf impact |
Applications connect to a cluster through a single connection string and endpoint regardless of shard count. The multi-shard topology is fully abstracted — a 16-shard cluster looks like one MongoDB endpoint to the driver.
How HA and cross-region replicas protect different failure modes:
| Failure scenario | Feature | No data loss (RPO = 0) | Survives region-wide outage | Automatic failover | Connection string preserved |
|---|---|---|---|---|---|
| Physical shard / zone failure | In-region HA | ✅ (synchronous) | ❌ | ✅ | ✅ |
| Regional outage | Cross-region replica | ❌ (asynchronous; RPO > 0) | ✅ | ❌ (customer-triggered) | ✅ ¹ |
¹ Only when the application uses the Global read-write connection string (<cluster>.global.mongocluster.cosmos.azure.com). The cluster-specific / "self" connection string becomes read-only after promotion.
| Scenario | Recommendation |
|---|---|
| Production cluster | Enable HA |
| Need 99.99% SLA | Enable HA |
| Need 99.995% SLA | Enable HA and create a cross-region replica |
| Automatic failover from node/zone failure | Enable HA |
| Cross-region disaster recovery | Create a replica cluster |
| Read scale-out within a single region (analytics / reporting offload) | Create a same-region replica (no DR benefit; you can have only one replica per primary, so this trades cross-region DR for in-region read offload) |
| Read scale-out across regions | Create a replica cluster |
| Availability-zone placement required | Enable HA (HA is required for AZ support) |
| Non-production / dev-test cluster | Disable HA to reduce cost |
| Recover from accidental delete/modify | Automatic backups (35-day retention for active clusters) |
npx claudepluginhub azure/documentdb-agent-kit --plugin documentdbProvides CDSS development patterns for drug interaction checking, dose validation, clinical scoring (NEWS2, qSOFA), and alert classification integrated into EMR workflows.