1. The Final Bottleneck: When the "Write Engine" Runs Dry
We’ve used Replication to handle millions of Reads and PITR to drive RPO toward zero. However, there is a brutal physical limit that a Single-leader architecture cannot overcome: every single Write command must pass through one server.
When a product scales up and hits a threshold of, say, 10,000 Write QPS (Queries Per Second)—think Grab during peak hours or an e-wallet during a Flash Sale—the Primary DB’s CPU and Disk IOPS will hit a 100% ceiling. At this stage, throwing money at a bigger server (Vertical Scaling) is no longer viable, because even the world's best hardware has its limits.
The ultimate solution for Big Tech is Database Sharding: breaking a massive data block into smaller pieces (Shards) and distributing them across multiple independent server clusters.
The PM Trade-off: Sharding allows a system to scale capacity and Write QPS almost infinitely by adding more servers. However, the price is a 10x increase in operational complexity.
2. Life or Death in the Shard Key: The Hotspot Disaster
Sharding doesn't work automatically. The system (Routing Layer) needs a rule to know: "Which shard should this record be stored in?" That rule is determined by the Shard Key. Choose the wrong Shard Key, and the entire architecture collapses.
Scenario: Building a transaction history system for an e-wallet (100M rows/day).
The Mistake (Vague/Bad): Choosing Created_At (Timestamp) as the Shard Key, creating a new Shard for each month.
The Consequence (Hotspot): 100% of the Write traffic for the current month will flood the single Shard for that month. This shard "burns up" while previous months' shards sit idle. You’ve paid for 10x more servers, but your Write bottleneck remains unchanged.
Best Practice: Even Distribution (User ID Hash-based)
Practical Decision (Specific/Good): Using a hash function: Hash(User_ID) % Number_of_Shards.
Business Impact: The Hash algorithm randomly and evenly distributes users across shards. The Write load is perfectly balanced. More importantly, it ensures Data Locality: a user's entire history remains within a single shard, making "View History" operations extremely fast.
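The routing rule above fits in a few lines. This is a minimal sketch: the shard count and the choice of MD5 are illustrative assumptions, not a prescription for any particular database.

```python
import hashlib

NUM_SHARDS = 8  # illustrative cluster size

def shard_for(user_id: str) -> int:
    """Route a record to a shard by hashing its User_ID.

    A stable hash (MD5 here, an arbitrary choice) guarantees the same
    user always lands on the same shard -- which is exactly what
    preserves Data Locality for "View History" queries.
    """
    digest = hashlib.md5(user_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % NUM_SHARDS

# All of one user's writes and reads go to a single shard:
assert shard_for("user-42") == shard_for("user-42")
```

Because the hash output is effectively uniform, ten thousand users spread across eight shards end up within a few percent of each other, which is the "perfectly balanced" Write load described above.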
3. The Dark Side of Sharding: Trading Speed for Operational Pain
A PM/TPM should never force a tech team to use Sharding unless the system is truly at a dead end, because its side effects can kill product velocity.
The Collapse of JOINs (Scatter-Gather): If the Users table is on Shard 1, but the Orders table is scattered across Shards 2 and 3, you can no longer use a classic JOIN. The application must send queries to all shards (Scatter), pull the data into the App server's RAM, and manually stitch them together (Gather). Complex reporting becomes dozens of times slower.
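The Scatter-Gather pattern can be sketched as below; the per-shard callables and the in-memory "stitch" step are stand-ins for real database connections.

```python
from concurrent.futures import ThreadPoolExecutor

def scatter_gather(shards, query, order_key):
    """Fan a query out to every shard (scatter), then merge the
    partial results in application memory (gather).

    `shards` is a list of callables, each simulating one shard's
    query endpoint. Note the cost: every shard does work for every
    query, and the app server pays for the final sort -- this is why
    cross-shard reporting is so much slower than a single-node JOIN.
    """
    with ThreadPoolExecutor(max_workers=len(shards)) as pool:
        partials = list(pool.map(lambda shard: shard(query), shards))
    merged = [row for part in partials for row in part]
    return sorted(merged, key=order_key)
```

Even in this toy form you can see the problem: the work grows with the number of shards, not with the size of the answer.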
Loss of Distributed Transactions: If User A (Shard 1) sends $20 to User B (Shard 2), the deduction from Shard 1 and the addition to Shard 2 must happen simultaneously. Maintaining transactions across shards (Two-Phase Commit) is heavy and prone to "stuck money" errors. Engineers often have to rewrite logic using complex SAGA Patterns.
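The core idea of a SAGA is compensation instead of rollback. A minimal sketch, with the three steps passed in as callables (real implementations also persist saga state so a crash mid-flow can be recovered):

```python
def transfer_saga(debit, credit, refund):
    """Minimal SAGA sketch for a cross-shard transfer.

    `debit` runs on the sender's shard, `credit` on the receiver's.
    They cannot share one database transaction (they live on
    different shards), so if the credit step fails we execute the
    compensating action `refund` instead of a ROLLBACK.
    """
    debit()                 # step 1: local transaction on Shard 1
    try:
        credit()            # step 2: local transaction on Shard 2
    except Exception:
        refund()            # compensation: undo step 1
        raise
```

The "stuck money" failure mode is exactly what happens when step 1 commits but neither step 2 nor the compensation runs, which is why production sagas log every step durably.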
The Resharding Nightmare: When your 3 shards are full and you need to move to 5, your hash algorithm changes from % 3 to % 5. This means terabytes of data are suddenly at the "wrong address" and must be migrated. Moving data on a live system without downtime is a technical "wonder" that can take a Data Platform team months to execute.
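The scale of that migration is easy to underestimate. The toy count below (key range is illustrative) shows why plain `% N` routing is so punishing: changing the modulus relocates most keys, not just the overflow.

```python
def moved_fraction(num_keys, old_shards, new_shards):
    """Fraction of keys whose shard assignment changes when the
    routing modulus changes from old_shards to new_shards."""
    moved = sum(1 for k in range(num_keys)
                if k % old_shards != k % new_shards)
    return moved / num_keys

# Going from 3 shards to 5 relocates roughly 80% of all keys.
print(moved_fraction(1_000_000, 3, 5))
```

This is why many sharded stores route by consistent hashing or pre-split key ranges instead of a bare modulus: those schemes move only about `1 / new_shards` of the keys when capacity is added.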
4. Auditing Sharded Clusters with the PEUF Framework
Don't let your sharded system become a black box. Anticipate these edge cases:
[P] Permission / Product Logic (Feature Limits):
Risk: Product demands a "Real-time Global Leaderboard." With Sharding, sorting billions of rows across different shards every second is nearly impossible.
Solution: Compromise on the feature—make it "Updated every 5 minutes," then use an async flow to push aggregated data to a centralized Cache (Redis) for reading.
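The compromise can be sketched as a periodic job: scatter a cheap per-shard top-N, merge in memory, publish to a cache. The in-memory dict stands in for Redis, and the callables stand in for shard queries.

```python
import heapq

def refresh_leaderboard(shard_top_n, cache, n=10):
    """Merge each shard's local top-N into a global top-N and
    publish it to a cache for all reads.

    `shard_top_n` is a list of callables, one per shard, each
    returning [(user, score), ...]. The expensive global sort over
    billions of rows never happens; readers only ever hit the cache,
    which is at most `n * refresh_interval` stale.
    """
    merged = [row for shard in shard_top_n for row in shard(n)]
    cache["leaderboard"] = heapq.nlargest(n, merged, key=lambda r: r[1])
```

Run this from a scheduler every 5 minutes and the product gets its leaderboard without a single real-time cross-shard sort.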
[E] Extreme (The "Justin Bieber" Effect - Data Skew):
Risk: You hash by User_ID. Unfortunately, a KOL account (like Justin Bieber) has 100 million followers interacting every second. Even with a perfect hash, the shard containing that KOL's data will still crash due to Data Skew.
Solution: Design a "KOL Routing" mechanism. Move massive accounts out of the standard shard pool and onto dedicated infrastructure.
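One minimal way to implement "KOL Routing" is an explicit override table consulted before the hash. The account names, shard labels, and cluster name below are invented for illustration.

```python
import hashlib

NUM_SHARDS = 8
# Accounts large enough to melt a regular shard get pinned to
# dedicated infrastructure (values here are made-up labels).
KOL_OVERRIDES = {"justin_bieber": "kol-cluster-1"}

def route(user_id: str) -> str:
    """Check the override table first; fall back to hash routing.

    The override table is tiny (a handful of giant accounts), so the
    lookup is free, while everyone else keeps the even distribution
    of the standard Hash(User_ID) % NUM_SHARDS rule.
    """
    if user_id in KOL_OVERRIDES:
        return KOL_OVERRIDES[user_id]
    digest = int(hashlib.md5(user_id.encode("utf-8")).hexdigest(), 16)
    return f"shard-{digest % NUM_SHARDS}"
```

The design point: Data Skew is a property of your data, not your hash, so the fix is a data-aware exception list, not a better hash function.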
[U] Unavailability (Asynchronous Backups):
Risk: You back up Shard 1 at 12:00 and Shard 2 at 12:05. If the system crashes and you restore via PITR, that 5-minute gap creates an inconsistency: money was deducted on Shard 1 but never added on Shard 2.
Solution: The backup infrastructure for sharded clusters must take Distributed Snapshots that are consistent against a Global Clock, so every shard is restored to the exact same moment in time.
[F] Fraud (Crashing via Scatter-Gather):
Risk: Attackers create bots to search using keywords that don't include the Shard Key (e.g., searching by Note_Content instead of User_ID). The router is forced to fan out the search to all 50 shards at once, exhausting the entire system's connection pool.
Solution: Set extremely strict Rate Limits for "Cross-shard Queries" or synchronize searchable fields to a system like Elasticsearch.
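The rate limit in that solution can be sketched as a token bucket that applies only to queries missing the Shard Key; the capacity and refill rate below are arbitrary assumptions.

```python
import time

class CrossShardLimiter:
    """Token bucket that throttles scatter-gather (cross-shard) queries.

    Queries that carry the Shard Key route to one shard and bypass
    the bucket entirely; only fan-out queries spend tokens. A bot
    spamming Note_Content searches is therefore cut off long before
    it can drain the connection pools of all 50 shards.
    """
    def __init__(self, capacity=5, refill_per_sec=1.0):
        self.capacity = capacity
        self.tokens = float(capacity)
        self.refill = refill_per_sec
        self.last = time.monotonic()

    def allow(self, has_shard_key: bool) -> bool:
        if has_shard_key:
            return True  # single-shard query: cheap, never throttled
        now = time.monotonic()
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.refill)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

In production this bucket would be keyed per client (per API key or IP), but the asymmetry is the point: targeted queries flow freely, fan-out queries are a scarce resource.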
5. Series Summary: Data Architecture Mindset
Through this 4-part series, we’ve built a Scaling Journey that every Senior PM/TPM should master:
HA & Monitoring: Identify SPOFs. Don't wait for a crash to fix it; use core metrics to trigger automation.
Replication (Single-leader): Separate Reads from Writes to scale query performance. Use "Read-your-own-writes" to mask Replication Lag.
Backup & PITR: Understand RPO/RTO. Never trust Replication for human-caused disasters; WAL/Binlog is your only true time machine.
Sharding: The ultimate weapon to break physical Write limits. Your choice of Shard Key (Locality vs. Hotspot) determines the operational fate of the product.
Building a tech product isn't just about UI/UX. Mastering the foundations of data architecture is how you protect revenue, negotiate fairly with Engineering, and build a sustainable product at a scale of millions of users.
Don't let the illusion of Replication cost you all your data after a single wrong command: replicas faithfully copy mistakes, too. Master RPO, RTO, and PITR mechanisms to design a battle-tested Disaster Recovery strategy.