Start by identifying the system's nature. Is it Read-heavy (like a News Feed: 100 reads/1 write) or Write-heavy (like a Chat app)? Establish the geographic distribution of users and the latency SLA (e.g., "We need an end-to-end latency of under 200ms"). This dictates your entire caching and routing strategy.