Why this matters
Every design decision hinges on numbers. Do we need sharding? Only if writes exceed ~10k/sec. Do we need a cache? Only if read latency is a bottleneck. Do we need a CDN? Only if users are geographically far from origin. You can't answer any of these without doing back-of-envelope math first. Interviewers care about this more than any specific technology.
The goal isn't precision — it's order of magnitude. Knowing it's "10k QPS" vs "10M QPS" changes the architecture entirely. Knowing it's exactly 11,437 QPS vs 9,812 QPS changes nothing.