Scaling, Partitioning & Performance Quiz

Q1. EASY: In a streaming system, which metric best indicates that consumers are not keeping up with producers?




Q2. EASY: What is one effect of using large batch sizes and compressing messages in a messaging system?




Q3. EASY: If one partition in a distributed system becomes a hot partition (receiving much more traffic than others), how can you mitigate this hotspot?




Q4. EASY: In a messaging pipeline, what is the main purpose of implementing back-pressure?




Q5. EASY: What is a typical trade-off when replicating data across multiple availability zones or regions?




Q6. MEDIUM: Which partitioning strategy minimizes data movement when new partitions or nodes are added?




Q7. MEDIUM: In a system with consumer groups, what is the purpose of tracking an offset for each group?




Q8. MEDIUM: In a replicated messaging cluster, how is a new leader chosen for a partition after its leader fails?




Q9. MEDIUM: Why do many high-throughput systems use an append-only log on disk instead of random writes or only in-memory storage?




Q10. MEDIUM: Compared to standard queues, what is a known limitation of Amazon SQS FIFO queues?




Q11. MEDIUM: In Apache Kafka, what happens during a consumer group rebalance event?




Q12. HARD: In the context of high-throughput messaging, what does 'zero-copy' data transfer mean?




Q13. HARD: In a replicated log, when would a follower be removed from the In-Sync Replicas (ISR) set?




Q14. HARD: Which design choice helps a messaging system avoid back-pressure issues?




Q15. HARD: Requiring acknowledgments from all in-sync replicas (acks=all) instead of only the leader (acks=1) will generally:




system-design