- 2025
-
Fault Tolerance and High Availability -- Building Stable Distributed Systems
1. Airplane “Failures” and Passenger Safety Imagine a flight where the airplane might encounter engine failure or turbulence. To ensure safety, multiple backup systems and emergency plans are desi...
read more→
-
Practical Raft -- A Deep Dive into Distributed Replicated Log Systems
1. Team “Leader” Election and Task Synchronization Imagine a project team that needs to select a leader through voting. The leader then assigns tasks to ensure everyone executes according to plan....
read more→
-
Demystifying Distributed Consistency CAP Theorem and Raft Algorithm Explained
1. “Agreement” in Team Collaboration Imagine a group of friends planning a trip but living in different cities. Messages have delays and some may lose connection, so opinions may not be unified. T...
read more→
-
MapReduce in Practice -- An Introduction to Distributed Big Data Processing
1. Analogy: Efficient Collaboration in a Distributed Kitchen Imagine a large kitchen tasked with preparing thousands of dishes. If one chef does all the work, efficiency suffers. MapReduce works l...
read more→
-
Distributed Communication Essentials -- RPC and an Introduction to Go Concurrency
1. Opening the Magic Box of Distributed Communication: What is RPC? In distributed systems, different machines need to “talk” to each other to collaborate. RPC (Remote Procedure Call) is a magical...
read more→