Read MapReduce: Simplified Data Processing on Large Clusters. Skip sections 4 and 7.

Questions

Appendix A of the MapReduce paper contains a program with two classes, WordCounter and Adder, as well as a main() function.

  • Now look at Figure 1 (execution overview) of the paper. Describe where and how each class/function will be invoked.
  • If a machine running WordCounter::Map() crashes, which parts of the system (if any) would fix the error?
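
As a warm-up before answering, it may help to see the word-count logic run in a single process. The sketch below is a rough Python analogue of the Appendix A program, not the paper's C++ code: the function names word_counter_map, adder_reduce, and run_local are hypothetical, and the in-memory grouping step only stands in for work that the real library spreads across many machines. It says nothing about where each piece runs in Figure 1 or about what happens on a crash.

```python
from collections import defaultdict


def word_counter_map(text):
    """Hypothetical stand-in for WordCounter::Map(): emit (word, "1") per word."""
    for word in text.split():
        yield word, "1"


def adder_reduce(key, values):
    """Hypothetical stand-in for Adder::Reduce(): sum the counts for one key."""
    return key, str(sum(int(v) for v in values))


def run_local(inputs):
    """Run map, group-by-key, and reduce in one process (assumption: no cluster)."""
    # Collect every intermediate value under its key.
    groups = defaultdict(list)
    for text in inputs:
        for key, value in word_counter_map(text):
            groups[key].append(value)
    # Call the reducer once per distinct key, in sorted key order.
    return dict(adder_reduce(key, groups[key]) for key in sorted(groups))
```

For example, run_local(["a b a", "b c"]) returns a dictionary mapping each word to its count as a string, mirroring how the Appendix A classes pass counts around as strings.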

Turn in your answers as part of Exercise 8.

If you want to learn more about distributed systems, take CSE 452.

If you want to learn more about databases, take CSE 344.

If you want to learn more about map and reduce, take CSE 341.

Further readings