Assignment 4

Deadline: Monday, April 6, 2020, 11:59 pm

Introduction

Please answer the questions precisely and concisely. Every question can be answered in one or at most a few sentences. I will not have the patience to read long paragraphs or essays, and you may lose credit even for possibly correct answers.

Submission

Please submit your assignment prior to the due date and time via Canvas.

Note: submissions must be plain text files, PDF files, or text entry within Canvas. Other formats, such as Microsoft Word, Apple Pages, or Adobe InDesign, will not be accepted.

Reading

(1) Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung, The Google File System, Proceedings of the 19th ACM Symposium on Operating Systems Principles, ACM, Bolton Landing, NY (2003), pp. 29–43. You only need to read sections 1–5 if you don’t want to read the entire paper.
(2) Giuseppe DeCandia, Deniz Hastorun, et al., Dynamo: Amazon’s Highly Available Key-value Store, Proceedings of the 21st ACM Symposium on Operating Systems Principles, Stevenson, WA (2007). Here also, you only need to read sections 1–5 if you don’t want to read the entire paper. You can also skip sections 3.1 and 3.2 (related work).

Questions

Question 1.

What is presented as a disadvantage of using a large chunk size in GFS, and how was it fixed?

Question 2.

What are the three primary reasons stated for separating data flow from control flow in GFS?

Question 3.

What are the three advantages of virtual nodes in Dynamo’s design?

Question 4.

How does Dynamo control consistency with R and W values?