Big Data - Part II: Amjed Almousa, Ph.D. Cloud Computing & Big Data
Big Data - Part II: Amjed Almousa, Ph.D. Cloud Computing & Big Data
Need for parallel processing Amount of Data. (specially with data replication) Cost of licensing for large number of machines
Client
Master Server
File a
Chunk a.1 => server 1,2 Chunk a.2 => server 1,3
Chunk Server 1
Chunk Server 2
Chunk Server 3
a.1
a.2
a.1
b.1
b.1
a.2
Large Number of Files 300 Chunks Per File => Overload the Master
If a Node (server) needs to edit or append to a chunk, then it would need to Lease that chunk from the master server. Once the node is done editing, the change data is propagated to other copies of the chunk existing on other nodes. The lease is returned to the master node.
Lease Mechanism
Cloud Computing & Big Data. Amjed Almousa, Ph.D.
1. 2. 3. 4.
References
Cloud Computing & Big Data. Amjed Almousa, Ph.D.