An Event-Driven Point of View on Distributed Systems
Pradip Dharam
Python Developer & Data Engineer: PySpark, ML (NLP) product feature development & refactoring, PR reviews, Git, MongoDB. Familiar with Airflow and Docker. Prototyping with Pandas, NumPy, SQL, scikit-learn. Novice at CI/CD.
Let's take the state-of-the-art Hadoop Distributed File System (HDFS) as an example.
1. HDFS has two kinds of nodes: a name node and data nodes.
2. When you save a file to HDFS, the file gets split into blocks of 128 MB each (the default block size since Hadoop 2.0).
3. Each block is stored as 3 replicas, assuming the default replication factor of 3.
4. Say my file has 5 blocks in total. With each block replicated 3 times, those 5 blocks become 15 block copies.
5. Those 15 copies are distributed across the data nodes. The name node keeps the file's metadata: which nodes hold each block and its replicas. The replicas exist for fault tolerance.
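The arithmetic in the steps above can be sketched in a few lines of Python. This is a minimal illustration (not actual HDFS code), using the defaults mentioned in the list:

```python
import math

BLOCK_SIZE_MB = 128   # default block size since Hadoop 2.0
REPLICATION = 3       # default replication factor

def plan_blocks(file_size_mb: int) -> tuple[int, int]:
    """Return (block_count, total_stored_copies) for a file of the given size."""
    blocks = math.ceil(file_size_mb / BLOCK_SIZE_MB)
    return blocks, blocks * REPLICATION

# A 600 MB file splits into 5 blocks; with replication 3 that is 15 stored copies.
print(plan_blocks(600))  # (5, 15)
```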
Now, how does the name node know whether the data nodes are alive?
Every data node sends a heartbeat to the name node every 3 seconds, to let it know "I am alive". Ha ha... sounds interesting, right?
If the name node misses 10 consecutive heartbeats from a data node, it assumes that data node is dead.
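Here is a minimal sketch of that liveness check, assuming the 3-second interval and the 10-missed-heartbeats threshold from the post. The `NameNode` class and its method names are mine, purely for illustration:

```python
import time

HEARTBEAT_INTERVAL_S = 3   # data nodes report every 3 seconds
MISSED_BEATS_LIMIT = 10    # tolerated consecutive misses before declaring death

class NameNode:
    def __init__(self):
        # node id -> timestamp of its last heartbeat
        self.last_seen: dict[str, float] = {}

    def heartbeat(self, node_id: str) -> None:
        """Called by a data node to say 'I am alive'."""
        self.last_seen[node_id] = time.monotonic()

    def dead_nodes(self) -> list[str]:
        """Nodes that have gone silent longer than the tolerated window."""
        deadline = HEARTBEAT_INTERVAL_S * MISSED_BEATS_LIMIT
        now = time.monotonic()
        return [n for n, t in self.last_seen.items() if now - t > deadline]
```

In a real cluster the heartbeats arrive over RPC; here a direct method call stands in for the network.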
For the blocks stored on the dead data node, the replication factor effectively drops from 3 to 2. The name node then runs its fault-tolerance mechanism: it scales the count back to 3 by creating new replicas of the affected blocks on alive data nodes, and updates its metadata.
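That re-replication step can be sketched as a pure function over the name node's metadata. This is a toy version under my own assumptions (a simple block-to-nodes map, first-come replica placement); real HDFS placement is rack-aware:

```python
REPLICATION = 3  # target replication factor

def re_replicate(block_locations: dict[str, set[str]],
                 alive_nodes: set[str]) -> dict[str, set[str]]:
    """Restore the replication factor after node failures.

    block_locations maps block id -> nodes currently holding a replica.
    Replicas on dead nodes are dropped, then new replicas are assigned
    to alive nodes that do not already hold the block.
    """
    for block, nodes in block_locations.items():
        nodes &= alive_nodes                      # drop replicas on dead nodes
        candidates = sorted(alive_nodes - nodes)  # nodes without this block
        while len(nodes) < REPLICATION and candidates:
            nodes.add(candidates.pop(0))          # schedule a new replica
        block_locations[block] = nodes
    return block_locations
```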
To conclude, this is how distributed systems can be designed with an Event-Driven Architecture. In the case of HDFS, the events are the heartbeats that say "I'm alive".
Follow me or send a connection request if you would like to learn more from me.