登录查看更多内容

Partitioning & Replication

Pushkal Goyal

.

发布日期: 2024年1月2日

+ 关注

Partitioning: Process of dividing data into independent segments.

Need of Partitioning:

If the data is served without partitioning, every time a request comes to the machine, the whole data needs to be traversed, it will also be resource- and time-consuming.
In partitioning, there is less load on partitioned systems, unlike non-partitioned systems in case of failure because the load will be independent in case of partitioned systems, unlike the latter.
Partitioning is usually done in the form of indexes that are based on most used patterns.
Provides increased fault tolerance and reliability.
Used by NoSQL databases like CassandraDB, DynamoDB, etc.

The reason why these databases never offer immediate acknowledgment to write operations is that for a single write operation, multiple requests are generated to write in the main table and all different criteria-based partitions.

Replication: creating copies of data on multiple machines.

2 protocols:

1. Primary Replication Protocols:

All write requests are made by primary replicas and read requests are handled by backup replicas. The primary replica ensures that the request is also acknowledged by backup replicas to make sure no inconsistencies.
This will be useful in the case of banking where each operation is critical and hence we can't afford a single mistake.

2. Consensus Replication Protocols:

When any requests come more than half of the nodes need to acknowledge it successfully before getting it successful. Hence the name Consensus Protocol. eg: Raft and Paxos.

要查看或添加评论，请登录

Pushkal Goyal的更多文章

Merkle Tree :

2023年12月30日

Merkle Tree :

A concept popular in Distributed Systems. Merkle Tree is a binary tree used for easy search and secure verification of…
Interpreter and Compiler

2023年12月26日

Interpreter and Compiler

Python => interpreter & other languages like CPP => compiler-based languages. I didn't know the way I could see the…
Consistent Hashing

2023年12月25日

Consistent Hashing

It is one of the system design techniques used to optimize performance issues with horizontal hashing while scaling…
Circular Imports in Python

2023年12月24日

Circular Imports in Python

**packages : directory with __init__.py **modules : files with .
POSIX

2023年12月23日

POSIX

It stands for Portable Operating System Interface. This is a compliance introduced by the US government for procurement…

See all articles

Partitioning & Replication

Pushkal Goyal

.

Pushkal Goyal的更多文章

社区洞察

其他会员也浏览了

How we designed an effective data lake solution for a major healthcare provider—Creating a Single Source Solution for Efficient Medical Data Input

Data Sharding in Distributed Architectures: A Performance and Consistency Perspective

Availability vs Consistency in System Design part -4

Kafka Replication Protocol

Space-Based Architecture: Resolving Data Consistency, Performance, and Scalability Challenges in Distributed Systems

Application Design: Key Principles For Data-Intensive App Systems

Fixed Partitions (Design Pattern of Distributed Systems)

Key-Range Partitions (Design Pattern of Distributed Systems)

Low-Water Mark (LWM) (Design Pattern of Distributed Systems)

Pushkal Goyal的更多文章

Merkle Tree :

Interpreter and Compiler

Consistent Hashing

Circular Imports in Python

POSIX

社区洞察

其他会员也浏览了

How we designed an effective data lake solution for a major healthcare provider—Creating a Single Source Solution for Efficient Medical Data Input

Data Sharding in Distributed Architectures: A Performance and Consistency Perspective

Availability vs Consistency in System Design part -4

Kafka Replication Protocol

Space-Based Architecture: Resolving Data Consistency, Performance, and Scalability Challenges in Distributed Systems

Application Design: Key Principles For Data-Intensive App Systems

Fixed Partitions (Design Pattern of Distributed Systems)

Key-Range Partitions (Design Pattern of Distributed Systems)

Low-Water Mark (LWM) (Design Pattern of Distributed Systems)