System Design Basics: Caching

Thanks to the original creator:

https://medium.com/geekculture/system-design-basics-caching-46b1614915f8

What is Cache?

A cache is a small, fast-access local store that holds frequently accessed data. Caching is the technique of storing copies of frequently used application data in a layer of smaller, faster memory in order to improve data retrieval times, throughput, and compute costs.

Why do we use Cache?

Caching is based on the principle of locality: frequently accessed data is kept close to the system. The two kinds of locality are:

  • Temporal locality (time-based locality): data that has been referenced recently is likely to be referenced again soon; the same data is accessed repeatedly within a short time span.
  • Spatial locality (space-based locality): data stored near recently referenced data is also likely to be referenced; it applies to data items that are close together in memory, e.g. elements stored together in an array.
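To make the two kinds of locality concrete, here is a small Python illustration; the array and the access pattern are invented for the example:

```python
# Spatial vs. temporal locality, illustrated with an invented workload.

data = list(range(1_000_000))

# Spatial locality: summing an array walks neighbouring memory
# locations in order, so each chunk of data fetched is fully used.
total = sum(data)

# Temporal locality: the same few "hot" keys are read over and over
# in a short window, so keeping them in a fast local store pays off.
hot_keys = [0, 1, 2]
for _ in range(1000):
    for k in hot_keys:
        _ = data[k]
```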

How does a Cache work?

When a request comes to the system, there are two scenarios. If a copy of the data exists in the cache, it's called a cache hit; when the data has to be fetched from the primary data store, it's called a cache miss. The performance of a cache is measured by its hit ratio: the number of cache hits out of the total number of requests.
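As a minimal sketch of this hit/miss flow (the dict-based cache and the `load_from_primary` stand-in are assumptions for the example):

```python
# A minimal cache-aside lookup with hit-ratio tracking. The cache is a
# dict; load_from_primary stands in for a real database call.

cache = {}
hits = 0
misses = 0

def load_from_primary(key):
    # Placeholder for a slow database/network read.
    return f"value-for-{key}"

def get(key):
    global hits, misses
    if key in cache:          # cache hit
        hits += 1
        return cache[key]
    misses += 1               # cache miss: fall back to the primary store
    value = load_from_primary(key)
    cache[key] = value        # populate the cache for future requests
    return value

for key in ["a", "b", "a", "a"]:
    get(key)

print(f"hit ratio = {hits / (hits + misses):.2f}")  # 0.50 here
```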

Types of Cache

  • Application server cache: Placing a cache directly on an application server enables local storage of response data. Each time a request reaches the service, the server returns the locally cached data if it exists; otherwise, the node queries the primary store. The cache can live both in memory (very fast) and on the node's local disk (still faster than going to network storage). The problem with this type of caching arises in a distributed system: each instance has its own cache that knows nothing about the other instances, which leads to many cache misses.
  • Distributed cache: Each node holds a part of the whole cache space, and a consistent hashing function routes each request to the node where its cached data lives (sketched below).
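Here is a minimal sketch of a consistent-hash ring in Python, without virtual nodes for brevity; the node names and the choice of MD5 are illustrative assumptions:

```python
import bisect
import hashlib

def _hash(key: str) -> int:
    # Map a key to a position on the ring.
    return int(hashlib.md5(key.encode()).hexdigest(), 16)

class HashRing:
    def __init__(self, nodes):
        # Place each node on the ring at its hash position.
        self.ring = sorted((_hash(n), n) for n in nodes)

    def node_for(self, key: str) -> str:
        h = _hash(key)
        # First node clockwise from the key's position, wrapping around.
        idx = bisect.bisect(self.ring, (h, "")) % len(self.ring)
        return self.ring[idx][1]

ring = HashRing(["cache-1", "cache-2", "cache-3"])
print(ring.node_for("user:42"))  # the node responsible for this key
```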


  • Global cache: A global cache is a single cache in front of all the server instances. It is shared by every instance and fetches data from the primary store when the data is not present in the cache itself. It adds a network hop of latency, but cache performance is considerably better because every instance benefits from data cached by the others.


  • CDN: A CDN (Content Delivery Network) is used where a large amount of static content is served: HTML, CSS, and JavaScript files, images, videos, etc. A request first asks the CDN for the data; if the CDN has it, the data is returned. If not, the CDN queries the backend servers, caches the response locally, and then serves it.
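A rough sketch of that edge-lookup flow, with a TTL to bound staleness; `fetch_from_origin` and the 60-second TTL are assumptions for the example:

```python
import time

# Sketch of an edge cache with a TTL, mimicking how a CDN node serves
# static content. fetch_from_origin stands in for the backend server.

TTL_SECONDS = 60
edge_cache = {}  # path -> (expires_at, body)

def fetch_from_origin(path):
    return f"<contents of {path}>"

def serve(path):
    entry = edge_cache.get(path)
    if entry and entry[0] > time.time():
        return entry[1]                       # fresh copy at the edge
    body = fetch_from_origin(path)            # miss or expired: go to origin
    edge_cache[path] = (time.time() + TTL_SECONDS, body)
    return body

print(serve("/static/app.css"))  # first request hits the origin
print(serve("/static/app.css"))  # served from the edge cache
```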


Cache Writing Policies

A cache policy is a set of rules that define how data is loaded into (and evicted from) a cache. A cache holds copies of data and is thus transient storage, so when writing we need to decide when to write to the cache and when to write to the primary data store.

The most common cache writing policies are as follows:

  • Write-through caching: Under this scheme, data is written to the cache and the corresponding database at the same time. The cache ensures fast retrieval, and since data is written simultaneously to the cache and the primary data store, the two are always consistent. However, since every write operation has to be done twice, writes incur extra latency.
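A minimal sketch of write-through, with two dicts standing in for the cache and the primary data store:

```python
# Write-through: every write goes to the cache and the primary store
# in the same operation, so the two never diverge.

cache = {}
primary = {}

def write_through(key, value):
    cache[key] = value    # write to the cache...
    primary[key] = value  # ...and to the primary store, synchronously

write_through("user:1", {"name": "Ada"})
assert cache["user:1"] == primary["user:1"]  # always consistent
```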


  • Write-around caching: In write-around caching, data is written directly to the primary data store, bypassing the cache; the cache is only populated when the data is later read on a cache miss. The downside is that a read of recently written data will miss the cache and pay the cost of going to the primary store, but the primary data store is always consistent and the cache is not flooded with write data that may never be read.
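A minimal sketch of write-around under the same dict-based stand-ins:

```python
# Write-around: writes bypass the cache and go straight to the primary
# store; the cache is only filled when a later read misses.

cache = {}
primary = {}

def write_around(key, value):
    primary[key] = value           # cache is not touched on write

def read(key):
    if key in cache:
        return cache[key]
    value = primary[key]           # first read after a write misses
    cache[key] = value             # populate the cache on the way back
    return value

write_around("user:1", {"name": "Ada"})
assert "user:1" not in cache       # not cached until it is read
read("user:1")
assert "user:1" in cache
```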




  • Write-back caching: In write-back caching, data is written only to the cache at first and to the primary data store later. There are two common variants: either the primary store is updated asynchronously shortly after the cache write, or the entry is persisted when it is evicted from a full cache. Until then the entry is tagged with a dirty bit to mark that it is out of sync with the primary store. Write-back caching is prone to data loss (dirty entries vanish if the cache fails before flushing) and should only be used for write-heavy workloads where write speed is very important.
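A minimal sketch of write-back with an explicit dirty set; here the flush is triggered manually, whereas a real cache would flush asynchronously or on eviction:

```python
# Write-back: writes land in the cache only, the entry is marked dirty,
# and dirty entries are flushed to the primary store later. A crash
# before flushing loses those writes.

cache = {}     # key -> value
dirty = set()  # keys not yet persisted
primary = {}

def write_back(key, value):
    cache[key] = value
    dirty.add(key)                 # out of sync with the primary store

def flush():
    for key in list(dirty):
        primary[key] = cache[key]  # persist the deferred writes
    dirty.clear()

write_back("user:1", {"name": "Ada"})
assert "user:1" not in primary     # not yet persisted
flush()
assert primary["user:1"] == cache["user:1"]
```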


Cache Eviction Policies

Cache eviction policies define a set of rules that decide what data must be removed when the cache is full and a new entry is to be added.

A good replacement policy will ensure that the cached data is as relevant as possible to the application, that is, it utilises the principle of locality to optimise for cache hits.

A cache eviction policy should therefore be chosen based on how the application accesses its data. Some of the popular cache eviction algorithms are:

  • First In First Out (FIFO): Evicts the entry that was added to the cache first, regardless of how often it has been accessed.
  • Least Recently Used (LRU): Evicts the least recently used items first.
  • Most Recently Used (MRU): Evicts the most recently used items first.
  • Least Frequently Used (LFU): Counts how often an entry was read from cache. Those that are used least often are discarded first.
  • Random Replacement (RR): Randomly selects a candidate item and discards it to make space when necessary.
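As an example, LRU is commonly implemented with a hash map plus a recency ordering. A minimal Python sketch using `OrderedDict` (the capacity of 2 and the keys are made up):

```python
from collections import OrderedDict

# A minimal LRU cache: each access moves the key to the end, so the
# least recently used entry is always at the front and is the one
# evicted when capacity is exceeded.

class LRUCache:
    def __init__(self, capacity: int):
        self.capacity = capacity
        self.data = OrderedDict()

    def get(self, key):
        if key not in self.data:
            return None
        self.data.move_to_end(key)         # mark as most recently used
        return self.data[key]

    def put(self, key, value):
        if key in self.data:
            self.data.move_to_end(key)
        self.data[key] = value
        if len(self.data) > self.capacity:
            self.data.popitem(last=False)  # evict the least recently used

cache = LRUCache(2)
cache.put("a", 1)
cache.put("b", 2)
cache.get("a")         # "a" is now the most recently used
cache.put("c", 3)      # evicts "b", the least recently used
assert cache.get("b") is None
```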

Redis: Distributed Caching


Redis is an in-memory data store that is most often used as a distributed cache. It offers a variety of efficient data structures designed for very fast access to your data, and it serves as the caching layer in many distributed systems.

Redis can also persist to disk, so the cache isn't lost if the server restarts, and it can be run as a cluster that spreads the cache across multiple servers. Technically Redis can even serve as your primary database, although it is more often used as a cache to reduce the load on more feature-rich databases that are meant for persisting data.

Redis is written in C, and its core runs a single-threaded event loop, so it does not have the multithreading capabilities of Java-based counterparts such as Hazelcast. Redis stands out in supporting additional data structures such as sorted sets, hashes, and sets, along with a pub/sub mechanism, and it is extensible via Lua scripting. Of the two products, it is probably the more popular and widely used, especially outside of the Java ecosystem.
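As a minimal cache-aside sketch with the redis-py client, assuming a Redis server on localhost; `load_user` is a hypothetical stand-in for the primary database:

```python
import json

import redis  # the redis-py client: pip install redis

r = redis.Redis(host="localhost", port=6379, db=0)

def load_user(user_id):
    # Hypothetical stand-in for a query against the primary database.
    return {"id": user_id, "name": "Ada"}

def get_user(user_id):
    key = f"user:{user_id}"
    cached = r.get(key)
    if cached is not None:                # cache hit
        return json.loads(cached)
    user = load_user(user_id)             # cache miss: hit the database
    r.set(key, json.dumps(user), ex=300)  # cache for 5 minutes
    return user
```

The `ex=300` expiry keeps stale entries from lingering indefinitely, which is a common complement to an explicit eviction policy.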

