Demystifying Latency: a critical aspect of Data-Intensive Scalable Architectures

It is not pragmatic to design and build systems with zero latency; even the human brain needs a pause to 'think.' Let's start with three essential questions: What is latency? How is it measured? And how does understanding latency help us design composable applications in two broad categories, Online (Real-time) and Batch (Offline-Data)?

  • Latency: Generally, latency is one of the opportunities that drives enormous improvements in scalable architecture design. It is defined as the delay between a request/action and the response (webpage/mobile app) to that action. In computer networking terms, it is often measured as the total round-trip time a data packet takes to travel.
  • Online (Real-time) vs. Batch (offline-data) application categorization: In my view, the acceptable latency threshold (in milliseconds) is the critical factor in categorizing an application as Real-time or Batch. (A latency-measurement sketch follows this list.)
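To make the definition concrete, here is a minimal sketch of measuring round-trip latency in Python. The endpoint URL and sample count are illustrative placeholders, not from the article:

```python
import time
import urllib.request

def measure_latency_ms(url: str, samples: int = 5) -> float:
    """Return the average round-trip latency in milliseconds."""
    timings = []
    for _ in range(samples):
        start = time.perf_counter()           # high-resolution timer
        urllib.request.urlopen(url).read()    # request + full response = one round trip
        timings.append((time.perf_counter() - start) * 1000)
    return sum(timings) / len(timings)

# Example with a hypothetical endpoint:
# print(f"avg latency: {measure_latency_ms('https://example.com'):.1f} ms")
```

If the average sits in the tens of milliseconds, the service can plausibly serve real-time traffic; if it drifts into seconds, the workload belongs in the batch/offline category.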

How is latency calculated/measured, and what factors contribute to high latency? Multiple layers (full stack) of application design:

  • Lack of edge computing: compute, DB, and storage are not hosted in the same availability zone (AZ), or compute/storage is dispersed across multiple data centers (in the on-premise case).
  • Frontend development (Code)

  • Static content (Gzip compression)
  • Choosing the React or Angular JavaScript framework/library according to the SPA (single-page application) requirements of the mobile or web app.
  • Backend development (code): function points, repetitive loops, and wrappers (e.g., a microservice fronting legacy code).
  • Network bandwidth and low processing power: a slower network, low-powered compute (worker nodes), heavy I/O ops, and the lack of an intelligent load balancer (ALB/ELB).
  • Middleware and security: rate limiting/DDoS protection, message transport.
  • Databases: SQL and NoSQL

  • Sharding/horizontal scaling (NoSQL)
  • Indexing for RDBMS (see the indexing sketch after this list)
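To make the indexing point concrete, here is a minimal, self-contained sketch using SQLite from Python's standard library (the table and column names are invented for illustration). It shows how an index turns a full table scan into an index search, which is exactly the kind of latency win the bullet above refers to:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)")
conn.executemany(
    "INSERT INTO orders (customer_id, total) VALUES (?, ?)",
    [(i % 1000, i * 1.5) for i in range(100_000)],
)

query = "SELECT * FROM orders WHERE customer_id = 42"

# Without an index, SQLite must scan every row.
print(conn.execute("EXPLAIN QUERY PLAN " + query).fetchall())   # -> SCAN orders

conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")

# With the index, SQLite searches the index B-tree instead.
print(conn.execute("EXPLAIN QUERY PLAN " + query).fetchall())   # -> SEARCH ... USING INDEX
```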


The figure below explains the different layers of a simple web application design:

[Figure: layers of a simple web application design]

  • How many steps does a simple ping of a URL perform in the background for us: L1 cache reference, L2 cache reference, branch mispredict, main memory reference, mutex lock/unlock, compress bytes, transport bytes over the network, read bytes sequentially from memory, round trip within the datacenter, disk seeks (read and write data), read bytes sequentially from the network, read bytes sequentially from disk, send packet/round trip. (Approximate figures are sketched after this list.)
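For rough orders of magnitude, the widely circulated "Latency Numbers Every Programmer Should Know" figures (commonly attributed to Jeff Dean) put those steps in perspective. These are approximate and hardware-dependent; treat them as relative comparisons, not measurements:

```python
# Approximate, order-of-magnitude latencies in nanoseconds, after the widely
# cited "Latency Numbers Every Programmer Should Know". Real values vary.
LATENCY_NS = {
    "L1 cache reference":                      0.5,
    "branch mispredict":                       5,
    "L2 cache reference":                      7,
    "mutex lock/unlock":                       25,
    "main memory reference":                   100,
    "compress 1 KB (fast compressor)":         3_000,
    "send 1 KB over 1 Gbps network":           10_000,
    "read 1 MB sequentially from memory":      250_000,
    "round trip within same datacenter":       500_000,
    "disk seek":                               10_000_000,
    "read 1 MB sequentially from network":     10_000_000,
    "read 1 MB sequentially from disk":        20_000_000,
    "send packet CA -> Netherlands -> CA":     150_000_000,
}
```

The spread matters more than any single number: a disk seek costs roughly 100,000 L1 cache references, which is why the rest of this article keeps returning to avoiding disk I/O.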
  • What other pragmatic factors and solutions need to be considered:
  • Sending data packets between data centers located far from each other generally takes longer and gives a slower response; sharing data/files/content globally is expensive (CDNs - content delivery networks, horizontal scaling).
  • Reading and writing data to disks (I/O ops): disk seeks are slower than memory (avoid disk seeks by caching as much as possible; a caching sketch follows this list).
  • Network bandwidth (compression algorithms, vertical scaling).
  • Compute (GPU over CPU for worker nodes when online/ETL processing speed matters to the business).
  • Choosing read and write data access patterns wisely: 'writes' are costlier than 'reads' (prefer DB engines with a Fractal Tree design over a B-Tree).
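As a sketch of the "cache to avoid disk seeks" advice above, here is an in-memory cache placed in front of a slow read path. The file path and function are illustrative assumptions, not part of the original article:

```python
from functools import lru_cache

@lru_cache(maxsize=1024)   # in-memory cache with least-recently-used eviction
def load_profile(user_id: int) -> bytes:
    # Illustrative slow path: each cache miss costs a disk seek + read.
    with open(f"/data/profiles/{user_id}.json", "rb") as f:
        return f.read()

# The first call for a given user_id hits the disk; repeated calls are
# served from memory at main-memory latency instead of disk latency.
```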


Here is the headroom equation, which is very useful for capacity estimation, measuring latency, and projecting headroom over a period of time. Moreover, this equation encourages the project team to continuously optimize the performance of the system in production.

Headroom = (Ideal Usage Percentage × Maximum Capacity) − Current Usage − Σ[Growth(t) − Optimization(Projects(t))]

*Headroom Time = Headroom / [Growth(t) − Optimization(Projects(t))]
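A minimal worked example of the headroom equation (all numbers are invented for illustration; Growth and Optimization are per-period projections):

```python
ideal_usage_pct = 0.5       # never plan to run above 50% of maximum capacity
max_capacity    = 10_000    # requests/sec the system can sustain
current_usage   = 3_000     # requests/sec today

growth       = [200, 220, 240, 260]   # projected added load per period
optimization = [50, 50, 100, 100]     # projected load removed by perf projects

headroom = (ideal_usage_pct * max_capacity) - current_usage \
           - sum(g - o for g, o in zip(growth, optimization))

net_growth_per_period = (sum(growth) - sum(optimization)) / len(growth)
headroom_time = headroom / net_growth_per_period   # periods until headroom runs out

print(headroom)        # 5000 - 3000 - 620 = 1380 requests/sec of headroom
print(headroom_time)   # ~8.9 periods before the system needs more capacity
```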


To design data-intensive applications and platforms, I want to explicitly explain the core principle: avoid the latency of reading data from and writing data to disk. After the hard drive, the SSD (solid-state disk) is an amicable solution, but it has limitations (high cost and limited scalability). Many processes and methods are used by NoSQL database engines, notably sharding (horizontal scaling) and caching (key-value pairs). Caching and sharding are distinct concepts that revolve around data access patterns. To dive deeper, 'consistent hashing' overcomes the biggest traditional horizontal-scaling challenges: what if we need to add more nodes, or a couple of nodes become inactive in the DB cluster? (A minimal sketch follows.)
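Here is a minimal consistent-hash ring sketch (node names and the virtual-node count are arbitrary choices for illustration). Because each node owns many small slices of the ring, adding or removing one node remaps only a proportional fraction of keys instead of reshuffling everything:

```python
import bisect
import hashlib

def _hash(key: str) -> int:
    """Map any string onto a fixed numeric ring."""
    return int(hashlib.md5(key.encode()).hexdigest(), 16)

class HashRing:
    def __init__(self, nodes, vnodes=100):
        # Each physical node appears 'vnodes' times on the ring, which
        # smooths the key distribution across nodes.
        self._ring = sorted((_hash(f"{node}#{i}"), node)
                            for node in nodes for i in range(vnodes))
        self._points = [p for p, _ in self._ring]

    def node_for(self, key: str) -> str:
        # Walk clockwise to the first point at or after the key's hash,
        # wrapping around to the start of the ring at the end.
        idx = bisect.bisect(self._points, _hash(key)) % len(self._ring)
        return self._ring[idx][1]

ring = HashRing(["db1", "db2", "db3"])
print(ring.node_for("user:42"))
# Adding "db4" later would move only roughly 1/4 of the keys; with naive
# modulo hashing (hash(key) % N), almost every key would move.
```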


3-D Scalable Solution- Cube:


The figure below explains the scalability rules in 3D:

  1. X-Axis: horizontal scaling, read replicas, services, and data replication. This is the cheapest solution.
  2. Y-Axis: scaling is done by splitting services, functions, and methods. It is costlier than the X-axis because compute/storage is scaled vertically to achieve high performance and fast response (CPU to GPU, SSD storage).
  3. Z-Axis: it keeps the customer's location and requests as the center point. It is the costliest solution. (A routing sketch follows this list.)
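A toy sketch of how X-axis and Z-axis routing differ in code. The replica names, shard count, and customer-id scheme are invented for illustration:

```python
import itertools

# X-axis: identical clones behind a balancer; any replica can serve any read.
read_replicas = ["replica-1", "replica-2", "replica-3"]
_round_robin = itertools.cycle(read_replicas)

def route_x_axis() -> str:
    return next(_round_robin)

# Z-axis: requests are pinned to the shard that owns that customer's data,
# keeping the customer (and their data locality) as the center point.
def route_z_axis(customer_id: int, shards: int = 4) -> str:
    return f"shard-{customer_id % shards}"

print(route_x_axis())        # replica-1, then replica-2, ...
print(route_z_axis(42))      # always shard-2 for customer 42
```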



'Fractal Tree Indexing' over the B-Tree data structure: a solution for reducing I/O ops (disk seeks):


Here comes the fantastic concept of 'Fractal Tree Indexing' from Tokutek, an alternative to the traditional B-Tree algorithm for reading and writing data at the leaf nodes (on disk). I like this algorithm for two remarkable aspects:

  1. The write-latency problem is solved.
  2. Data consistency is preserved (as soon as data is written, the read data access pattern will be consistent).

TokuDB, a storage engine for MySQL, is built using fractal tree indexing. TokuMX, Tokutek's distribution of MongoDB, also uses fractal tree indexing.

Before diving deep, let's start with the B-Tree data structure used to store large data blocks (RAM/disk). Beyond RAM/main memory, data needs to be written to disk at the B-tree leaf nodes. The Fractal Tree is like a B-tree in implementation; additionally, each internal node of a fractal tree index has a 'buffer.'

  • Write DataOps: upsert (update/insert) messages are temporarily stored in these buffers. Once a node's buffer is full, its contents are flushed to the child node's buffer; eventually the buffered data is written to the leaf nodes (disk), so there is no per-write disk latency. During a power outage or failure event, the buffers' data is serialized to disk, so messages in an internal node's buffer are not lost.
  • Read DataOps: data consistency is also maintained during read operations. Like a B-tree, the fractal tree indexing algorithm follows the same query path from the root node to the leaf node, checking pending buffer messages along the way; hence, each read, write, or update query sees the current data state. (A toy sketch follows this list.)
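A toy illustration of the buffered-write idea, not TokuDB's actual implementation: each internal node queues messages and flushes a full buffer one level down, so one flush amortizes many writes, and reads check buffers on the root-to-leaf path so they still see fresh data. (Real fractal trees route by key range; the hash here only keeps the sketch short.)

```python
class Node:
    BUFFER_SIZE = 4   # tiny for illustration; real buffers are block-sized

    def __init__(self, children=None):
        self.buffer = []           # pending upsert messages (internal nodes)
        self.children = children   # None marks a leaf
        self.leaf_data = {}        # stands in for data "on disk" at a leaf

    def upsert(self, key, value):
        if self.children is None:
            self.leaf_data[key] = value        # leaf: the actual disk write
            return
        self.buffer.append((key, value))       # internal node: just buffer it
        if len(self.buffer) >= self.BUFFER_SIZE:
            self.flush()

    def flush(self):
        # One bulk move per flush instead of one disk touch per write.
        for key, value in self.buffer:
            child = self.children[hash(key) % len(self.children)]
            child.upsert(key, value)
        self.buffer.clear()

    def get(self, key):
        # Reads check pending messages on the path, so they see fresh writes.
        for k, v in reversed(self.buffer):
            if k == key:
                return v
        if self.children is None:
            return self.leaf_data.get(key)
        return self.children[hash(key) % len(self.children)].get(key)

root = Node(children=[Node(), Node()])
for i in range(10):
    root.upsert(f"k{i}", i)
print(root.get("k9"))   # 9, whether it is still buffered or already at a leaf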


The above explains how fractal tree indexing can reduce disk I/O operations; 'disk seeks' are one of latency's critical aspects.


Conclusion: Global and scalable data-intensive designs sometimes add latency as the price of consistency and availability. Per the CAP (Consistency, Availability, and Partition tolerance) theorem, at most two of the three dimensions can be achieved, so we must compromise or trade off on one aspect. By reducing disk seeks (I/O), deploying edge and CDN solutions, applying caching across layers, having a cache-eviction policy (to raise the cache hit rate), and choosing the right sharding approach, one can build a low-latency solution. It is essential to calculate headroom* for each component of the architected system so that the seasonality (peak/headroom time) trend can be projected for a website or service. The key factors that add latency should be identified, and related solutions should be derived to minimize it.


References: The Art of Scalability (Martin L. Abbott & Michael T. Fisher)
