登录查看更多内容

Full Power of Vector Unleashed | In-Depth Look at Adaptive Request Concurrency Mechanism

Greptime

Industry-leading time series database for IoT and Observability with real-time insights and lower costs at any scale.

发布日期: 2025年2月12日

Introduction of Vector

Vector, a high-performance end-to-end observability data pipeline written in Rust, frequently encounters scenarios where upstream and downstream processing rates don't match. Often, Vector's actual throughput exceeds the processing capacity of downstream databases like Elasticsearch or ClickHouse.

To prevent overloading downstream services, Vector typically employs a rate-limiting mechanism. For example, in the HTTP Sink, users can configure request.rate_limit_duration_secs and request.rate_limit_num to control the rate. Below is an example where Vector limits the downstream rate to 10 ops(operations per second):

sinks:
  my-sink:
    request:
      rate_limit_duration_secs: 1
      rate_limit_num: 10

However, static rate limiting isn't a perfect solution, as it struggles to adapt to dynamic conditions.

Why Adaptive Request Concurrency is Needed

There are two major challenges with static rate limiting:

Under-utilization of Resources: If the rate limit is set too low, the downstream database, which is capable of handling more traffic, may experience under-utilization of its resources.
Overloading Downstream Databases: Conversely, if the rate limit is set too high, excessive traffic may overwhelm the downstream database, causing cascading system failures.

The optimal rate is always constrained by factors like the number of Vector instances, the capacity of downstream services, and the volume of data being sent. Static rate limiting can't dynamically adjust to these ever-changing factors.

Limitations of Static Rate Limiting

The following diagram illustrates the limitations of static rate limiting:

Adaptive Request Concurrency: Overcoming the Static Rate Limiting Bottleneck

To address the limitations of static rate limiting, the Vector team designed an Adaptive Request Concurrency mechanism (ARC), inspired by the experiences shared in Netflix's blog post, Performance Under Load. ARC dynamically adjusts the data flow rate based on current system statistics, maximizing system throughput and preventing the resource wastage and overload issues associated with static rate limiting.

领英推荐

The January 2024 MinIO Newsletter

MinIO 1 年前

FiftyOne Computer Vision Community Update – October…

Voxel51 1 年前

??GovCon Insights by G2Xchange | 11-28-23

G2X - The GovCon Growth Platform 1 年前

How ARC Works

When ARC is enabled, Vector monitors two key metrics to dynamically adjust the flow control strategy:

RTT (Round Trip Time): Reflects the latency of the requests, indicating the response speed.
HTTP Response Codes: Indicates the success or failure of requests, with error codes (e.g., 429 and 503) providing insights into the health of downstream services.

Using these metrics, Vector employs the AIMD (Additive Increase Multiplicative Decrease) algorithm to adjust the traffic:

Additive Increase: If RTT is stable or decreasing and HTTP response codes indicate success (e.g., 200 OK), it suggests that the downstream service can handle more traffic, prompting Vector to linearly increase the throughput. For example, it may increase operations per second from 10 to 11, then 12, and so on.
Multiplicative Decrease: If RTT increases or HTTP response codes return errors (e.g., 429 Too Many Requests or 503 Service Unavailable), it indicates that the downstream service is overloaded. Vector then exponentially reduces the traffic, for example, reducing the operations per second from 20 to 10, and then to 5. The rate of decrease can be configured using request.adaptive_concurrency.decrease_ratio.

The following diagram illustrates how Vector dynamically adjusts traffic in ARC mode based on system load:

(Figure 2: Vector's Decision Logic in ARC Mode)

AIMD Algorithm and TCP Congestion Control

The AIMD algorithm is derived from TCP congestion control and is widely used in network protocols to effectively balance bandwidth utilization and network stability. In Vector, AIMD helps maintain high throughput while preventing system crashes caused by overload, ensuring system stability.

ARC is implemented as a Tower Layer in Vector. For those interested, the implementation details can be explored in this file.

Configuring ARC

ARC is currently supported by all HTTP data-receiving sinks. Taking ClickHouse Sink as an example, users can enable Adaptive Request Concurrency with the following configuration:

sinks:
  clickhouse_internal:
    type: clickhouse
    inputs:
      - log_stream_1
      - log_stream_2
    host: https://clickhouse-prod:8123
    table: prod-log-data
    request:
      concurrency: adaptive

With this configuration, Vector will use Adaptive Request Concurrency to write data to ClickHouse. After enabling ARC, Vector will automatically adjust the data transmission rate based on real-time system conditions, ensuring that the system remains stable and efficient, even under varying loads.

Adaptive Request Concurrency (ARC) is an effective solution

Adaptive Request Concurrency (ARC) is an effective solution for handling mismatched processing rates between upstream and downstream services in modern complex systems. By monitoring RTT and HTTP response codes in real-time, Vector can dynamically adjust the request rate, maximizing resource utilization and ensuring the health of downstream databases. Compared to static rate limiting, ARC provides a more flexible, intelligent approach that improves the overall efficiency and reliability of the system while maintaining stability.

Full Power of Vector Unleashed | In-Depth Look at Adaptive Request Concurrency Mechanism

Greptime

Industry-leading time series database for IoT and Observability with real-time insights and lower costs at any scale.

Introduction of Vector

Why Adaptive Request Concurrency is Needed

Limitations of Static Rate Limiting

Adaptive Request Concurrency: Overcoming the Static Rate Limiting Bottleneck

领英推荐

How ARC Works

AIMD Algorithm and TCP Congestion Control

Configuring ARC

Adaptive Request Concurrency (ARC) is an effective solution

Observability Unlocked

305 位关注者

Greptime的更多文章

其他会员也浏览了

Top trends in Big Data for 2024 and beyond

Introducing Milvus 2.5: Built-in Full-Text Search and More!

The Guide To Stream Processing

Unveiling the Future: 6 Data Engineering Trends for 2024 and Beyond

How LLMs are Automating Data Engineering Tasks?

Edition 54: Was the MoD Cyberattack Avoidable?

How to implement Consistent Hashing

The Power of Databricks Data Intelligence Platform

Leveraging AI & Automation in Data Engineering: 4 Essential Frameworks for Adoption

Hello 2023 and hello data transformation

Introduction of Vector

Why Adaptive Request Concurrency is Needed

Limitations of Static Rate Limiting

Adaptive Request Concurrency: Overcoming the Static Rate Limiting Bottleneck

领英推荐

How ARC Works

AIMD Algorithm and TCP Congestion Control

Configuring ARC

Adaptive Request Concurrency (ARC) is an effective solution

Observability Unlocked

305 位关注者

Greptime的更多文章

How Vector Remap Enhances Log Data Parsing and Storage in Observability

VictoriaLogs Source?Reading

Streamlining Log Management with OpenTelemetry - Best Practices for Capturing, Parsing, and Storing Logs

Choosing the Right Log Aggregation Tool for Performance, Compression, and Cost Efficiency

What is Log Aggregation? Key Factors to consider for a good Log Management System

An OpenTelemetry Python Example — Building a Tesla Data Monitor

Scaling Prometheus: Deploying a Database Cluster for Long-Term Storage in Kubernetes

Are you Listening? - Emit and Capture Signals with OpenTelemetry Instrumentation in NodeJS

What is OpenTelemetry —— an Introduction for Beginners

What is Semantic Convention in Observability and Why it Matters

其他会员也浏览了

Top trends in Big Data for 2024 and beyond

Introducing Milvus 2.5: Built-in Full-Text Search and More!

The Guide To Stream Processing

Unveiling the Future: 6 Data Engineering Trends for 2024 and Beyond

How LLMs are Automating Data Engineering Tasks?

Edition 54: Was the MoD Cyberattack Avoidable?

How to implement Consistent Hashing

The Power of Databricks Data Intelligence Platform

Leveraging AI & Automation in Data Engineering: 4 Essential Frameworks for Adoption

Hello 2023 and hello data transformation