Token Bucket Algorithm in a Distributed AWS Lambda Environment

Gabriel L.

Software Engineer | Solution Architecture | Data integration Solutions | AWS Cloud native solutions with Python | Microservices architecture | Clean Code | Rest APIs | Api Gateway Governance | SOLID | TDD | YAGNI | DRY

Published: October 19, 2024

The token bucket algorithm is widely used for network traffic management: it controls the rate at which packets are transmitted based on token availability. Each token represents the right to transmit a certain volume of data. Tokens are added to a bucket at a constant rate and never exceed the bucket's capacity; once the bucket is full, newly generated tokens are discarded. Sending a packet consumes tokens, so if the bucket is empty, incoming packets must either wait for new tokens to become available or be dropped.
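As a point of reference before moving to Lambda, the classic single-process version of the algorithm can be sketched as follows. This is a minimal illustration, not code from the original project: the class name, the method names, and the injectable clock are all assumptions made for the example.

```python
import time


class TokenBucket:
    """Classic single-process token bucket (illustrative names only)."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate            # tokens added per second
        self.capacity = capacity    # maximum tokens the bucket can hold
        self.tokens = capacity      # start with a full bucket
        self.clock = clock          # injectable so the sketch is easy to test
        self.last_refill = clock()

    def _refill(self):
        now = self.clock()
        elapsed = now - self.last_refill
        # Add tokens for the elapsed time, never exceeding the capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last_refill = now

    def try_consume(self, amount=1):
        """Spend `amount` tokens and return True if enough are available."""
        self._refill()
        if self.tokens >= amount:
            self.tokens -= amount
            return True
        return False  # bucket too empty: the caller must wait or drop
```

A bucket created with rate=10 and capacity=10 allows a burst of 10 sends and then roughly 10 sends per second afterwards.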

Now, applying this concept to a distributed environment with AWS Lambda, we can design a solution to integrate with our vendors while keeping the throughput below the stipulated TPS (Transactions Per Second), offering a low-cost and scalable solution.

Distributed Token Bucket Model

In this model, an AWS Lambda function named "vendor-outbound-gateway-service" is responsible for making requests to the vendor's REST APIs.

Infrastructure Components:

SQS Queue: vendor-outbound-gateway-service-request-channel
SQS Dead Letter Queue (DLQ): vendor-outbound-gateway-service-request-channel-dlq
SNS Topic: vendor-outbound-gateway-service-reply-channel

Application Components:

AWS Lambda: vendor-outbound-gateway-service

All communication between the system and the vendor occurs through this Lambda service. Messages are passed through the SQS queue, and responses are handled via the SNS topic. Large payloads are stored in S3, and only the object keys are passed through the channels.
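The idea of passing only object keys for large payloads could look roughly like the sketch below. It is an assumption-laden illustration: the envelope fields ("inline", "s3_key"), the key prefix, the size threshold, and the put_object callable (standing in for an S3 upload) are all hypothetical and not taken from the original project.

```python
import json
import uuid


def build_message(payload, put_object, max_inline_bytes=200_000):
    """Return the message body to publish on the channel.

    `put_object(key, body)` is a hypothetical callable that stores the
    body under the given key (for example, a thin wrapper around S3).
    `max_inline_bytes` is an assumed threshold, not an AWS constant.
    """
    body = json.dumps(payload)
    if len(body.encode("utf-8")) <= max_inline_bytes:
        # Small payloads travel inside the message itself.
        return json.dumps({"inline": payload})
    # Large payloads are stored externally; only the key is sent.
    key = f"payloads/{uuid.uuid4()}.json"
    put_object(key, body)
    return json.dumps({"s3_key": key})
```

The consumer would do the reverse: if the message carries a key, fetch the object before processing it.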

The vendor imposes a rate limit of 10 TPS. To stay within this limit, the token bucket algorithm can be employed where each Lambda instance represents a token. Each instance has a fixed runtime (AWS Lambda timeout), and tokens (Lambda instances) are never generated beyond the system's capacity (concurrency). As instances finish processing, new tokens (Lambda instances) are made available.

Configuring the Token Bucket

Each token can process up to 10 requests, which corresponds to an SQS batch size of 10. For example, with an AWS Lambda concurrency setting of 1 (i.e., 1 token), each token can handle 10 requests at a time.

Increasing the concurrency to 2 tokens allows for 20 requests to be processed, and so on. However, this model assumes that each Lambda instance completes execution in exactly 1 second, enabling the processing of 10 requests per second (per instance).
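In AWS terms, the two knobs described here map to the function's reserved concurrency (the number of tokens) and the batch size of the SQS event source mapping (requests per token). The sketch below only builds the parameters; it assumes they would then be passed to boto3's Lambda client methods put_function_concurrency and create_event_source_mapping. The helper name and the queue ARN used in the usage note are placeholders.

```python
def build_throttle_config(function_name, queue_arn, tokens, batch_size=10):
    """Build keyword arguments for the two Lambda API calls (a sketch)."""
    # Reserved concurrency caps how many instances (tokens) can run at once.
    concurrency = {
        "FunctionName": function_name,
        "ReservedConcurrentExecutions": tokens,
    }
    # The batch size caps how many messages one invocation receives.
    event_source = {
        "EventSourceArn": queue_arn,
        "FunctionName": function_name,
        "BatchSize": batch_size,
    }
    return concurrency, event_source
```

For example, build_throttle_config("vendor-outbound-gateway-service", "arn:aws:sqs:us-east-1:123456789012:vendor-outbound-gateway-service-request-channel", tokens=2) describes the 2-token, 20-requests-at-a-time setup, with a made-up account ID and region.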

But real-world factors like latency complicate this. For example, if each request takes 5 seconds plus additional overhead for Lambda cold starts, deserialization, and response processing, the total processing time could reach 10 seconds. In this case, 10 requests every 10 seconds means that the throughput is effectively 1 TPS per token. To achieve 10 TPS, you would need 10 tokens.
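The arithmetic in this paragraph can be written down as a small helper. This is only a sketch with hypothetical function names; it rounds down so that the resulting concurrency never pushes the steady-state rate above the vendor's limit.

```python
import math


def per_token_tps(batch_size, batch_seconds):
    """Requests per second that a single token (Lambda instance) sustains."""
    return batch_size / batch_seconds


def max_tokens_within_limit(limit_tps, batch_size, batch_seconds):
    """Largest concurrency whose combined rate stays at or below the limit."""
    return math.floor(limit_tps * batch_seconds / batch_size)
```

With a batch of 10 requests taking 10 seconds, per_token_tps(10, 10) gives 1.0 and max_tokens_within_limit(10, 10, 10) gives 10, matching the numbers above. If the batch took only 1 second, the same limit would allow a single token.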

Calculating Visibility Timeout and Token Availability

In this scenario, the ideal visibility timeout for the SQS queue would be at least 15 seconds. This ensures that if a batch of messages is polled but no token is available, the messages will return to the queue after 15 seconds, allowing time for a new token to be released. Messages that are in flight will be processed within this window, ensuring that no duplicate messages are sent before their current processing is complete.
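One way to read the 15-second figure is as the expected batch processing time (10 seconds in the earlier example) plus a safety margin. The helper below sketches that reading; the function names and the 5-second default margin are assumptions. For comparison, the second helper reflects the more conservative guidance in AWS's Lambda documentation for SQS event sources, which suggests a visibility timeout of at least six times the function timeout.

```python
def visibility_timeout_from_batch(batch_seconds, margin_seconds=5):
    """Expected processing time plus a margin (the article's reasoning)."""
    return batch_seconds + margin_seconds


def visibility_timeout_from_function_timeout(function_timeout_seconds, factor=6):
    """AWS guidance for SQS event sources: several times the function timeout."""
    return function_timeout_seconds * factor
```

For a 10-second batch, the first helper yields 15 seconds, while the second yields 60 seconds for a 10-second function timeout. The longer value reduces the chance of duplicate deliveries at the cost of slower redelivery after a failure.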

Throughput Measurement

Each token performs a fixed number of requests to the vendor (up to 10 per token).
Each token has X seconds to process and respond before the visibility timeout forces a reattempt.

Throughput is calculated as:

Throughput = (requests per token × number of tokens available) / (time until the next token is available)

For example, with 5 tokens, each processing 10 requests in 10 seconds, the throughput would be: 10 (requests) * 5 (tokens) / 10 s (latency) = 5 TPS
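The throughput calculation can also be expressed as a one-line function, which makes it easy to try different latencies. The function and parameter names are illustrative.

```python
def throughput_tps(requests_per_token, tokens, seconds_per_batch):
    """Requests per second across all tokens for a given batch duration."""
    return requests_per_token * tokens / seconds_per_batch
```

throughput_tps(10, 5, 10) evaluates to 5.0 TPS. Keeping the same 5 tokens but halving the batch duration to 5 seconds doubles the result to 10.0 TPS, which shows how sensitive the model is to latency.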

Problems with this Solution

Variable Latency and Unpredictability: Factors like Lambda cold starts, network delays, or vendor response times can cause requests to take longer or shorter than expected, leading to fluctuating throughput. One mitigation is dynamic rate limiting: adjusting the rate of requests based on real-time vendor response times.
Exceeding Vendor Rate Limits Due to Faster Token Release: If tokens (Lambda instances) finish faster than anticipated, you risk breaching the vendor's 10 TPS limit.
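Long Queue Time and Message Duplication: If no tokens are available, messages wait in the queue and processing is delayed. A message whose visibility timeout expires before its processing completes can also be delivered again, producing duplicates.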
Handling Vendor Feedback: The vendor may return 429 (Too Many Requests) or retry-later headers, which need to be handled gracefully.
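One possible guard against tokens being released too quickly is to make every invocation last at least a minimum duration, sleeping for whatever time is left after the batch is sent. The sketch below assumes this padding approach, which is not described in the original design; the function names and the send callable (standing in for the vendor request) are hypothetical.

```python
import time


def min_batch_seconds(batch_size, tokens, limit_tps):
    """Shortest batch duration that keeps all tokens within the limit."""
    return batch_size * tokens / limit_tps


def process_batch_paced(records, send, min_seconds,
                        clock=time.monotonic, sleep=time.sleep):
    """Send every record, then pad the invocation up to `min_seconds`."""
    started = clock()
    results = [send(record) for record in records]
    remaining = min_seconds - (clock() - started)
    if remaining > 0:
        # Hold the token until the minimum duration has elapsed.
        sleep(remaining)
    return results
```

With 10 tokens, batches of 10, and a 10 TPS limit, min_batch_seconds(10, 10, 10) is 10.0 seconds. The trade-off is that the sleep time is billed as Lambda execution time, and the function timeout has to be longer than the minimum duration.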

By refining this approach with dynamic rate limiting, backoff strategies, and appropriate timeout configurations, you can ensure smooth and efficient integration with the vendor while staying within their rate limits.
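A backoff strategy that respects vendor feedback might be sketched like this: prefer the vendor's Retry-After value when it is a number of seconds, and otherwise fall back to exponential backoff with full jitter. The function name, the defaults, and the choice of full jitter are assumptions made for the example.

```python
import random


def retry_delay_seconds(attempt, retry_after=None, base=1.0, cap=60.0,
                        rng=random.random):
    """Delay before retry number `attempt` (0-based), in seconds."""
    if retry_after is not None:
        try:
            # Honor the vendor's hint when it is expressed in seconds.
            return max(0.0, float(retry_after))
        except ValueError:
            # Retry-After can also be an HTTP date; this sketch does not
            # parse dates and falls back to exponential backoff instead.
            pass
    ceiling = min(cap, base * (2 ** attempt))
    return rng() * ceiling  # full jitter: uniform in [0, ceiling)
```

In this architecture, the computed delay could be applied by changing the message's visibility timeout in SQS, so that the message only reappears after the vendor is ready to accept it again.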

https://github.com/GabrielSlima/token-bucket-algorithm-aws-lambda

