Publishing Events - Outbox Pattern

In a Microservice architecture, services not only update their own local data store but also need to notify other services within the organization about the changes that have occurred. This is where Event-driven architecture comes into its own, and Apache Kafka has become the de-facto standard for capturing and storing these change records: individual services publish the changes as events to a Kafka topic, which other services can then consume.

One of the functional design requirements was to de-couple the Kafka producing logic from the individual Microservices while still reliably publishing events to Kafka, thereby avoiding technical concerns such as handling connection failures to the Kafka cluster, writing retry logic, and dealing with throttling when too many requests are made. The security team also wanted to avoid embedding the Kafka client TLS certificates in the Microservices (Lambda) layer.

The services had to be designed and built without any direct interaction with the underlying Confluent platform, i.e. the Apache Kafka brokers and the Schema Registry, thus keeping any messaging constructs out of the service code.

Furthermore, all the API-called events (success and failure) had to be captured from the different sources within the solution and published to a single Kafka topic.

We decided to leverage the Outbox pattern and implemented it using the recommended two-step process: first, commit the message to a persistent data store (an Outbox table); then have a separate service poll the Outbox table and publish the message to an Apache Kafka topic.

The idea of this pattern is to have an “Outbox” table: instead of publishing the API-called events directly to Kafka, the messages are written to the Outbox table in an AVRO-compatible format. Data is collected from all the source systems (Microservices + Logs Consumer Service, as shown in the architecture) and is committed to the Outbox table. This also allowed us to consume, transform and collect both the success events (2XX) and the failure events (4XX & 5XX logs available on a separate Kafka topic from API Gateway) and publish them to a single location (i.e. the Outbox table). Another component (the Producer Service, as shown in the architecture) then polls these events asynchronously and publishes them to Kafka. Once a message has been published successfully to the Kafka topic, it is marked in the DB as published.
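To make the table shape concrete, here is a minimal sketch of what such an Outbox entity could look like with JPA. The class and column names (OutboxEvent, payload, published_date_time) are illustrative assumptions, not the actual implementation:

    import java.time.OffsetDateTime;
    import java.util.UUID;
    import javax.persistence.Column;
    import javax.persistence.Entity;
    import javax.persistence.Id;
    import javax.persistence.Table;

    // Hypothetical Outbox entity: one row per event waiting to be published to Kafka.
    @Entity
    @Table(name = "outbox_event")
    public class OutboxEvent {

        @Id
        private UUID id;

        // Target Kafka topic for the event (one of the metadata columns).
        @Column(name = "topic", nullable = false)
        private String topic;

        // Event payload serialized as JSON and stored in a PostgreSQL jsonb column.
        // Note: mapping a Java String to jsonb usually needs a custom Hibernate type
        // or the JDBC "stringtype=unspecified" connection setting.
        @Column(name = "payload", columnDefinition = "jsonb", nullable = false)
        private String payload;

        // Stays null until the Producer Service has successfully published the event.
        @Column(name = "published_date_time")
        private OffsetDateTime publishedDateTime;

        // Getters and setters omitted for brevity.
    }

Keeping the published timestamp nullable makes the "unpublished" query trivial and doubles as an audit trail of when each event actually reached Kafka.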


Design

  1. The AVRO schema file used to publish events to Kafka is stored locally alongside the microservices, as the design goal was to not integrate with the Schema Registry. This schema file is used to create the Event records in the Outbox table (a sketch of loading such a local schema follows after this list).
  2. All API-called data is stored in a jsonb column in the Outbox table. This allows messages to be stored as JSON while stripping out any insignificant whitespace.
  3. Along with other metadata fields such as the Kafka topic name, the Outbox table schema is modelled with a published date-time column. The Producer Service updates this column once a message has been read and published to Kafka, thus allowing at-least-once delivery of the event (see the polling sketch after this list).
  4. DB pessimistic transaction locks with timeout support, provided by Spring Data JPA, are enabled when reading data from the Outbox table. This ensures only one worker can process a given message/record at a time and subsequently update the published column in the DB.
  5. As the solution is deployed on AWS, the IAM database authentication feature is enabled, which allows the Lambda functions (Microservices) and the ECS containers (Logs Consumer & Producer Service) to use an authentication token instead of a password when connecting to the DB instance (see the token-generation sketch after this list).
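For design point 1, loading the locally packaged schema and building an event record could look roughly like the sketch below; the field names requestId, statusCode and body are assumptions for illustration only:

    import java.io.File;
    import java.io.IOException;
    import org.apache.avro.Schema;
    import org.apache.avro.generic.GenericData;
    import org.apache.avro.generic.GenericRecord;

    public class EventRecordFactory {

        private final Schema schema;

        // Parse the AVRO schema that is packaged with the microservice;
        // no call to the Schema Registry is ever made.
        public EventRecordFactory(File schemaFile) throws IOException {
            this.schema = new Schema.Parser().parse(schemaFile);
        }

        // Build an AVRO-compatible record that is later serialized to JSON
        // and written into the Outbox table's jsonb column.
        public GenericRecord newEvent(String requestId, int statusCode, String body) {
            GenericRecord record = new GenericData.Record(schema);
            record.put("requestId", requestId);
            record.put("statusCode", statusCode);
            record.put("body", body);
            return record;
        }
    }

A microservice would typically construct such a factory once at start-up and call newEvent(...) for every API call it handles.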
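For design points 3 and 4, a rough sketch of the Producer Service's poll-publish-mark loop, assuming Spring Data JPA, Spring Kafka's KafkaTemplate, and the hypothetical OutboxEvent entity from the earlier sketch (with getters/setters), might be:

    import java.time.OffsetDateTime;
    import java.util.List;
    import java.util.UUID;
    import javax.persistence.LockModeType;
    import javax.persistence.QueryHint;
    import org.springframework.data.jpa.repository.JpaRepository;
    import org.springframework.data.jpa.repository.Lock;
    import org.springframework.data.jpa.repository.Query;
    import org.springframework.data.jpa.repository.QueryHints;
    import org.springframework.kafka.core.KafkaTemplate;
    import org.springframework.scheduling.annotation.Scheduled;
    import org.springframework.stereotype.Service;
    import org.springframework.transaction.annotation.Transactional;

    interface OutboxEventRepository extends JpaRepository<OutboxEvent, UUID> {

        // Pessimistic write lock with a timeout so that only one worker
        // can process a given batch of unpublished rows at a time.
        @Lock(LockModeType.PESSIMISTIC_WRITE)
        @QueryHints(@QueryHint(name = "javax.persistence.lock.timeout", value = "3000"))
        @Query("select e from OutboxEvent e where e.publishedDateTime is null order by e.id")
        List<OutboxEvent> findUnpublished();
    }

    @Service
    class OutboxPublisher {

        private final OutboxEventRepository repository;
        private final KafkaTemplate<String, String> kafkaTemplate;

        OutboxPublisher(OutboxEventRepository repository, KafkaTemplate<String, String> kafkaTemplate) {
            this.repository = repository;
            this.kafkaTemplate = kafkaTemplate;
        }

        // Poll the Outbox table, publish each event to its topic, then mark it as published.
        // A crash between send and save re-delivers the event on the next poll,
        // i.e. at-least-once semantics.
        @Scheduled(fixedDelayString = "${outbox.poll-interval-ms:5000}")
        @Transactional
        public void publishPending() {
            for (OutboxEvent event : repository.findUnpublished()) {
                try {
                    // Block until Kafka acknowledges the record before marking it as published.
                    kafkaTemplate.send(event.getTopic(), event.getId().toString(), event.getPayload()).get();
                } catch (Exception e) {
                    // Roll back the transaction; the row stays unpublished and is retried later.
                    throw new IllegalStateException("Kafka publish failed for event " + event.getId(), e);
                }
                event.setPublishedDateTime(OffsetDateTime.now());
                repository.save(event);
            }
        }
    }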
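For design point 5, generating a short-lived IAM authentication token to use in place of a DB password might look roughly like the following, using the AWS SDK for Java v2; the hostname, port, region and user below are placeholders:

    import software.amazon.awssdk.auth.credentials.DefaultCredentialsProvider;
    import software.amazon.awssdk.regions.Region;
    import software.amazon.awssdk.services.rds.RdsUtilities;
    import software.amazon.awssdk.services.rds.model.GenerateAuthenticationTokenRequest;

    public class IamDbTokenExample {

        public static void main(String[] args) {
            // The token is signed with the caller's IAM credentials and is short-lived;
            // it is handed to the JDBC driver in place of a static database password.
            RdsUtilities utilities = RdsUtilities.builder()
                    .region(Region.EU_WEST_1)
                    .credentialsProvider(DefaultCredentialsProvider.create())
                    .build();

            String authToken = utilities.generateAuthenticationToken(GenerateAuthenticationTokenRequest.builder()
                    .hostname("my-outbox-db.cluster-example.eu-west-1.rds.amazonaws.com")
                    .port(5432)
                    .username("outbox_app_user")
                    .build());

            System.out.println("Use this token as the DB password: " + authToken);
        }
    }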

Benefits

This pattern ensures at-least-once delivery of messages to Kafka. The events are persisted in the “Outbox” table indefinitely, with the ability to replay them in the future in case the messages are no longer available on the Kafka topic, thus allowing any offline Kafka consumers to read the messages at will. This approach also allows taking regular backups of the DB before cleaning up the messages. The messages can also be moved to low-cost storage such as AWS S3, which is where the DB backups are stored anyway.

Limitations

This approach has a few limitations, though.

  1. Any updates/breaking changes to the schema would not be detected until the messages are polled from the Outbox table and published to the Kafka topic, and such changes need to be applied across all the services. AVRO schemas support fingerprints, which allow tagging each message with the schema version it was written under. A solution is to record the fingerprint along with the event in the Outbox table and implement business rules in the Producer Service to filter the messages and publish them in AVRO-compatible formats (see the sketch after this list).
  2. Each individual Microservice could potentially end up having its own Outbox table within its own domain, to avoid sharing the data published by every Microservice through a single table. Improvement: an option here is to proxy the publishing logic by introducing a REST-ful interface in front of the Outbox table, making it easier to publish messages via a single service endpoint that controls and manages the Events datastore.
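As a rough sketch of the fingerprint idea from the first limitation, Apache Avro's SchemaNormalization utility can compute a 64-bit fingerprint of a schema's canonical form; storing that value next to each row lets the Producer Service detect which schema version an event was written with (the inline schema below is illustrative only):

    import org.apache.avro.Schema;
    import org.apache.avro.SchemaNormalization;

    public class SchemaFingerprintExample {

        public static void main(String[] args) {
            // Illustrative schema; in practice the locally stored .avsc file would be parsed instead.
            Schema schema = new Schema.Parser().parse(
                    "{\"type\":\"record\",\"name\":\"ApiEvent\",\"fields\":["
                            + "{\"name\":\"requestId\",\"type\":\"string\"},"
                            + "{\"name\":\"statusCode\",\"type\":\"int\"}]}");

            // 64-bit fingerprint of the schema's parsing canonical form; recorded in the
            // Outbox table alongside the event so schema changes can be detected at publish time.
            long fingerprint = SchemaNormalization.parsingFingerprint64(schema);
            System.out.println("Schema fingerprint: " + fingerprint);
        }
    }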

