Is Kafka good only for big data streaming, and should event-driven systems use only Azure Service Bus?
Piyush Porwal
Tech Enthusiast & Learner| Engineering Manager at Microsoft | Career Growth Mentor | Architect | Sharing Food For Thoughts
Recently, a colleague and I were discussing a problem where we had to decide between Kafka and Azure Service Bus, and we learned some good concepts while trying to answer that question. I'm sharing them here to explain which one fits which use case, and why.
Let's first learn the basics and architecture of both.
When you work at Microsoft, Service Bus tends to become the default technology for event-based programming. It supports the pub-sub approach, where a publisher can send messages to different topics and each subscriber can opt to receive only the ones it is interested in. This lets two services within one application communicate with low coupling while each service plans its own scaling needs, which is a huge win when building an event-driven distributed application. I talked about communication designs in one of my videos here as well; feel free to check it out for other options.
High-level architecture:
Messages in queues are ordered and timestamped on arrival. Once the broker accepts a message, it is held durably in triple-redundant storage, spread across availability zones if the namespace is zone-enabled. Service Bus never leaves messages in memory or volatile storage after they have been reported to the client as accepted.
Features:
On top of being a basic queue, Service Bus supports various features: not only topic-based message delivery, but also capabilities such as auto-forwarding and dead-lettering.
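To make the topic-based delivery concrete, here is a minimal pub-sub sketch using the same Microsoft.Azure.ServiceBus package as the queue sample later in this article. The topic name, subscription name, and connection string are placeholder assumptions, not values from a real setup:

using System;
using System.Text;
using System.Threading.Tasks;
using Microsoft.Azure.ServiceBus;

class TopicPubSubSketch
{
    static async Task Main()
    {
        string connectionString = "YOUR_SERVICE_BUS_CONNECTION_STRING";
        string topicName = "orders";                 // hypothetical topic
        string subscriptionName = "billing-service"; // hypothetical subscription

        // Publisher: send a message to the topic
        var topicClient = new TopicClient(connectionString, topicName);
        await topicClient.SendAsync(new Message(Encoding.UTF8.GetBytes("order created")));
        await topicClient.CloseAsync();

        // Subscriber: receive only the messages routed to this subscription
        var subscriptionClient = new SubscriptionClient(connectionString, topicName, subscriptionName);
        subscriptionClient.RegisterMessageHandler(
            async (message, token) =>
            {
                Console.WriteLine($"Received: {Encoding.UTF8.GetString(message.Body)}");
                // Complete the message so it is removed from the subscription
                await subscriptionClient.CompleteAsync(message.SystemProperties.LockToken);
            },
            new MessageHandlerOptions(exArgs =>
            {
                Console.WriteLine($"Handler error: {exArgs.Exception.Message}");
                return Task.CompletedTask;
            })
            { AutoComplete = false, MaxConcurrentCalls = 1 });

        Console.WriteLine("Listening... press any key to exit.");
        Console.ReadKey();
    }
}

Each subscription gets its own copy of the messages published to the topic, which is what lets two services consume the same event stream without being coupled to each other.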
Designing a Scalable Architecture with Service Bus
Now, how does this benefit me as an application developer looking to build a scalable architecture? Let's consider a scenario where a web application needs to communicate with a backend service to process orders. We'll use Azure Service Bus to facilitate this communication.
using System;
using Microsoft.Azure.ServiceBus;
using System.Text;
using System.Threading.Tasks;

class ServiceBusMessageSender
{
    static async Task Main(string[] args)
    {
        string serviceBusConnectionString = "YOUR_SERVICE_BUS_CONNECTION_STRING";
        string queueName = "YOUR_QUEUE_NAME";

        IQueueClient queueClient = new QueueClient(serviceBusConnectionString, queueName);

        // Create a new message
        string messageBody = "Your order message data";
        Message message = new Message(Encoding.UTF8.GetBytes(messageBody));

        try
        {
            // Send the message to the queue
            await queueClient.SendAsync(message);
            Console.WriteLine("Message sent to the queue successfully.");
        }
        catch (Exception ex)
        {
            Console.WriteLine($"An error occurred: {ex.Message}");
        }
        finally
        {
            // Close the queue client
            await queueClient.CloseAsync();
        }
    }
}
With this setup, Azure Service Bus gives us reliable, asynchronous communication between the components of our architecture, ensuring seamless order processing and scalability for the application. The sending side is only half the picture, though; the backend service also needs to receive and process these messages, as sketched below.
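On the receiving side, the backend order-processing service could register a message handler so that Service Bus pushes each message to it as it arrives. This is only a sketch, assuming the same placeholder connection string and queue name as the sender above:

using System;
using System.Text;
using System.Threading.Tasks;
using Microsoft.Azure.ServiceBus;

class ServiceBusOrderProcessor
{
    static void Main()
    {
        string serviceBusConnectionString = "YOUR_SERVICE_BUS_CONNECTION_STRING";
        string queueName = "YOUR_QUEUE_NAME";

        IQueueClient queueClient = new QueueClient(serviceBusConnectionString, queueName);

        // Service Bus invokes this handler for every message that arrives on the queue
        queueClient.RegisterMessageHandler(
            async (message, cancellationToken) =>
            {
                string orderData = Encoding.UTF8.GetString(message.Body);
                Console.WriteLine($"Processing order: {orderData}");

                // Complete the message so it is not redelivered after the lock expires
                await queueClient.CompleteAsync(message.SystemProperties.LockToken);
            },
            new MessageHandlerOptions(exArgs =>
            {
                Console.WriteLine($"Message handler error: {exArgs.Exception.Message}");
                return Task.CompletedTask;
            })
            { AutoComplete = false, MaxConcurrentCalls = 2 });

        Console.WriteLine("Listening for order messages. Press any key to exit.");
        Console.ReadKey();
    }
}

Because the web application only enqueues the order and returns, the backend can scale its processing independently, which is exactly the decoupling we were after.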
Moving to Apache Kafka
Kafka is a well-known name in the IT industry, so it may not need much of an introduction here, but let's understand some of the key areas that make it shine for building real-time streaming data pipelines compared to RabbitMQ or Amazon Kinesis:
Events in Kafka represent occurrences in the world or in a business and are composed of a key, a value, a timestamp, and optional metadata headers. Producers publish events to Kafka, while consumers subscribe to and process them. Kafka's design allows for high scalability by decoupling producers and consumers, ensuring no waiting time for either party. Events are stored in topics, akin to folders in a filesystem, and are durably retained based on configurable settings. Topics are partitioned across Kafka brokers for scalability, with events written to partitions based on their keys. Replication ensures fault tolerance and high availability by maintaining multiple copies of data across brokers.
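To make the partitioning and replication part concrete, here is a minimal sketch that creates a topic spread across several partitions with multiple replicas using Confluent.Kafka's admin client. The broker address, topic name, and the partition and replica counts are illustrative assumptions (the topic name matches the trading example that follows):

using System;
using System.Threading.Tasks;
using Confluent.Kafka;
using Confluent.Kafka.Admin;

class CreatePartitionedTopic
{
    static async Task Main()
    {
        var adminConfig = new AdminClientConfig { BootstrapServers = "localhost:9092" };

        using (var adminClient = new AdminClientBuilder(adminConfig).Build())
        {
            try
            {
                // Spread the topic across 6 partitions and keep 3 copies of each
                // partition on different brokers for fault tolerance.
                await adminClient.CreateTopicsAsync(new[]
                {
                    new TopicSpecification
                    {
                        Name = "trading_topic",
                        NumPartitions = 6,
                        ReplicationFactor = 3
                    }
                });
                Console.WriteLine("Topic created.");
            }
            catch (CreateTopicsException e)
            {
                Console.WriteLine($"Topic creation failed: {e.Results[0].Error.Reason}");
            }
        }
    }
}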
Now let’s use it to write a trading app:
using Confluent.Kafka;
using Newtonsoft.Json;
using System;
using System.Threading;
using System.Threading.Tasks;

class Program
{
    static async Task Main(string[] args)
    {
        // Kafka broker configuration
        var config = new ProducerConfig
        {
            BootstrapServers = "localhost:9092"
        };

        // Initialize Kafka producer
        using (var producer = new ProducerBuilder<Null, string>(config).Build())
        {
            // Sample data for trading
            var tradeData = new
            {
                Symbol = "AAPL",
                Price = 150.20,
                Quantity = 100
            };

            // Serialize trade data to JSON
            var tradeJson = JsonConvert.SerializeObject(tradeData);

            // Publish a trade message
            await producer.ProduceAsync("trading_topic", new Message<Null, string> { Value = tradeJson });
        }

        // Initialize Kafka consumer configuration
        var consumerConfig = new ConsumerConfig
        {
            BootstrapServers = "localhost:9092",
            GroupId = "trading_group",
            AutoOffsetReset = AutoOffsetReset.Earliest
        };

        // Initialize Kafka consumer
        using (var consumer = new ConsumerBuilder<Ignore, string>(consumerConfig).Build())
        {
            // Subscribe to the trading topic
            consumer.Subscribe("trading_topic");

            // Consume trade messages indefinitely
            while (true)
            {
                try
                {
                    var consumeResult = consumer.Consume(CancellationToken.None);
                    Console.WriteLine($"Received trade message: {consumeResult.Message.Value}");
                }
                catch (ConsumeException e)
                {
                    Console.WriteLine($"Error occurred: {e.Error.Reason}");
                }
            }
        }
    }
}
Which one should you use as the messaging system for your use case?
The choice is tough to make, because both provide really good foundations for scalable, fault-tolerant, event-driven designs. Earlier, I used to believe Kafka was a good fit only for log streaming or big data streaming use cases, but that turns out to be inaccurate. While ASB is a popular choice for the applications I work on, and it is indeed a great solution, there is no doubt that Kafka supports a lot of those use cases too.
One place where Kafka has a huge benefit is the large community using it in the industry. Service Bus, on the other hand, has a really good .NET ecosystem and Azure-grade documentation. Kafka suits big data needs really well, whereas Service Bus preserves message ordering, which Kafka guarantees only at the partition level.
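That said, if the ordering you need is per entity rather than global, Kafka's partition-level ordering is often enough: give every message a key, and all messages with the same key land on the same partition and are consumed in the order they were produced. A minimal sketch, reusing the trading topic and broker address assumed in the example above, with the stock symbol as the key:

using System;
using System.Threading.Tasks;
using Confluent.Kafka;

class KeyedTradeProducer
{
    static async Task Main()
    {
        var config = new ProducerConfig { BootstrapServers = "localhost:9092" };

        using (var producer = new ProducerBuilder<string, string>(config).Build())
        {
            // Messages that share a key always go to the same partition,
            // so all AAPL trades are consumed in the order they were produced.
            await producer.ProduceAsync("trading_topic",
                new Message<string, string> { Key = "AAPL", Value = "buy 100 @ 150.20" });
            await producer.ProduceAsync("trading_topic",
                new Message<string, string> { Key = "AAPL", Value = "sell 50 @ 151.05" });
        }
    }
}

This keeps per-symbol ordering intact while still spreading the overall load across partitions.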