Azure Event Hub

Azure Event Hub

Azure Event Hubs is a distributed stream processing platform and event ingestion service managed by Microsoft. Event Hubs gives organizations a fully-managed solution to receive and process millions of events per second (without concerning about infrastructure and SLA). Azure Event Hubs is truly distinctive vs. Apache Kafka as it has a seamless integration with other services in Azure.

Event hubs is a highly scalable telemetry service offering one-way communication with the HTTP/AMQP protocol. You can send events from anywhere: a website, an app, an IoT device, a software, etc. Azure Event Hubs is distinct from Azure IoT Hub as communication is one way and not two ways.

When should Azure Event Hubs be used?

Event Hubs is the component to use for real-time and/or streaming data use cases:

  • Real-time reporting
  • Capture streaming data into files for further processing and analysis – e.g. capturing data from micro-service applications or a mobile app
  • Make data available to stream-processing and analytics services – e.g. when scoring an AI algorithm
  • Telemetry streaming & processing
  • Application logging

Key concepts

  • An Event Hub Producer Client is a source of telemetry data, diagnostics information, usage logs, or other log data, as part of an embedded device solution, a mobile device application, a game title running on a console or other device, some client or server-based business solution, or a web site.
  • An EventHub Consumer Client picks up such information from the Event Hub and processes it. Processing may involve aggregation, complex computation, and filtering. Processing may also involve distribution or storage of the information in a raw or transformed fashion. Event Hub consumers are often robust and high-scale platform infrastructure parts with built-in analytics capabilities, like Azure Stream Analytics, Apache Spark, or Apache Storm.
  • A partition is an ordered sequence of events that is held in an Event Hub. Azure Event Hubs provides message streaming through a partitioned consumer pattern in which each consumer only reads a specific subset, or partition, of the message stream. As newer events arrive, they are added to the end of this sequence. The number of partitions is specified at the time an Event Hub is created and cannot be changed.
  • A consumer group is a view of an entire Event Hub. Consumer groups enable multiple consuming applications to each have a separate view of the event stream, and to read the stream independently at their own pace and from their own position. There can be at most 5 concurrent readers on a partition per consumer group; however, it is recommended that there is only one active consumer for a given partition and consumer group pairing. Each active reader receives all of the events from its partition; if there are multiple readers on the same partition, then they will receive duplicate events.

The W’s of Azure Event Hubs

The W’s section emphasizes on three pillar actions “Why, When and Where” Azure Event Hubs comes into play.

Why?

  • Azure Event Hubs allows you to raise a data pipeline capable of processing a huge number of events per second with low latency.
  • It can process data from parallel sources and connect them to different infrastructures and services.
  • It supports repeated replay of stored data.

When?

  • To validate many publishers and to save the events in a Blob Storage or Data Lake.
  • When you want to get timely insights on the business application.
  • To obtain reliable messaging or flexibility for Big Data Applications.
  • For seamless integration with data and analytics services to create big data pipeline.

Where?

  • Anomaly Detection
  • Application Logging
  • Archiving data
  • Telemetry processing
  • Live Dashboarding

要查看或添加评论,请登录

Rohit Singh的更多文章

  • Delta Lake

    Delta Lake

    A Delta Lake is an open-source storage layer designed to run on top of an existing data lake and improve its…

  • API Testing

    API Testing

    API testing is a process that involves making requests to an API endpoint and verifying the response. It's also known…

  • SAP MM

    SAP MM

    SAP MM stands for "Materials Management." SAP MM (Materials Management) is a SAP ERP Central Component (ECC) module…

  • Gap analysis

    Gap analysis

    A gap analysis is a method of assessing the performance of a business unit to determine whether business requirements…

  • Azure Cognitive Services

    Azure Cognitive Services

    Microsoft Azure Cognitive Services provides a variety of pre-trained powerful AI tools and models that gives the…

  • UX Design

    UX Design

    User experience (UX) design is the process of creating products that are easy and enjoyable to use. This includes…

  • Apache Hadoop YARN

    Apache Hadoop YARN

    Apache Hadoop YARN (Yet Another Resource Negotiator) is a system for managing resources and scheduling tasks in a…

  • Eventbridge

    Eventbridge

    AWS EventBridge is a serverless event bus service that makes it easier to build event-driven applications by allowing…

  • SOC

    SOC

    A security operations center (SOC) is a central team that oversees and manages an organization's security stance. This…

  • SQL Queries

    SQL Queries

    Structured query language (SQL) is a programming language for storing and processing information in a relational…

社区洞察

其他会员也浏览了