Kafka and Kafka Connect

In this guide, you can learn the following foundational information about Apache Kafka and Kafka Connect:

  • What Apache Kafka and Kafka Connect are
  • What problems Apache Kafka and Kafka Connect solve
  • Why Apache Kafka and Kafka Connect are useful
  • How data moves through an Apache Kafka and Kafka Connect pipeline

Apache Kafka

Apache Kafka is an open source publish/subscribe messaging system. Apache Kafka provides a flexible, fault tolerant, and horizontally scalable system to move data between datastores and applications. A system is fault tolerant if it can continue operating even when some of its components stop working. A system is horizontally scalable if it can be expanded to handle larger workloads by adding more machines rather than by upgrading a single machine's hardware.
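
To make the publish/subscribe model concrete, here is a minimal sketch using the official Apache Kafka Java client: a producer publishes a record to a topic, and a consumer that subscribes to that topic receives it. The broker address (localhost:9092), the topic name (orders), and the consumer group ID (order-readers) are placeholder assumptions, not values prescribed by this guide.

import java.time.Duration;
import java.util.List;
import java.util.Properties;

import org.apache.kafka.clients.consumer.ConsumerConfig;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.serialization.StringDeserializer;
import org.apache.kafka.common.serialization.StringSerializer;

public class PubSubSketch {
    public static void main(String[] args) {
        // Publish one record to the "orders" topic (placeholder topic name).
        Properties producerProps = new Properties();
        producerProps.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        producerProps.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());
        producerProps.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());
        try (KafkaProducer<String, String> producer = new KafkaProducer<>(producerProps)) {
            producer.send(new ProducerRecord<>("orders", "order-1", "{\"item\": \"book\", \"qty\": 2}"));
        } // closing the producer flushes any pending sends

        // Subscribe to the same topic and print the records that were published.
        Properties consumerProps = new Properties();
        consumerProps.put(ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        consumerProps.put(ConsumerConfig.GROUP_ID_CONFIG, "order-readers");
        consumerProps.put(ConsumerConfig.AUTO_OFFSET_RESET_CONFIG, "earliest");
        consumerProps.put(ConsumerConfig.KEY_DESERIALIZER_CLASS_CONFIG, StringDeserializer.class.getName());
        consumerProps.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG, StringDeserializer.class.getName());
        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(consumerProps)) {
            consumer.subscribe(List.of("orders"));
            // Poll a few times to allow for group assignment before records arrive.
            for (int i = 0; i < 5; i++) {
                ConsumerRecords<String, String> records = consumer.poll(Duration.ofSeconds(2));
                records.forEach(r -> System.out.println(r.key() + " -> " + r.value()));
            }
        }
    }
}

The producer and consumer are decoupled: the producer only knows the topic it writes to, and any number of consumers can subscribe to that topic independently.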

For more information on Apache Kafka, see the official Apache Kafka documentation at kafka.apache.org.

Kafka Connect

Kafka Connect is a component of Apache Kafka that solves the problem of connecting Apache Kafka to datastores such as MongoDB. Kafka Connect solves this problem by providing the following resources:

  • A fault tolerant runtime for transferring data to and from datastores.
  • A framework for the Apache Kafka community to share solutions for connecting Apache Kafka to different datastores.

The Kafka Connect framework defines an API for developers to write reusable connectors. Connectors enable Kafka Connect deployments to interact with a specific datastore as a data source or a data sink. The MongoDB Kafka Connector is one of these connectors.
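
To show what a connector looks like in practice, the following is a minimal sketch that registers a MongoDB source connector with a Kafka Connect worker through the worker's REST API (it uses a Java text block, so it assumes Java 15 or later). The worker address (localhost:8083), the MongoDB connection URI, and the connector, database, collection, and topic prefix names are placeholder assumptions.

import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class RegisterMongoSourceConnector {
    public static void main(String[] args) throws Exception {
        // A connector configuration is a set of key/value properties that tells
        // the Kafka Connect runtime which connector class to run and how to
        // reach the datastore. All names and URIs below are placeholders.
        String payload = """
            {
              "name": "example-mongo-source",
              "config": {
                "connector.class": "com.mongodb.kafka.connect.MongoSourceConnector",
                "connection.uri": "mongodb://localhost:27017",
                "database": "sales",
                "collection": "orders",
                "topic.prefix": "example"
              }
            }
            """;

        // Submit the configuration to a Kafka Connect worker's REST API
        // (port 8083 by default). The worker starts the connector, which then
        // publishes MongoDB change events to a topic named from the prefix,
        // database, and collection (here, "example.sales.orders").
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest request = HttpRequest.newBuilder()
                .uri(URI.create("http://localhost:8083/connectors"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(payload))
                .build();
        HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.statusCode() + " " + response.body());
    }
}

Once the worker accepts the configuration, the Connect runtime, not your application code, is responsible for running the connector, tracking its progress, and restarting it after failures.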

For more information on Kafka Connect, see the Kafka Connect section of the official Apache Kafka documentation.

Use Kafka Connect instead of Producer/Consumer Clients when Connecting to Datastores

While you could write your own application to connect Apache Kafka to a specific datastore using producer and consumer clients, Kafka Connect may be a better fit for you. Here are some reasons to use Kafka Connect:

  • Kafka Connect has a fault tolerant distributed architecture to ensure a reliable pipeline.
  • There are a large number of community-maintained connectors that use the Kafka Connect framework to connect Apache Kafka to popular datastores such as MongoDB, PostgreSQL, and MySQL. Using these connectors reduces the amount of boilerplate code you need to write and maintain to handle database connections, error handling, dead letter queue integration, and other concerns involved in connecting Apache Kafka to a datastore.
  • You have the option to use a managed Kafka Connect cluster from Confluent.

Diagram

The following diagram shows how information flows through an example data pipeline built with Apache Kafka and Kafka Connect. The example pipeline uses a MongoDB cluster as its data source and a MongoDB cluster as its data sink.

All connectors and datastores in the example pipeline are optional, and you can swap them out for the connectors and datastores you need for your deployment.
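
To complete the example pipeline, a MongoDB sink connector subscribes to the topic that the source connector writes to and inserts each record into the sink cluster. The following minimal sketch registers such a sink connector with the same Kafka Connect REST endpoint used in the earlier source example; the hostnames, topic, database, and collection names are placeholder assumptions chosen to match that example.

import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class RegisterMongoSinkConnector {
    public static void main(String[] args) throws Exception {
        // Sink half of the example pipeline: subscribe to the topic written by
        // the source connector and write each record into the sink MongoDB
        // cluster. Hostnames, database, and collection names are placeholders.
        String payload = """
            {
              "name": "example-mongo-sink",
              "config": {
                "connector.class": "com.mongodb.kafka.connect.MongoSinkConnector",
                "topics": "example.sales.orders",
                "connection.uri": "mongodb://sink-mongodb:27017",
                "database": "sales",
                "collection": "orders"
              }
            }
            """;

        // Register the sink with the same Kafka Connect REST endpoint used for
        // the source connector (a Connect worker assumed at localhost:8083).
        HttpResponse<String> response = HttpClient.newHttpClient().send(
                HttpRequest.newBuilder()
                        .uri(URI.create("http://localhost:8083/connectors"))
                        .header("Content-Type", "application/json")
                        .POST(HttpRequest.BodyPublishers.ofString(payload))
                        .build(),
                HttpResponse.BodyHandlers.ofString());
        System.out.println(response.statusCode() + " " + response.body());
    }
}

With both connectors registered, data flows from the source MongoDB cluster into an Apache Kafka topic and from that topic into the sink MongoDB cluster, without any custom producer or consumer code.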
