Configuring MSK (Managed Streaming for Apache Kafka) on a Server
By Amit Yadav


Amazon Managed Streaming for Apache Kafka (MSK) is a fully managed service that makes it easy to build and run applications that use Apache Kafka for streaming data. This article will guide you through the steps required to configure MSK on a server.

Amazon Managed Streaming for Apache Kafka (MSK) Reference: https://aws.amazon.com/msk/

Prerequisites

  1. AWS Account: An active AWS account. If you don't have one, you can create it on the AWS website.
  2. AWS CLI: The AWS Command Line Interface (CLI) installed and configured with appropriate permissions.
  3. Apache Kafka: A basic understanding of Apache Kafka concepts (topics, producers, consumers).
  4. Linux Server: A Linux server (e.g., an EC2 instance) with sudo privileges.
  5. Java: Java 8 (java-1.8.0) installed, since the Kafka client tools require it.


Step 1: Install and Configure AWS CLI

Install the AWS CLI on your Linux server using the commands below. If it is already installed, make sure it is up to date.

Install AWS CLI

sudo apt update
sudo apt install awscli -y        

Configure AWS CLI

After installation, configure the AWS CLI with your credentials.

aws configure        

You will be prompted to enter your AWS Access Key ID, Secret Access Key, region, and output format.
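If you prefer not to use the interactive prompt, the AWS CLI can also read credentials from standard environment variables. A minimal sketch — all values below are placeholders (the access key pair shown is the well-known AWS documentation example, not a real credential):

```shell
# Non-interactive alternative to `aws configure`: the AWS CLI reads these
# standard environment variables. All values below are placeholders.
export AWS_ACCESS_KEY_ID="AKIAIOSFODNN7EXAMPLE"
export AWS_SECRET_ACCESS_KEY="wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY"
export AWS_DEFAULT_REGION="us-east-1"
export AWS_DEFAULT_OUTPUT="json"
echo "region set to $AWS_DEFAULT_REGION"
```

This is convenient in CI pipelines or scripts where an interactive prompt is not available; environment variables take precedence over the credentials file written by aws configure.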

Step 2: Create an MSK Cluster

Define the Configuration

Create a configuration JSON file named msk-config.json with the following content:

{
  "ClusterName": "AkMSKCluster",
  "BrokerNodeGroupInfo": {
    "InstanceType": "kafka.m5.large",
    "ClientSubnets": [
      "subnet-xxxxxxxx",
      "subnet-xxxxxxxx"
    ],
    "SecurityGroups": [
      "sg-xxxxxxxx"
    ]
  },
  "NumberOfBrokerNodes": 3,
  "EncryptionInfo": {
    "EncryptionInTransit": {
      "ClientBroker": "TLS",
      "InCluster": true
    }
  },
  "KafkaVersion": "2.8.0",
  "LoggingInfo": {
    "BrokerLogs": {
      "CloudWatchLogs": {
        "Enabled": true,
        "LogGroup": "/aws/msk/AkMSKCluster"
      }
    }
  }
}        

Note: ClusterName, InstanceType, and the other fields need to be adjusted to your requirements. Also make sure to replace subnet-xxxxxxxx and sg-xxxxxxxx with your actual subnet IDs and security group IDs.
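A malformed file will make create-cluster fail with a parse error, so it is worth validating the JSON first. A self-contained sketch — it writes a trimmed-down sample config so the check can run anywhere; in practice you would validate your real msk-config.json:

```shell
# Sanity-check msk-config.json before submitting it to the AWS CLI.
# A trimmed-down sample is written first so this snippet is self-contained.
cat > msk-config.json <<'EOF'
{
  "ClusterName": "AkMSKCluster",
  "NumberOfBrokerNodes": 3,
  "KafkaVersion": "2.8.0"
}
EOF
# python3 -m json.tool exits non-zero on invalid JSON, so the && only
# fires when the file parses cleanly.
python3 -m json.tool msk-config.json > /dev/null && echo "msk-config.json: valid JSON"
```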

Create the Cluster

Use the AWS CLI to create the MSK cluster with the configuration file, which might take some time.

aws kafka create-cluster --cli-input-json file://msk-config.json        

You can monitor the status of your cluster using:

aws kafka describe-cluster --cluster-arn <cluster-arn>        

Note: Replace <cluster-arn> with the ARN of your cluster, which you can obtain from the output of the previous command.
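The describe-cluster output is JSON, and the cluster is ready for client connections once ClusterInfo.State reports ACTIVE. A hedged sketch of extracting the state — it parses a saved sample response rather than calling the API, since a live call needs real credentials:

```shell
# Extract ClusterInfo.State from a describe-cluster response.
# The sample JSON below stands in for real API output; with live
# credentials you would redirect `aws kafka describe-cluster` output
# to describe-output.json instead.
cat > describe-output.json <<'EOF'
{"ClusterInfo": {"ClusterName": "AkMSKCluster", "State": "ACTIVE"}}
EOF
python3 -c "import json; print(json.load(open('describe-output.json'))['ClusterInfo']['State'])"
```

Re-run the check until the state moves from CREATING to ACTIVE before proceeding to the client setup below.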

Step 3: Configure Kafka Clients

Install Kafka Client

Step-1: Download Apache Kafka on your Linux server:

wget https://downloads.apache.org/kafka/2.8.0/kafka_2.12-2.8.0.tgz        

Step-2: Extract the archive and change into the resulting directory:

tar -xzf kafka_2.12-2.8.0.tgz
cd kafka_2.12-2.8.0        

Step-3: On your client machine, create a JAAS configuration file named "users_jaas.conf" that contains the user credentials stored in your secret:

KafkaClient {
    org.apache.kafka.common.security.scram.ScramLoginModule required
    username="username"
    password="pass";
};
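The same file can be generated from the shell with a quoted heredoc, which avoids copy-paste whitespace issues. A sketch — username and pass are placeholders for the credentials stored in your secret:

```shell
# Write users_jaas.conf from a quoted heredoc ('EOF') so the contents
# are taken literally, with no variable expansion.
# username/pass are placeholders for the real SCRAM credentials.
cat > users_jaas.conf <<'EOF'
KafkaClient {
    org.apache.kafka.common.security.scram.ScramLoginModule required
    username="username"
    password="pass";
};
EOF
grep -c 'ScramLoginModule' users_jaas.conf
```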

Step-4: Create a file named kafka.client.truststore.jks in the /tmp directory.

Use the following command to copy the JDK key store file from your JVM cacerts folder into the kafka.client.truststore.jks file that you created in the previous step.

cp /usr/lib/jvm/JDKFolder/jre/lib/security/cacerts /tmp/kafka.client.truststore.jks        

Note: Replace JDKFolder with the name of the JDK folder on your instance. For example, your JDK folder might be named java-1.8.0-openjdk-1.8.0.201.b09-0.amzn2.x86_64.

Step-5: Create a "client_sasl.properties" file inside the bin directory of the Kafka installation with the following contents.

security.protocol=SASL_SSL
sasl.mechanism=SCRAM-SHA-512
ssl.truststore.location=<path-to-keystore-file>/kafka.client.truststore.jks        

Step-6: Use the following command to export the path to your JAAS config file via the KAFKA_OPTS environment variable.

export KAFKA_OPTS="-Djava.security.auth.login.config=<path-to-jaas-file>/users_jaas.conf"
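Quoting the value is safer if the path ever contains spaces, and echoing the variable back is a quick way to confirm the export took effect. A sketch — the path below is an assumed example location, not one from this article:

```shell
# Export KAFKA_OPTS with the value quoted, then verify it.
# /home/ec2-user/users_jaas.conf is a placeholder path; substitute the
# actual location of the JAAS file you created in Step-3.
export KAFKA_OPTS="-Djava.security.auth.login.config=/home/ec2-user/users_jaas.conf"
echo "$KAFKA_OPTS"
```

Note that the export only applies to the current shell session; add it to ~/.bashrc if you want it to persist.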

Set Up Kafka Producer and Consumer

Move to the bin directory of the Kafka installation.

Usecase-1: Create a new topic:

./kafka-topics.sh --create --bootstrap-server <broker-1>:9096 --command-config client_sasl.properties --replication-factor 2 --partitions 1 --topic NewTopic        

Usecase-2: Start a Kafka producer:

./kafka-console-producer.sh --broker-list <broker-1>:9096 --producer.config client_sasl.properties --topic NewTopic        

Type some messages to produce to the topic.

Usecase-3: To consume these messages, start a Kafka consumer in another terminal:

./kafka-console-consumer.sh --bootstrap-server <broker-1>:9096 --consumer.config client_sasl.properties --topic NewTopic --from-beginning        

You should see the messages you produced in the consumer terminal.

Usecase-4: To list the topics in the cluster, use the command below:

./kafka-topics.sh --list --bootstrap-server <broker-1>:9096 --command-config client_sasl.properties
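Every one of these commands repeats the same connection flags, so collecting them in a variable cuts the noise. A sketch — the BOOTSTRAP value is a placeholder; in practice you would substitute a broker endpoint returned by `aws kafka get-bootstrap-brokers`:

```shell
# Assemble the repeated SASL_SSL connection flags once and reuse them.
# BOOTSTRAP is a placeholder broker endpoint, not a real hostname.
BOOTSTRAP="broker-1.example.amazonaws.com:9096"
KAFKA_CONN="--bootstrap-server $BOOTSTRAP --command-config client_sasl.properties"

# Preview the assembled topic-listing command.
echo "./kafka-topics.sh --list $KAFKA_CONN"
```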

Conclusion

By following these steps, you have configured MSK on your server using Linux commands: you created an MSK cluster, configured Kafka clients, and produced and consumed messages on a topic. This setup lets you utilize the power of Apache Kafka in a fully managed environment provided by AWS, ensuring scalability, security, and ease of management for your streaming data applications.


