Chapter 3. Getting started
Streams for Apache Kafka is distributed in a ZIP file that contains installation artifacts for the Kafka components.
The Kafka Bridge has separate installation files. For information on installing and using the Kafka Bridge, see Using the Streams for Apache Kafka Bridge.
3.1. Installation environment
Streams for Apache Kafka runs on Red Hat Enterprise Linux. The host (node) can be a physical or virtual machine (VM). Use the installation files provided with Streams for Apache Kafka to install Kafka components. You can install Kafka in a single-node or multi-node environment.
- Single-node environment
- A single-node Kafka cluster runs instances of Kafka components on a single host. This configuration is not suitable for a production environment.
- Multi-node environment
- A multi-node Kafka cluster runs instances of Kafka components on multiple hosts.
We recommend that you run Kafka and other Kafka components, such as Kafka Connect, on separate hosts. Running the components this way makes each component easier to maintain and upgrade.
Kafka clients establish a connection to the Kafka cluster using the `bootstrap.servers` configuration property. If you are using Kafka Connect, for example, the Kafka Connect configuration properties must include a `bootstrap.servers` value that specifies the hostname and port of the hosts where the Kafka brokers are running. If the Kafka cluster is running on more than one host with multiple Kafka brokers, you specify a hostname and port for each broker. Each Kafka broker is identified by a `node.id`.
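For example, a component connecting to a three-broker cluster lists all of the brokers; the hostnames below are placeholders for your own broker addresses:

```properties
# Placeholder hostnames; replace with the addresses of your own Kafka brokers
bootstrap.servers=kafka0.example.com:9092,kafka1.example.com:9092,kafka2.example.com:9092
```

Listing more than one broker lets a client bootstrap its connection even if one of the named hosts is unavailable.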
3.1.1. Data storage considerations
Apache Kafka stores records in data logs, which are configured using the log.dirs property. For more information, see Section 5.3.2, “Data logs”.
Efficient data storage is essential for Streams for Apache Kafka to operate effectively, and block storage is strongly recommended. Streams for Apache Kafka has been tested only with block storage; file storage solutions, such as NFS, are not guaranteed to work.
Common block storage options include:
- Cloud-based block storage solutions, such as:
  - Amazon EBS (for AWS)
  - Azure Disk Storage (for Microsoft Azure)
  - Persistent Disk (for Google Cloud)
- Persistent storage (for bare-metal deployments) using local persistent volumes
- Storage Area Network (SAN) volumes accessed by protocols such as Fibre Channel or iSCSI
3.1.2. File systems
Kafka uses a file system for storing messages. Streams for Apache Kafka is compatible with the XFS and ext4 file systems, which are commonly used with Kafka. Consider the underlying architecture and requirements of your deployment when choosing and setting up your file system.
For more information, refer to Filesystem Selection in the Kafka documentation.
3.1.3. Disk usage
Solid-state drives (SSDs), though not essential, can improve the performance of Kafka in large clusters where data is sent to and received from multiple topics asynchronously.
Replicated storage is not required, as Kafka provides built-in data replication.
3.2. Downloading Streams for Apache Kafka
A ZIP file distribution of Streams for Apache Kafka is available for download from the Red Hat website. You can download the latest version of Red Hat Streams for Apache Kafka from the Streams for Apache Kafka software downloads page.
- For Kafka and other Kafka components, download the `amq-streams-<version>-kafka-bin.zip` file.
- For Kafka Bridge, download the `amq-streams-<version>-bridge-bin.zip` file. For installation instructions, see Using the Streams for Apache Kafka Bridge.
3.3. Setting up the Kafka installation directory and user
After downloading Streams for Apache Kafka, set up the directory structure and configure a dedicated user to manage the Kafka installation. This provides proper permissions and a consistent environment for running Kafka.
The following setup is assumed throughout the rest of this guide, but you can adapt it to suit your specific requirements.
- Installation directory
- Install Kafka in the `/opt/kafka/` directory to maintain a standard location. This guide assumes all Kafka commands are executed from this directory.
- Dedicated Kafka user
- Create a system user, such as `kafka`, to manage the installation. Change ownership of the `/opt/kafka/` directory to this user to ensure secure access. You can use any username that fits your environment.
3.4. Installing Kafka
Use the Streams for Apache Kafka ZIP files to install Kafka on Red Hat Enterprise Linux. You can install Kafka in either a single-node or multi-node environment. This procedure focuses on installing a single Kafka instance on a single host (node). For this setup, Kafka is installed in the /opt/kafka/ directory, and a dedicated Kafka user (kafka) is used to manage the installation.
The Streams for Apache Kafka installation files include the binaries for running other Kafka components, like Kafka Connect, Kafka MirrorMaker 2, and Kafka Bridge. In a single-node environment, you can run these components from the same host where you installed Kafka. However, we recommend that you add the installation files and run other Kafka components on separate hosts.
Prerequisites
- You have downloaded the installation files.
- You have reviewed the supported configurations in the Streams for Apache Kafka 2.9 on Red Hat Enterprise Linux Release Notes.
- You are logged in to Red Hat Enterprise Linux as the admin (`root`) user.
Procedure
Install Kafka on your host.

Add a new Kafka user and group:

```shell
groupadd kafka
useradd -g kafka kafka
passwd kafka
```

This step creates the necessary user and group for managing Kafka.

Extract the contents of the `amq-streams-<version>-kafka-bin.zip` file and place them in the `/opt/kafka` directory:

```shell
unzip amq-streams-<version>-kafka-bin.zip -d /opt
mv /opt/kafka*redhat* /opt/kafka
```

Change the ownership of the `/opt/kafka` directory to the Kafka user:

```shell
chown -R kafka:kafka /opt/kafka
```

Create the `/var/lib/kafka` directory for storing Kafka data and change its ownership to the Kafka user:

```shell
mkdir /var/lib/kafka
chown -R kafka:kafka /var/lib/kafka
```

You can now run a default configuration of Kafka as a single-node cluster.
(Optional) Run other Kafka components, like Kafka Connect, on the same host.

To run other components, specify the hostname and port to connect to the Kafka broker using the `bootstrap.servers` property in the component configuration.

Example bootstrap servers configuration pointing to a single Kafka broker on the same host:

```properties
bootstrap.servers=localhost:9092
```

However, we recommend installing and running Kafka components on separate hosts.
(Optional) Install Kafka components on separate hosts.

- Extract the Kafka installation files into the `/opt/kafka/` directory on each host.
- Change the ownership of the `/opt/kafka/` directory to the Kafka user on each host.
- Update the `bootstrap.servers` property to connect the components to the Kafka brokers running on different hosts.

Example bootstrap servers configuration pointing to Kafka brokers on different hosts:

```properties
bootstrap.servers=kafka0.<host_ip_address>:9092,kafka1.<host_ip_address>:9092,kafka2.<host_ip_address>:9092
```

You can use this configuration for Kafka Connect, MirrorMaker 2, and the Kafka Bridge.
3.5. Running a Kafka cluster in KRaft mode
Configure and run a Kafka cluster in KRaft mode. Nodes in the cluster perform the role of broker, controller, or both.
- Broker role
- Manages the storage and passing of messages.
- Controller role
- Coordinates the cluster and manages metadata.
- Combined role
- A single node acts as both broker and controller.
The minimum recommended setup is three brokers and three controllers, with topic replication for stability and availability. The internal `__cluster_metadata` topic stores cluster-wide information.
Prerequisites
- Streams for Apache Kafka is installed on each host, and the configuration files are available.
- The procedure uses the `kafka-storage.sh`, `kafka-server-start.sh`, and `kafka-metadata-quorum.sh` tools.
Procedure
Generate a unique Kafka cluster ID using the `kafka-storage` tool:

```shell
./bin/kafka-storage.sh random-uuid
```

Save this ID, as it is reused for all nodes.
Create a configuration file for each node.

Base the configuration file on Kafka's provided examples:

- `controller.properties` for controller-only nodes
- `broker.properties` for broker-only nodes
- `server.properties` for nodes with both roles

Note: The example `server.properties` file uses `controller.quorum.voters` by default, which is intended for static quorum setup. To use dynamic quorum (recommended), replace this with `controller.quorum.bootstrap.servers`, as shown in this procedure. The `controller.properties` file uses `controller.quorum.bootstrap.servers` by default.

Adjust the following properties for each node:

```properties
node.id=1
process.roles=broker,controller
controller.quorum.bootstrap.servers=node1:9093,node2:9093,node3:9093
listeners=PLAINTEXT://0.0.0.0:9092,CONTROLLER://0.0.0.0:9093
advertised.listeners=PLAINTEXT://node1:9092
inter.broker.listener.name=PLAINTEXT
controller.listener.names=CONTROLLER
log.dirs=/var/lib/kafka/data
```

Important considerations:

- Each node must have a unique `node.id`.
- The `listeners` hostname (`0.0.0.0`) binds to all interfaces, but can be changed to a specific hostname or IP address for each node if needed.
- `advertised.listeners` must reflect the actual address that clients use to connect to the Kafka node.
- The `log.dirs` path in the configuration file defines where data and metadata are stored. If not set, it defaults to a temporary location, which is cleared on reboot and is suitable only for development.
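As a sketch, a controller-only node in the same cluster might look like the following; the node ID, hostnames, and paths here are illustrative, not prescriptive:

```properties
# Hypothetical controller-only node; hostnames and paths are placeholders
node.id=4
process.roles=controller
controller.quorum.bootstrap.servers=node1:9093,node2:9093,node3:9093
listeners=CONTROLLER://0.0.0.0:9093
controller.listener.names=CONTROLLER
log.dirs=/var/lib/kafka/metadata
```

A controller-only node does not need a `PLAINTEXT` listener or `advertised.listeners`, because clients do not connect to controllers directly.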
Bootstrap the controller quorum (one controller only):

```shell
./bin/kafka-storage.sh format --cluster-id <uuid> --standalone --config ./config/controller.properties
```

This initializes the metadata log and bootstraps the quorum. Use it on one node only. Replace `<uuid>` with the generated cluster ID.

Note: To bootstrap with multiple controllers, use `--initial-controllers` instead.

Start the controller:

```shell
./bin/kafka-server-start.sh ./config/controller.properties
```

Add additional controllers to the quorum.

Make sure `controller.quorum.bootstrap.servers` is correctly set across all nodes and the controllers are started.

Format the remaining nodes (brokers):

```shell
./bin/kafka-storage.sh format --cluster-id <uuid> --config ./config/server.properties --no-initial-controllers
```

Use the appropriate configuration file based on the role of each node. Use the same cluster ID for all nodes. This step prepares each node's storage without modifying the controller quorum.

Start each broker node:

```shell
./bin/kafka-server-start.sh ./config/server.properties
```

Check that Kafka is running:

```shell
jcmd | grep kafka
```

Returns:

```
process ID kafka.Kafka ./config/server.properties
```

Check the logs of each node to ensure that they have successfully joined the KRaft cluster:

```shell
tail -f ./logs/server.log
```
Next steps
You can now create topics, and send and receive messages from the brokers.
For brokers passing messages, you can use topic replication across the brokers in a cluster for data durability. Configure topics to have a replication factor of at least three and a minimum number of in-sync replicas set to one less than the replication factor (`replication.factor=3` with `min.insync.replicas=2`). For more information, see Section 9.7, “Creating a topic”.
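Following that guidance, a minimal shell sketch derives `min.insync.replicas` from the replication factor and prints the corresponding `kafka-topics.sh` command; the topic name and bootstrap address are hypothetical:

```shell
# Derive min.insync.replicas as one less than the replication factor
replication_factor=3
min_insync=$((replication_factor - 1))

# Print the topic-creation command rather than running it (sketch only);
# the topic name and bootstrap address are placeholders
echo "./bin/kafka-topics.sh --bootstrap-server localhost:9092 --create --topic my-topic \
  --partitions 3 --replication-factor ${replication_factor} \
  --config min.insync.replicas=${min_insync}"
```

With this pairing, a write succeeds only when two of the three replicas acknowledge it, so one broker can restart without blocking producers.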
3.6. Sending and receiving messages from a topic
This procedure describes how to start the Kafka console producer and consumer clients and use them to send and receive several messages.
A new topic is automatically created in step one. Topic auto-creation is controlled using the `auto.create.topics.enable` configuration property (set to `true` by default). Alternatively, you can configure and create topics before using the cluster. For more information, see Topics.
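If you prefer topics to be created explicitly before use, a sketch of the broker setting:

```properties
# Disable automatic topic creation so producers cannot create topics implicitly
auto.create.topics.enable=false
```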
Procedure
Start the Kafka console producer and configure it to send messages to a new topic:

```shell
./bin/kafka-console-producer.sh --broker-list <bootstrap_address> --topic <topic-name>
```

For example:

```shell
./bin/kafka-console-producer.sh --broker-list localhost:9092 --topic my-topic
```

Enter several messages into the console. Press Enter to send each individual message to your new topic:

```
>message 1
>message 2
>message 3
>message 4
```

When Kafka creates a new topic automatically, you might receive a warning that the topic does not exist:

```
WARN Error while fetching metadata with correlation id 39 : {4-3-16-topic1=LEADER_NOT_AVAILABLE} (org.apache.kafka.clients.NetworkClient)
```

The warning should not reappear after you send further messages.

In a new terminal window, start the Kafka console consumer and configure it to read messages from the beginning of your new topic:

```shell
./bin/kafka-console-consumer.sh --bootstrap-server <bootstrap_address> --topic <topic-name> --from-beginning
```

For example:

```shell
./bin/kafka-console-consumer.sh --bootstrap-server localhost:9092 --topic my-topic --from-beginning
```

The incoming messages display in the consumer console.
- Switch to the producer console and send additional messages. Check that they display in the consumer console.
- Stop the Kafka console producer and then the consumer by pressing Ctrl+C.
3.7. Stopping the Streams for Apache Kafka services
You can stop Kafka services by running a script. After running the script, all connections to the Kafka services are terminated.
Procedure
Stop the Kafka node:

```shell
./bin/kafka-server-stop.sh
```

Confirm that the Kafka node is stopped:

```shell
jcmd | grep kafka
```
3.8. Performing a graceful rolling restart of Kafka brokers
This procedure shows how to do a graceful rolling restart of brokers in a multi-node cluster. A rolling restart is usually required following an upgrade or change to the Kafka cluster configuration properties.
Some broker configurations do not need a restart of the broker. For more information, see Updating Broker Configs in the Apache Kafka documentation.
After you perform a restart of a broker, check for under-replicated topic partitions to make sure that replica partitions have caught up.
To achieve a graceful restart with no loss of availability, ensure that you are replicating topics and that at least the minimum number of replicas (`min.insync.replicas`) are in sync. The `min.insync.replicas` configuration determines the minimum number of replicas that must acknowledge a write for the write to be considered successful.
For a multi-node cluster, the standard approach is to have a topic replication factor of at least 3 and a minimum number of in-sync replicas set to 1 less than the replication factor. If you are using acks=all in your producer configuration for data durability, check that the broker you restarted is in sync with all the partitions it’s replicating before restarting the next broker.
Single-node clusters are unavailable during a restart, since all partitions are on the same broker.
Prerequisites
- Streams for Apache Kafka is installed on each host, and the configuration files are available.
- The Kafka cluster is operating as expected. Check for under-replicated partitions or any other issues affecting broker operation. The steps in this procedure describe how to check for under-replicated partitions.
Procedure
Perform the following steps on each Kafka broker. Complete the steps on the first broker before moving on to the next. Perform the steps on the brokers that also act as controllers last. Otherwise, the controllers need to change on more than one restart.
Stop the Kafka broker:

```shell
./bin/kafka-server-stop.sh
```

Make any changes to the broker configuration that require a restart after completion.

Restart the Kafka broker:

```shell
./bin/kafka-server-start.sh -daemon ./config/kraft/server.properties
```

Check that Kafka is running:

```shell
jcmd | grep kafka
```

Returns:

```
process ID kafka.Kafka ./config/kraft/server.properties
```

Check the logs of the node to ensure that it has successfully joined the KRaft cluster:

```shell
tail -f ./logs/server.log
```

Wait until the broker has zero under-replicated partitions. You can check from the command line or use metrics.
Use the `kafka-topics.sh` command with the `--under-replicated-partitions` parameter:

```shell
./bin/kafka-topics.sh --bootstrap-server <broker_host>:<port> --describe --under-replicated-partitions
```

For example:

```shell
./bin/kafka-topics.sh --bootstrap-server localhost:9092 --describe --under-replicated-partitions
```

The command provides a list of topics with under-replicated partitions in a cluster.

Topics with under-replicated partitions:

```
Topic: topic3 Partition: 4 Leader: 2 Replicas: 2,3 Isr: 2
Topic: topic3 Partition: 5 Leader: 3 Replicas: 1,2 Isr: 1
Topic: topic1 Partition: 1 Leader: 3 Replicas: 1,3 Isr: 3
# …
```

Under-replicated partitions are listed if the ISR (in-sync replica) count is less than the number of replicas. If a list is not returned, there are no under-replicated partitions.

Alternatively, use the `UnderReplicatedPartitions` metric:

```
kafka.server:type=ReplicaManager,name=UnderReplicatedPartitions
```

The metric provides a count of partitions where replicas have not caught up. Wait until the count is zero.
Tip: Use the Kafka Exporter to create an alert when there are one or more under-replicated partitions for a topic.
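During a rolling restart, the command-line check can be scripted. The following is a minimal sketch with the sample output hard-coded for illustration; in practice you would capture the output of `kafka-topics.sh --describe --under-replicated-partitions` instead:

```shell
# Sample kafka-topics.sh output, hard-coded for illustration; in practice:
#   output=$(./bin/kafka-topics.sh --bootstrap-server localhost:9092 \
#     --describe --under-replicated-partitions)
output='Topic: topic3 Partition: 4 Leader: 2 Replicas: 2,3 Isr: 2
Topic: topic3 Partition: 5 Leader: 3 Replicas: 1,2 Isr: 1
Topic: topic1 Partition: 1 Leader: 3 Replicas: 1,3 Isr: 3'

# Each reported line is one under-replicated partition; a count of zero means
# it is safe to restart the next broker
count=$(printf '%s\n' "$output" | grep -c '^Topic:')
echo "under-replicated partitions: ${count}"
```

A wrapper could poll this count in a loop and proceed to the next broker only when it reaches zero.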
Checking logs when restarting
If a broker fails to start, check the application logs for information. You can also check the status of a broker shutdown and restart in the `./logs/server.log` application log.