Red Hat AMQ 2021.Q3, Using AMQ Streams on RHEL

Chapter 9. Using AMQ Streams with MirrorMaker 2.0

MirrorMaker 2.0 is used to replicate data between two or more active Kafka clusters, within or across data centers.

Data replication across clusters supports scenarios that require:

Recovery of data in the event of a system failure
Aggregation of data for analysis
Restriction of data access to a specific cluster
Provision of data at a specific location to improve latency

Note

MirrorMaker 2.0 has features not supported by the previous version of MirrorMaker. However, you can configure MirrorMaker 2.0 to be used in legacy mode.

9.1. MirrorMaker 2.0 data replication

MirrorMaker 2.0 consumes messages from a source Kafka cluster and writes them to a target Kafka cluster.

MirrorMaker 2.0 uses:

Source cluster configuration to consume data from the source cluster
Target cluster configuration to output data to the target cluster

MirrorMaker 2.0 is based on the Kafka Connect framework, with connectors managing the transfer of data between clusters. A MirrorMaker 2.0 MirrorSourceConnector replicates topics from a source cluster to a target cluster.

The process of mirroring data from one cluster to another cluster is asynchronous. The recommended pattern is for messages to be produced locally alongside the source Kafka cluster, then consumed remotely close to the target Kafka cluster.

MirrorMaker 2.0 can be used with more than one source cluster.

Figure 9.1. Replication across two clusters

By default, a check for new topics in the source cluster is made every 10 minutes. You can change the frequency by adding refresh.topics.interval.seconds to the source connector configuration. However, increasing the frequency of the operation might affect overall performance.

9.2. Cluster configuration

You can use MirrorMaker 2.0 in active/passive or active/active cluster configurations.

In an active/active configuration, both clusters are active and provide the same data simultaneously, which is useful if you want to make the same data available locally in different geographical locations.
In an active/passive configuration, the data from an active cluster is replicated in a passive cluster, which remains on standby, for example, for data recovery in the event of system failure.

The expectation is that producers and consumers connect to active clusters only.

A MirrorMaker 2.0 cluster is required at each target destination.

9.2.1. Bidirectional replication (active/active)

The MirrorMaker 2.0 architecture supports bidirectional replication in an active/active cluster configuration.

Each cluster replicates the data of the other cluster using the concept of source and remote topics. As the same topics are stored in each cluster, remote topics are automatically renamed by MirrorMaker 2.0 to represent the source cluster. The name of the originating cluster is prepended to the name of the topic.

Figure 9.2. Topic renaming

MirrorMaker 2.0 bidirectional architecture

By flagging the originating cluster, topics are not replicated back to that cluster.
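The renaming and loop-avoidance rules described above can be sketched in a few lines of Python. This is only an illustration of the naming convention, not MirrorMaker code: the function names are hypothetical, and the separator defaults to "." (the Apache Kafka default), whereas the example properties file later in this chapter sets it to "-".

```python
def remote_topic_name(source_alias: str, topic: str, separator: str = ".") -> str:
    """Return the name a replicated topic is given in the target cluster."""
    return f"{source_alias}{separator}{topic}"


def should_replicate(topic: str, target_alias: str, separator: str = ".") -> bool:
    """Return False for topics that already carry the target cluster's prefix.

    Such topics originated in the target cluster, so sending them back
    would create a replication loop. (Simplified, hypothetical check.)
    """
    return not topic.startswith(f"{target_alias}{separator}")


# Topics as they might exist in cluster-1 after bidirectional replication.
topics_in_cluster_1 = ["orders", "cluster-2.payments"]

# Only topics that did not come from cluster-2 are sent on to cluster-2.
to_cluster_2 = [
    remote_topic_name("cluster-1", t)
    for t in topics_in_cluster_1
    if should_replicate(t, target_alias="cluster-2")
]
```

In this sketch, the topic that came from cluster-2 is filtered out, so only the locally produced topic is mirrored under its prefixed name.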

The concept of replication through remote topics is useful when configuring an architecture that requires data aggregation. Consumers can subscribe to source and remote topics within the same cluster, without the need for a separate aggregation cluster.

9.2.2. Unidirectional replication (active/passive)

The MirrorMaker 2.0 architecture supports unidirectional replication in an active/passive cluster configuration.

You can use an active/passive cluster configuration to make backups or migrate data to another cluster. In this situation, you might not want automatic renaming of remote topics.

You can override automatic renaming by adding IdentityReplicationPolicy to the source connector configuration. With this configuration applied, topics retain their original names.
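As a rough illustration of the difference between the two naming behaviours, the following Python sketch compares them side by side. Both functions are hypothetical stand-ins for the policies, written only to show the resulting topic names, and the "." separator is the Apache Kafka default.

```python
def default_policy_name(source_alias: str, topic: str, separator: str = ".") -> str:
    """Default behaviour: prepend the source cluster alias to the topic name."""
    return f"{source_alias}{separator}{topic}"


def identity_policy_name(source_alias: str, topic: str) -> str:
    """Identity behaviour: the topic keeps its original name in the target."""
    return topic


# The same source topic as it would appear in the passive (target) cluster.
renamed = default_policy_name("cluster-1", "orders")
retained = identity_policy_name("cluster-1", "orders")
```

Because the retained name no longer records where a topic originated, this kind of naming is best suited to one-way flows such as backup or migration.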

9.2.3. Topic configuration synchronization

Topic configuration is automatically synchronized between source and target clusters. Synchronizing configuration properties reduces the need for rebalancing.

9.2.4. Data integrity

MirrorMaker 2.0 monitors source topics and propagates any configuration changes to remote topics, checking for and creating missing partitions. Only MirrorMaker 2.0 can write to remote topics.

9.2.5. Offset tracking

MirrorMaker 2.0 tracks offsets for consumer groups using internal topics.

The offset sync topic maps the source and target offsets for replicated topic partitions from record metadata
The checkpoint topic maps the last committed offset in the source and target cluster for replicated topic partitions in each consumer group

Offsets for the checkpoint topic are tracked at predetermined intervals through configuration. Both topics enable replication to be fully restored from the correct offset position on failover.

MirrorMaker 2.0 uses its MirrorCheckpointConnector to emit checkpoints for offset tracking.
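To make the role of the offset sync mapping more concrete, here is a deliberately naive Python sketch of how a source offset could be translated into a target offset. The OffsetSync type and translate_offset function are hypothetical names, and the linear translation assumes records are replicated one-to-one after the sync point. MirrorMaker's own translation logic may be more conservative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OffsetSync:
    """One mapping between a source offset and the matching target offset."""
    upstream_offset: int    # offset of a record in the source partition
    downstream_offset: int  # offset of the same record in the remote partition


def translate_offset(sync: OffsetSync, committed_upstream: int) -> int:
    """Estimate the target offset for a consumer group's committed source offset."""
    if committed_upstream < sync.upstream_offset:
        raise ValueError("committed offset is older than the available sync point")
    step = committed_upstream - sync.upstream_offset
    return sync.downstream_offset + step


# A sync record says source offset 100 corresponds to target offset 40.
sync = OffsetSync(upstream_offset=100, downstream_offset=40)

# A group that committed offset 105 in the source resumes at 45 in the target.
checkpoint = translate_offset(sync, committed_upstream=105)
```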

9.2.6. Synchronizing consumer group offsets

The __consumer_offsets topic stores information on committed offsets, for each consumer group. Offset synchronization periodically transfers the consumer offsets for the consumer groups of a source cluster into the consumer offsets topic of a target cluster.

Offset synchronization is particularly useful in an active/passive configuration. If the active cluster goes down, consumer applications can switch to the passive (standby) cluster and pick up from the last transferred offset position.

To use topic offset synchronization:

Enable the synchronization by adding sync.group.offsets.enabled to the checkpoint connector configuration, and setting the property to true. Synchronization is disabled by default.
Add the IdentityReplicationPolicy to the source and checkpoint connector configuration so that topics in the target cluster retain their original names.

For topic offset synchronization to work, consumer groups in the target cluster cannot use the same IDs as groups in the source cluster.

If enabled, the synchronization of offsets from the source cluster is made periodically. You can change the frequency by adding sync.group.offsets.interval.seconds and emit.checkpoints.interval.seconds to the checkpoint connector configuration. These properties specify, in seconds, how often the consumer group offsets are synchronized and how often checkpoints are emitted for offset tracking. The default for both properties is 60 seconds. You can also change the frequency of checks for new consumer groups using the refresh.groups.interval.seconds property. This check is performed every 10 minutes by default.

Because the synchronization is time-based, any switchover by consumers to a passive cluster will likely result in some duplication of messages.
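The amount of duplication can be reasoned about with simple arithmetic. The Python sketch below is a hypothetical helper, not part of MirrorMaker. It assumes one-to-one replication of records and that committed offsets mark the next record to read.

```python
def replayed_on_failover(committed_at_failure: int, committed_at_last_sync: int) -> int:
    """Count records a consumer group reads again after switching clusters.

    committed_at_failure:   the group's committed source offset when the
                            active cluster went down
    committed_at_last_sync: the committed source offset that was last
                            transferred to the passive cluster
    """
    return max(0, committed_at_failure - committed_at_last_sync)


# The group had committed offset 1250, but the last sync transferred 1000,
# so the 250 records in between are consumed a second time.
duplicates = replayed_on_failover(committed_at_failure=1250, committed_at_last_sync=1000)
```

Shorter synchronization intervals reduce this gap, but consumer applications that switch clusters generally still need to tolerate reprocessing.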

9.2.7. Connectivity checks

A heartbeat internal topic checks connectivity between clusters.

The heartbeat topic is replicated from the source cluster.

Target clusters use the topic to check:

The connector managing connectivity between clusters is running
The source cluster is available

MirrorMaker 2.0 uses its MirrorHeartbeatConnector to emit heartbeats that perform these checks.
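One way to picture how a target cluster might use replicated heartbeats is a staleness check on the most recent heartbeat timestamp. The following Python sketch is an assumption-laden illustration: the function name and the 30-second threshold are hypothetical and are not MirrorMaker settings.

```python
def source_looks_reachable(last_heartbeat_ms: int, now_ms: int, max_age_ms: int = 30_000) -> bool:
    """Treat the replication path as healthy if a heartbeat arrived recently.

    A fresh heartbeat in the target cluster implies both that the source
    cluster produced it and that the connector carried it across.
    """
    return now_ms - last_heartbeat_ms <= max_age_ms


now = 1_700_000_060_000  # illustrative timestamps in epoch milliseconds
healthy = source_looks_reachable(last_heartbeat_ms=1_700_000_055_000, now_ms=now)
stale = source_looks_reachable(last_heartbeat_ms=1_700_000_000_000, now_ms=now)
```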

9.3. ACL rules synchronization

If AclAuthorizer is being used, ACL rules that manage access to brokers also apply to remote topics. Users that can read a source topic can read its remote equivalent.

Note

OAuth 2.0 authorization does not support access to remote topics in this way.

9.4. Synchronizing data between Kafka clusters using MirrorMaker 2.0

Use MirrorMaker 2.0 to synchronize data between Kafka clusters through configuration.

The previous version of MirrorMaker continues to be supported, by running MirrorMaker 2.0 in legacy mode.

The configuration must specify:

Each Kafka cluster
Connection information for each cluster, including TLS authentication
The replication flow and direction
- Cluster to cluster
- Topic to topic
Replication rules
Committed offset tracking intervals

This procedure describes how to implement MirrorMaker 2.0 by creating the configuration in a properties file, then passing the properties when using the MirrorMaker script file to set up the connections.

Note

MirrorMaker 2.0 uses Kafka Connect to make the connections to transfer data between clusters. Kafka provides MirrorMaker sink and source connectors for data replication. If you wish to use the connectors instead of the MirrorMaker script, the connectors must be configured in the Kafka Connect cluster. For more information, refer to the Apache Kafka documentation.

Before you begin

A sample configuration properties file is provided in ./config/connect-mirror-maker.properties.

Prerequisites

You need AMQ Streams installed on the hosts of each Kafka cluster node you are replicating.

Procedure

Open the sample properties file in a text editor, or create a new one, and edit the file to include connection information and the replication flows for each Kafka cluster.

The following example shows a configuration to connect two clusters, cluster-1 and cluster-2, bidirectionally. Cluster names are configurable through the clusters property.

# 1
clusters=cluster-1,cluster-2

# 2
cluster-1.bootstrap.servers=CLUSTER-NAME-kafka-bootstrap-PROJECT-NAME:443
# 3
cluster-1.security.protocol=SSL
cluster-1.ssl.truststore.password=TRUSTSTORE-NAME
cluster-1.ssl.truststore.location=PATH-TO-TRUSTSTORE/truststore.cluster-1.jks
cluster-1.ssl.keystore.password=KEYSTORE-NAME
cluster-1.ssl.keystore.location=PATH-TO-KEYSTORE/user.cluster-1.p12

# 4
cluster-2.bootstrap.servers=CLUSTER-NAME-kafka-bootstrap-PROJECT-NAME:443
# 5
cluster-2.security.protocol=SSL
cluster-2.ssl.truststore.password=TRUSTSTORE-NAME
cluster-2.ssl.truststore.location=PATH-TO-TRUSTSTORE/truststore.cluster-2.jks
cluster-2.ssl.keystore.password=KEYSTORE-NAME
cluster-2.ssl.keystore.location=PATH-TO-KEYSTORE/user.cluster-2.p12

# 6
cluster-1->cluster-2.enabled=true
# 7
cluster-1->cluster-2.topics=.*
# 8
cluster-2->cluster-1.enabled=true
# 9
cluster-2->cluster-1.topics=.*

# 10
replication.policy.separator=-
# 11
sync.topic.acls.enabled=false
# 12
refresh.topics.interval.seconds=60
# 13
refresh.groups.interval.seconds=60

1: Each Kafka cluster is identified with its alias.
2: Connection information for cluster-1, using the bootstrap address and port 443. Both clusters use port 443 to connect to Kafka using OpenShift Routes.
3: The ssl. properties define TLS configuration for cluster-1.
4: Connection information for cluster-2.
5: The ssl. properties define the TLS configuration for cluster-2.
6: Replication flow enabled from the cluster-1 cluster to the cluster-2 cluster.
7: Replicates all topics from the cluster-1 cluster to the cluster-2 cluster.
8: Replication flow enabled from the cluster-2 cluster to the cluster-1 cluster.
9: Replicates topics that match the given pattern from the cluster-2 cluster to the cluster-1 cluster. The pattern .* matches all topics.
10: Defines the separator used for the renaming of remote topics.
11: When enabled, ACLs are applied to synchronized topics. Apache Kafka enables this setting by default, so the example disables it explicitly.
12: The period between checks for new topics to synchronize.
13: The period between checks for new consumer groups to synchronize.
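Before starting MirrorMaker, it can help to confirm which replication flows a properties file actually enables. The following Python sketch is a hypothetical sanity check, not a tool shipped with AMQ Streams. Its parser handles only simple key=value lines rather than the full Java properties format. Treating a missing .enabled key as false and a missing .topics key as .* are assumptions made for the example.

```python
def parse_properties(text: str) -> dict[str, str]:
    """Parse simple key=value lines, skipping blank lines and # comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


def enabled_flows(props: dict[str, str]) -> list[tuple[str, str, str]]:
    """List (source, target, topic pattern) for every enabled replication flow."""
    clusters = [c.strip() for c in props.get("clusters", "").split(",") if c.strip()]
    flows = []
    for source in clusters:
        for target in clusters:
            if source == target:
                continue
            prefix = f"{source}->{target}"
            if props.get(f"{prefix}.enabled", "false").lower() == "true":
                flows.append((source, target, props.get(f"{prefix}.topics", ".*")))
    return flows


EXAMPLE = """
clusters=cluster-1,cluster-2
cluster-1->cluster-2.enabled=true
cluster-1->cluster-2.topics=.*
cluster-2->cluster-1.enabled=true
cluster-2->cluster-1.topics=.*
"""

for source, target, pattern in enabled_flows(parse_properties(EXAMPLE)):
    print(f"{source} -> {target}: topics matching {pattern}")
```

A misspelled flow key simply does not appear in the output, which makes that kind of mistake easier to spot.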

OPTION: If required, add a policy that overrides the automatic renaming of remote topics. Instead of prepending the name with the name of the source cluster, the topic retains its original name.
This optional setting is used for active/passive backups and data migration.
```
replication.policy.class=io.strimzi.kafka.connect.mirror.IdentityReplicationPolicy
```
OPTION: If you want to synchronize consumer group offsets, add configuration to enable and manage the synchronization:
```
refresh.groups.interval.seconds=60
# 1
sync.group.offsets.enabled=true
# 2
sync.group.offsets.interval.seconds=60
# 3
emit.checkpoints.interval.seconds=60
```
1: Optional setting to synchronize consumer group offsets, which is useful for recovery in an active/passive configuration. Synchronization is not enabled by default.
2: If the synchronization of consumer group offsets is enabled, you can adjust the frequency of the synchronization.
3: Adjusts the frequency of checks for offset tracking. If you change the frequency of offset synchronization, you might also need to adjust the frequency of these checks.

Start ZooKeeper and Kafka in the target clusters:

su - kafka
/opt/kafka/bin/zookeeper-server-start.sh -daemon /opt/kafka/config/zookeeper.properties

/opt/kafka/bin/kafka-server-start.sh -daemon /opt/kafka/config/server.properties

Start MirrorMaker with the cluster connection configuration and replication policies you defined in your properties file:
```
/opt/kafka/bin/connect-mirror-maker.sh /opt/kafka/config/connect-mirror-maker.properties
```
MirrorMaker sets up connections between the clusters.

For each target cluster, verify that the topics are being replicated:

/opt/kafka/bin/kafka-topics.sh --bootstrap-server <BrokerAddress> --list

9.5. Using MirrorMaker 2.0 in legacy mode

This procedure describes how to configure MirrorMaker 2.0 to use it in legacy mode. Legacy mode supports the previous version of MirrorMaker.

The MirrorMaker script /opt/kafka/bin/kafka-mirror-maker.sh can run MirrorMaker 2.0 in legacy mode.

Prerequisites

You need the properties files you currently use with the legacy version of MirrorMaker.

/opt/kafka/config/consumer.properties
/opt/kafka/config/producer.properties

Procedure

Edit the MirrorMaker consumer.properties and producer.properties files to turn off MirrorMaker 2.0 features.

For example:

# 1
replication.policy.class=org.apache.kafka.mirror.LegacyReplicationPolicy

# 2
refresh.topics.enabled=false
refresh.groups.enabled=false
emit.checkpoints.enabled=false
emit.heartbeats.enabled=false
sync.topic.configs.enabled=false
sync.topic.acls.enabled=false

1: Emulate the previous version of MirrorMaker.
2: MirrorMaker 2.0 features disabled, including the internal checkpoint and heartbeat topics

Save the changes and restart MirrorMaker with the properties files you used with the previous version of MirrorMaker:
```
su - kafka
/opt/kafka/bin/kafka-mirror-maker.sh \
--consumer.config /opt/kafka/config/consumer.properties \
--producer.config /opt/kafka/config/producer.properties \
--num.streams=2
```
The consumer properties provide the configuration for the source cluster and the producer properties provide the target cluster configuration.
MirrorMaker sets up connections between the clusters.

Start ZooKeeper and Kafka in the target cluster:

su - kafka
/opt/kafka/bin/zookeeper-server-start.sh -daemon /opt/kafka/config/zookeeper.properties

su - kafka
/opt/kafka/bin/kafka-server-start.sh -daemon /opt/kafka/config/server.properties

For the target cluster, verify that the topics are being replicated:

/opt/kafka/bin/kafka-topics.sh --bootstrap-server <BrokerAddress> --list
