Kafka Add Partitions In Topic Example
Kafka is a widely used messaging queue with powerful features. Messages are stored in topics, which are divided into partitions. Sometimes, we may need to increase the number of partitions in a topic. Let us delve into understanding how to add partitions to a Kafka topic.
1. Introduction
Apache Kafka is a distributed event streaming platform designed to handle high-throughput, fault-tolerant, and real-time data processing. It is widely used in large-scale enterprise applications for messaging, event-driven architectures, log aggregation, and real-time analytics. Kafka enables applications to publish, subscribe, store, and process streams of records efficiently.
Kafka is highly scalable and provides strong durability guarantees, making it an excellent choice for building data pipelines, microservices architectures, and stream processing applications. It consists of key components such as producers, topics, partitions, consumers, brokers, and consumer groups, all working together to ensure seamless data flow.
1.1 What is a Topic in Kafka?
A Topic in Kafka is a logical channel or category to which producers send messages and from which consumers read messages. It acts as a virtual queue that organizes messages for specific use cases. Kafka topics are fundamental to the event-driven model and support multiple publishers and subscribers.
1.1.1 Characteristics of Kafka Topics
Below are some key characteristics of Kafka topics:
- Topics store events/messages in a distributed manner across brokers.
- Messages in a topic are immutable and persist for a configurable retention period.
- Topics can be configured with different retention policies (time-based or size-based).
- Producers can publish messages to multiple topics.
- Consumers subscribe to topics to process data asynchronously.
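Retention can be tuned per topic. Below is a hypothetical example of the two topic-level settings mentioned above (the property names retention.ms and retention.bytes are real Kafka topic configs; the values are illustrative, not recommendations):

```
# Time-based retention: keep messages for 7 days (value in milliseconds)
retention.ms=604800000
# Size-based retention: cap each partition's log at roughly 1 GiB
retention.bytes=1073741824
```

These can be supplied at creation time via the --config flag of kafka-topics.sh, or changed later for an existing topic.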
1.2 What are Partitions in Kafka?
Partitioning is a key feature of Kafka that enables horizontal scalability and fault tolerance. A topic is divided into multiple partitions, each acting as an independent log stored on different brokers. This division allows Kafka to distribute messages across multiple nodes, ensuring high availability and parallel processing.
1.2.1 Benefits of Kafka Partitions
Below are some key benefits of Kafka partitions:
- Enable parallelism by allowing multiple consumers to read from different partitions simultaneously.
- Support high-throughput data ingestion and real-time processing.
- Ensure scalability by distributing partitions across multiple brokers in a Kafka cluster.
- Improve fault tolerance by replicating partitions across brokers.
1.3 How Does Kafka Handle Partitioning?
Each partition is identified by an integer ID (starting from 0). Producers determine which partition a message is sent to, either through a partitioning strategy (e.g., key-based hashing) or randomly. Consumers within the same consumer group distribute the workload by reading from different partitions.
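The key-based strategy can be sketched in plain Java as a hash of the key mapped onto the partition count. Note this is a simplified, hypothetical stand-in: Kafka's built-in DefaultPartitioner actually applies a murmur2 hash to the serialized key bytes, but the modulo step and the per-key stability are the same.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class PartitionSketch {
    // Simplified stand-in for Kafka's key-based partitioner:
    // hash the key bytes and map the result onto [0, numPartitions).
    // (Kafka's DefaultPartitioner uses murmur2, not Arrays.hashCode.)
    static int partitionFor(String key, int numPartitions) {
        byte[] keyBytes = key.getBytes(StandardCharsets.UTF_8);
        int hash = Arrays.hashCode(keyBytes);
        return (hash & 0x7fffffff) % numPartitions; // force non-negative
    }

    public static void main(String[] args) {
        // The same key always maps to the same partition, which is what
        // preserves per-key ordering within a topic.
        for (String key : new String[]{"order-1", "order-2", "order-3"}) {
            System.out.println(key + " -> partition " + partitionFor(key, 3));
        }
    }
}
```

Because the mapping is deterministic, all messages sharing a key land in one partition and are consumed in the order they were produced.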
1.3.1 Why Increase Kafka Partitions?
Adding more partitions to a Kafka topic can enhance performance and scalability. Here are several reasons why increasing partitions might be necessary:
- Increase parallelism: More partitions allow multiple consumers to read messages concurrently, reducing processing time.
- Distribute load: Kafka balances partitions across brokers, preventing any single broker from becoming a bottleneck.
- Handle growing traffic: If the number of producers or consumers increases, additional partitions help accommodate higher message volume.
- Reduce consumer lag: High-throughput systems benefit from additional partitions by reducing the risk of consumer lag, ensuring timely message processing.
- Enable finer-grained scaling: By adjusting the number of partitions dynamically, organizations can scale their Kafka infrastructure based on demand.
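The parallelism point above has a hard ceiling: within one consumer group, at most one consumer can read each partition, so the partition count caps useful consumer parallelism. The sketch below illustrates range-style assignment with a hypothetical helper, assignRange (it mimics the contiguous-split behavior of Kafka's RangeAssignor but is not the actual Kafka class):

```java
import java.util.ArrayList;
import java.util.List;

public class AssignmentSketch {
    // Range-style assignment sketch: split numPartitions contiguously
    // across numConsumers; the first (numPartitions % numConsumers)
    // consumers each receive one extra partition.
    static List<List<Integer>> assignRange(int numPartitions, int numConsumers) {
        List<List<Integer>> assignment = new ArrayList<>();
        int base = numPartitions / numConsumers;
        int extra = numPartitions % numConsumers;
        int next = 0;
        for (int c = 0; c < numConsumers; c++) {
            int count = base + (c < extra ? 1 : 0);
            List<Integer> partitions = new ArrayList<>();
            for (int i = 0; i < count; i++) {
                partitions.add(next++);
            }
            assignment.add(partitions);
        }
        return assignment;
    }

    public static void main(String[] args) {
        // 5 partitions over 2 consumers: [0, 1, 2] and [3, 4]
        System.out.println(assignRange(5, 2));
        // 3 partitions over 4 consumers: the fourth consumer sits idle,
        // which is why adding partitions can unlock more parallelism.
        System.out.println(assignRange(3, 4));
    }
}
```

With fewer partitions than consumers, the surplus consumers receive nothing; raising the partition count puts them to work.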
1.3.2 Considerations when Increasing Partitions
- Rebalancing: Adding partitions may cause a rebalance of consumer groups, temporarily affecting performance.
- Ordering Guarantees: Kafka only guarantees order within a single partition. Increasing partitions can affect strict ordering requirements.
- Storage Impact: More partitions mean additional storage overhead and increased metadata management.
- Broker Capacity: Ensure brokers have enough resources to handle additional partitions without causing resource exhaustion.
By properly managing partitions, Kafka users can optimize performance while maintaining a scalable and fault-tolerant event-driven system.
2. Guide to Adding Partitions
2.1 Creating a Kafka Topic
For brevity, we’re skipping the Kafka setup as it’s beyond this tutorial’s scope. However, you can easily host it using Docker. Use the following command to create the topic my_topic:

kafka-topics.sh --create --topic my_topic --bootstrap-server localhost:9092 --partitions 3 --replication-factor 1
This command creates a new Kafka topic named my_topic. The --bootstrap-server localhost:9092 option specifies the Kafka broker to connect to, --partitions 3 defines three partitions for the topic to allow parallelism and scalability, and --replication-factor 1 means each partition has only a single copy, i.e. no redundancy for fault tolerance.
2.2 Using Kafka CLI
To increase the number of partitions for an existing topic, use the following Kafka CLI command:
kafka-topics.sh --bootstrap-server localhost:9092 --alter --topic my_topic --partitions 5

2.2.1 Command Explanation and Output
This command increases the number of partitions of the existing Kafka topic my_topic to 5. The --bootstrap-server localhost:9092 option specifies the broker to connect to (older Kafka releases used a --zookeeper option here instead, but direct ZooKeeper access was deprecated and removed in Kafka 3.0), while --alter indicates that we are modifying an existing topic. Note that this command can only increase partitions, never decrease them. After the command completes, you can verify the new partition count by running kafka-topics.sh --describe --topic my_topic --bootstrap-server localhost:9092.
2.3 Using Java API
You can also increase partitions programmatically using the Kafka AdminClient
API in Java.
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.NewPartitions;

import java.util.Collections;
import java.util.Properties;
import java.util.concurrent.ExecutionException;

public class KafkaPartitionAdder {

    public static void main(String[] args) {
        String bootstrapServers = "localhost:9092";
        String topicName = "my_topic";
        int newPartitionCount = 5;

        // Configure AdminClient
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, bootstrapServers);

        try (AdminClient adminClient = AdminClient.create(props)) {
            // Create new partition request
            NewPartitions newPartitions = NewPartitions.increaseTo(newPartitionCount);

            // Apply changes
            adminClient.createPartitions(Collections.singletonMap(topicName, newPartitions)).all().get();
            System.out.println("Partitions increased successfully!");
        } catch (ExecutionException | InterruptedException e) {
            e.printStackTrace();
        }
    }
}
2.3.1 Code Explanation and Output
The Java code defines a class KafkaPartitionAdder that increases the number of partitions for an existing Kafka topic using the Kafka AdminClient API. It connects to a Kafka broker running on localhost:9092 and specifies the topic name my_topic with a new partition count of 5. The AdminClient is configured using properties, and a request to increase partitions is created using NewPartitions.increaseTo(newPartitionCount). The createPartitions method submits the change asynchronously, and the call to all().get() blocks until the broker confirms it. If successful, it prints:

Partitions increased successfully!
Otherwise, any exceptions encountered during execution are caught and printed.
3. Frequent Mistakes to Avoid When Increasing Partitions
When increasing the number of partitions in a Kafka topic, several important factors must be considered to ensure smooth operation and avoid potential issues. Kafka’s partitioning mechanism impacts message distribution, ordering, consumer rebalancing, and fault tolerance. Below are key considerations to keep in mind before adding partitions:
- Data Redistribution: Kafka does not automatically redistribute existing messages across new partitions. Newly added partitions will only receive new messages while existing messages remain in their original partitions. This can lead to an uneven data distribution if not managed properly.
- Keyed Messages: If a producer assigns messages to partitions based on a key (e.g., user ID, order ID), increasing the number of partitions may disrupt the original ordering. Since Kafka uses a hashing algorithm to determine partition assignment, adding partitions changes the hash distribution, potentially leading to key-based message reordering.
- Consumer Rebalancing: Adding new partitions to a topic triggers a rebalance among consumers in the consumer group. During this process, some consumers may be temporarily paused while ownership of partitions is reassigned, which can momentarily impact performance and processing efficiency.
- Replication Factor: Increasing the number of partitions does not affect the replication factor of a topic. If replication is required for new partitions, manual configuration is needed to assign replicas to brokers. Without proper replication, new partitions may lack fault tolerance and data redundancy.
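The keyed-message caveat above can be demonstrated with a small sketch. The hash-modulo mapping below is illustrative (Kafka's default partitioner hashes the serialized key with murmur2), but the effect is identical: changing the divisor can change the result for the same key.

```java
public class RehashDemo {
    // Illustrative key-to-partition mapping; Kafka's real partitioner
    // uses murmur2 over the serialized key, but the modulo step is the same.
    static int partitionFor(String key, int numPartitions) {
        return (key.hashCode() & 0x7fffffff) % numPartitions;
    }

    public static void main(String[] args) {
        String key = "user-1001"; // hypothetical message key
        int before = partitionFor(key, 3); // topic with 3 partitions
        int after = partitionFor(key, 5);  // same topic after the increase
        System.out.println("before: " + before + ", after: " + after);
        // Whenever before != after, new messages for this key land on a
        // different partition than its older messages, breaking the
        // per-key ordering that a single partition guaranteed.
    }
}
```

This is why topics with strict per-key ordering requirements should choose their partition count carefully up front rather than growing it later.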
4. Conclusion
Adding partitions to a Kafka topic is a useful way to scale applications and improve performance. However, careful consideration must be given to message ordering, consumer rebalancing, and data distribution.