Apache Kafka Advance

We assume that you have already understood the basics of Kafka, so we will move on to the advanced topics.

Prerequisite: If you are new to Apache Kafka, it is advisable to first complete the self-paced Apache Kafka Basic course.

Do you want to join a live class instead and learn Apache Kafka directly from an expert trainer? Register for the upcoming Apache Kafka Advance live boot camp and upskill your understanding and knowledge of Apache Kafka.

Hurry, limited seats available every month.

Lessons

    1. A multi-node cluster involves multiple interconnected computers or servers, referred to as nodes. These nodes work together to distribute the workload, improving performance, fault tolerance, and scalability. Tasks can be distributed among nodes, allowing parallel processing and more efficient resource utilization. Multi-node clusters are often used in high-performance computing (HPC), data processing, and distributed computing environments.
    2. A multi-node Apache Kafka cluster is a distributed and scalable messaging system comprising multiple Kafka broker nodes that together handle the storage, processing, and distribution of data. Apache Kafka is designed for high throughput, fault tolerance, and horizontal scalability. This lesson gives an overview of the key components and concepts involved in a multi-node Kafka cluster (see the cluster-inspection sketch after this list).
    3. Applications that read data from Kafka topics are known as consumers. Applications integrate a Kafka client library to read from Apache Kafka, and excellent client libraries exist for almost all popular programming languages, including Python, Java, and Go (see the consumer sketch after this list).
    4. In Apache Kafka, producers and consumers exchange messages in the form of key-value pairs. When working with Kafka, it is essential to serialize the data into bytes before sending it to Kafka and to deserialize it back into its original format when consuming messages. Although the Producer API ships with ready-made serializers such as IntegerSerializer and StringSerializer (with matching deserializers on the consumer side), Kafka also allows custom serializers and deserializers for keys and values. The serializer is used by the message producer, while the deserializer is used by the message consumer; the pair is commonly abbreviated as a Kafka SerDe (see the custom SerDe sketch after this list).
    5. Kafka replication means keeping multiple copies of the data, spread across multiple servers/brokers. Writing the same data to more than one broker helps prevent data loss and maintains high availability when one of the brokers goes down and is unable to serve requests. The replication factor is a topic-level setting specified at topic creation time, and the unit of replication is the topic partition (see the topic-creation sketch after this list).
    6. Kafka Connect is a framework for scalably and reliably streaming data between Apache Kafka and other systems. It makes it simple to quickly define connectors that move large data collections into and out of Kafka. Kafka Connect can ingest entire databases or collect metrics from all your application servers into Kafka topics, making the data available for stream processing with low latency. An export job can deliver data from Kafka topics into secondary storage and query systems or into batch systems for offline analysis (see the connector-registration sketch after this list).
    7. Apache Kafka connectors are components that allow you to integrate Kafka with other systems, enabling the seamless transfer of data between Kafka and various data sources or sinks. There are two main types of Kafka connectors: source connectors and sink connectors.
    8. A schema registry is a centralized service that manages schemas for data exchanged between systems in a distributed architecture, and it is often used in conjunction with Apache Kafka. Its primary purpose is to enforce a shared schema for the data flowing through a messaging system, ensuring consistency and compatibility between producers and consumers of data (see the Avro producer sketch after this list).
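
Code Sketches

The sketches below illustrate the lessons above. They are minimal examples under stated assumptions, not production code, and every hostname, port, and topic name in them is a placeholder. First, for lesson 2, inspecting a multi-node cluster with the Java AdminClient; the broker addresses broker1/2/3:9092 are assumptions for your own cluster:

```java
import java.util.Properties;
import java.util.concurrent.ExecutionException;

import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.DescribeClusterResult;

public class ClusterOverview {
    public static void main(String[] args) throws ExecutionException, InterruptedException {
        Properties props = new Properties();
        // List several brokers so the client can still bootstrap if one node is down.
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG,
                  "broker1:9092,broker2:9092,broker3:9092");

        try (AdminClient admin = AdminClient.create(props)) {
            DescribeClusterResult cluster = admin.describeCluster();
            System.out.println("Cluster id : " + cluster.clusterId().get());
            System.out.println("Controller : " + cluster.controller().get());
            // One Node per broker in the multi-node cluster.
            cluster.nodes().get().forEach(node ->
                System.out.println("Broker node: " + node));
        }
    }
}
```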
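
For lesson 3, a minimal Java consumer that subscribes to a topic and prints records; the bootstrap address, group id "demo-group", and topic "demo-topic" are assumed placeholders:

```java
import java.time.Duration;
import java.util.List;
import java.util.Properties;

import org.apache.kafka.clients.consumer.ConsumerConfig;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.common.serialization.StringDeserializer;

public class SimpleConsumer {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        props.put(ConsumerConfig.GROUP_ID_CONFIG, "demo-group");
        props.put(ConsumerConfig.KEY_DESERIALIZER_CLASS_CONFIG, StringDeserializer.class.getName());
        props.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG, StringDeserializer.class.getName());
        // Start from the beginning of the topic when no committed offset exists.
        props.put(ConsumerConfig.AUTO_OFFSET_RESET_CONFIG, "earliest");

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(List.of("demo-topic"));
            while (true) {
                ConsumerRecords<String, String> records = consumer.poll(Duration.ofMillis(500));
                for (ConsumerRecord<String, String> record : records) {
                    System.out.printf("partition=%d offset=%d key=%s value=%s%n",
                            record.partition(), record.offset(), record.key(), record.value());
                }
            }
        }
    }
}
```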
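
For lesson 4, a sketch of a custom serializer/deserializer pair. The Order record (Java 16+) and its CSV encoding are invented for illustration; real code would typically use JSON, Avro, or Protobuf:

```java
import java.nio.charset.StandardCharsets;

import org.apache.kafka.common.serialization.Deserializer;
import org.apache.kafka.common.serialization.Serializer;

// A trivial value type for the example.
record Order(String id, double amount) {}

// Serializer: object -> bytes, used on the producer side.
class OrderSerializer implements Serializer<Order> {
    @Override
    public byte[] serialize(String topic, Order order) {
        if (order == null) return null;
        // Naive CSV encoding, purely for illustration.
        return (order.id() + "," + order.amount()).getBytes(StandardCharsets.UTF_8);
    }
}

// Deserializer: bytes -> object, used on the consumer side.
class OrderDeserializer implements Deserializer<Order> {
    @Override
    public Order deserialize(String topic, byte[] data) {
        if (data == null) return null;
        String[] parts = new String(data, StandardCharsets.UTF_8).split(",", 2);
        return new Order(parts[0], Double.parseDouble(parts[1]));
    }
}
```

These classes are wired in through the value.serializer producer property and the value.deserializer consumer property (and their key.* counterparts for keys).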
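
For lesson 5, creating a topic with a replication factor of 3 via the AdminClient. The topic name "orders" and the partition count are arbitrary, and the cluster must have at least three brokers for this to succeed:

```java
import java.util.List;
import java.util.Properties;
import java.util.concurrent.ExecutionException;

import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.NewTopic;

public class CreateReplicatedTopic {
    public static void main(String[] args) throws ExecutionException, InterruptedException {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG,
                  "broker1:9092,broker2:9092,broker3:9092");

        try (AdminClient admin = AdminClient.create(props)) {
            // 6 partitions, each partition replicated to 3 brokers.
            NewTopic topic = new NewTopic("orders", 6, (short) 3);
            admin.createTopics(List.of(topic)).all().get();
            System.out.println("Topic created with replication factor 3");
        }
    }
}
```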
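
For lessons 6 and 7, registering a source connector with a Kafka Connect worker through its REST API. This sketch assumes a Connect worker on localhost:8083 and uses the FileStreamSource connector that ships with Kafka; the connector name, file path, and topic are placeholders:

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class RegisterFileSource {
    public static void main(String[] args) throws Exception {
        // Connector definition: the bundled FileStreamSource connector tails a
        // file and writes each line to the "connect-demo" topic.
        String connectorJson = """
            {
              "name": "file-source-demo",
              "config": {
                "connector.class": "org.apache.kafka.connect.file.FileStreamSourceConnector",
                "tasks.max": "1",
                "file": "/tmp/input.txt",
                "topic": "connect-demo"
              }
            }
            """;

        HttpRequest request = HttpRequest.newBuilder()
                .uri(URI.create("http://localhost:8083/connectors"))  // default Connect REST port
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(connectorJson))
                .build();

        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.statusCode() + " " + response.body());
    }
}
```

A sink connector is registered the same way; only the connector.class and its configuration keys change.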
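
For lesson 8, a producer that serializes Avro records through a schema registry. This sketch assumes Confluent's Schema Registry running on localhost:8081 and the kafka-avro-serializer dependency on the classpath; the User schema and topic are invented for illustration:

```java
import java.util.Properties;

import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.serialization.StringSerializer;

import org.apache.avro.Schema;
import org.apache.avro.generic.GenericData;
import org.apache.avro.generic.GenericRecord;

public class AvroProducer {
    private static final String USER_SCHEMA = """
            {"type":"record","name":"User","fields":[
              {"name":"name","type":"string"},
              {"name":"age","type":"int"}]}
            """;

    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());
        // KafkaAvroSerializer registers the schema with the registry on first use
        // and embeds the schema id in every message it produces.
        props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG,
                  "io.confluent.kafka.serializers.KafkaAvroSerializer");
        props.put("schema.registry.url", "http://localhost:8081");

        Schema schema = new Schema.Parser().parse(USER_SCHEMA);
        GenericRecord user = new GenericData.Record(schema);
        user.put("name", "alice");
        user.put("age", 30);

        try (KafkaProducer<String, GenericRecord> producer = new KafkaProducer<>(props)) {
            producer.send(new ProducerRecord<>("users", "alice", user));
        }
    }
}
```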
