Description
About the course topic
Introduction
Apache Kafka is a fast, horizontally scalable, fault-tolerant messaging platform for distributed data streaming. It was first developed at LinkedIn and is written in Scala and Java. It provides a publish-subscribe mechanism for processing and storing data streams in a fault-tolerant way, and it is used to build real-time data pipelines that stream social data, geospatial data or sensor data from various devices.
Kafka integrates with Spark, Hadoop, Storm, HBase, Flink and many other systems for big data analytics.
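To make the publish-subscribe idea concrete, here is a minimal in-memory sketch in Python. It does not use Kafka or any Kafka client library. The class `ToyTopic`, its methods `publish` and `poll`, and the group names are all hypothetical, chosen only to illustrate two ideas: records are appended to a log rather than deleted on read, and each subscriber group tracks its own read position.

```python
from collections import defaultdict


class ToyTopic:
    """Hypothetical in-memory stand-in for a single-partition topic (not Kafka's API)."""

    def __init__(self):
        self._log = []                     # append-only: reading does not remove records
        self._offsets = defaultdict(int)   # next offset to read, tracked per group

    def publish(self, record):
        """Append a record and return the offset it was stored at."""
        self._log.append(record)
        return len(self._log) - 1

    def poll(self, group, max_records=10):
        """Return unread records for this group and advance that group's offset."""
        start = self._offsets[group]
        batch = self._log[start:start + max_records]
        self._offsets[group] = start + len(batch)
        return batch


topic = ToyTopic()
for reading in ("temp=21.5", "temp=21.7", "temp=22.0"):
    topic.publish(reading)

# Two independent subscriber groups each read the same stream at their own pace.
analytics_batch = topic.poll("analytics")
archiver_batch = topic.poll("archiver", max_records=2)
```

A real Kafka cluster adds partitioning, replication across brokers, and durable storage on top of this basic model. Those are the parts that provide horizontal scaling and fault tolerance.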
Using Kafka for real-time data streaming
- To build real-time streaming applications that react to data streams and perform real-time analytics.
- To transform, aggregate, join, and react to real-time data flows.
- To perform complex event processing.
The most common uses for Kafka include stream processing, messaging, website activity tracking, log aggregation and operational metrics.
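The transform, join, and aggregate operations mentioned above can be pictured with a small plain-Python sketch. It is not the Kafka Streams API. The event data, the `user_country` lookup table, and the functions `enrich` and `count_per_window` are hypothetical and exist only to show the shape of the computation: each event is enriched as it flows past, then counted in fixed 60-second (tumbling) windows.

```python
from collections import Counter

# Hypothetical page-view events: (timestamp_seconds, user_id, page)
events = [
    (0, "u1", "/home"),
    (12, "u2", "/home"),
    (61, "u1", "/cart"),
    (75, "u1", "/home"),
    (130, "u3", "/cart"),
]

# Hypothetical reference data to join each event against.
user_country = {"u1": "IN", "u2": "DE", "u3": "US"}

WINDOW_SECONDS = 60


def enrich(stream, lookup):
    """Transform + join: attach a country to each event, one event at a time."""
    for ts, user, page in stream:
        yield ts, user, page, lookup.get(user, "unknown")


def count_per_window(stream, window):
    """Aggregate: count events per (window start, country) tumbling window."""
    counts = Counter()
    for ts, _user, _page, country in stream:
        window_start = (ts // window) * window
        counts[(window_start, country)] += 1
    return dict(counts)


views = count_per_window(enrich(iter(events), user_country), WINDOW_SECONDS)
```

In a real deployment, the events would arrive continuously from a Kafka topic and the windowed counts would be emitted as results become available, rather than being computed over a fixed list.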
Pre-requisites
- If you are new to Kafka, it is advisable to first complete the self-paced Kafka basic course or join the instructor-led Apache Kafka basic course.
Course overview
Highlights
- Instructor-led live sessions
- 20+ Lessons
- Real-time data streaming workshops & projects
- Quizzes & assignments
- Access to self-paced course contents
- Post-training mentorship and guidance
- Once your order is confirmed, you can find the live meeting details under the “My Orders” section of your account.
What will you learn from this program
Lessons
- Introduction to data streaming & subsequent processing
- Introduction to Apache Kafka
- Installation & setup of Apache Kafka
- Setting up a single-node cluster
Know your instructor
Gautam Goswami
- Role: Solution Architect
- Experience: 22 years
- Specialist in: Kafka Streaming, Big Data, Hadoop, Druid