Apache Kafka
Apache Kafka
Foundation in 2011.
Decoupling Systems: Allows independent development and scaling of producers and consumers.
Why Use Kafka?
•Ideal for modern data-driven applications.
•Integrates with popular big data frameworks like Spark, Flink, and Hadoop.
Core Functions:
•Publish and Subscribe: Enables real-time messaging between producers and consumers through
topics.
•Durable Storage: Persistently stores data streams on disk, allowing replay and recovery.
•Scalable Partitioning: Divides topics into partitions for parallel and distributed data processing.
•Fault Tolerance: Ensures data availability and reliability through replication across brokers.
•Real-Time Stream Processing: Processes and analyzes data streams in real time using Kafka
consumption.
regions.
Producers and Consumers
•Producers: Send data to Kafka topics.
streams.
across partitions.
•Leader Replica: Handles all read and write requests for a partition.
•Follower Replicas: Maintain copies and take over if the leader fails.