Apache Kafka

Kafka Exercises

| Name | Topic | Objective & Instructions | Solution | Comments |

Kafka Self Assessment

Kafka 101

What is Kafka?

kafka.apache.org: "Apache Kafka is an open-source distributed event streaming platform used by thousands of companies for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications."

In other words, Kafka is a kind of distributed log: you can store events in it, read them back, and distribute them to different services, all at high scale and in real time.
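The "distributed log" idea can be illustrated with a toy, single-process model. This is only a sketch and not how Kafka is implemented: the `EventLog` class, its methods, and the event names are hypothetical, chosen to show that events are appended in order and that each reading service keeps its own position (offset) in the log.

```python
from dataclasses import dataclass, field


@dataclass
class EventLog:
    """Toy append-only log (hypothetical, in-memory, single process)."""
    events: list = field(default_factory=list)

    def append(self, event: str) -> int:
        """Store an event at the end of the log and return its offset."""
        self.events.append(event)
        return len(self.events) - 1

    def read_from(self, offset: int) -> list:
        """Return every event at or after the given offset."""
        return self.events[offset:]


log = EventLog()
log.append("order-created")
log.append("order-paid")

# Two services read the same log independently, each from its own offset.
billing_view = log.read_from(0)
shipping_view = log.read_from(1)
```

Reading does not remove anything from the log, which is why several services can consume the same events at their own pace.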

What is Kafka used for?
  • Real-time e-commerce
  • Banking
  • Health Care
  • Automotive (traffic alerts, hazard alerts, ...)
  • Real-time Fraud Detection
What is a "Producer" in Kafka?

An application that publishes data to the Kafka cluster.
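As a rough illustration, a producer could look like the sketch below. It assumes the third-party `kafka-python` package is installed and that a broker is reachable at `localhost:9092`. The topic name `orders`, the event shape, and the helper names `encode_event` and `publish` are made up for this example.

```python
import json


def encode_event(event: dict) -> bytes:
    """Serialize an event to UTF-8 JSON bytes, since Kafka stores raw bytes."""
    return json.dumps(event).encode("utf-8")


def publish(event: dict, topic: str = "orders",
            servers: str = "localhost:9092") -> None:
    """Publish one event to a topic (topic and address are assumptions)."""
    # Imported here so the module loads even without kafka-python installed.
    from kafka import KafkaProducer

    producer = KafkaProducer(
        bootstrap_servers=servers,
        value_serializer=encode_event,
    )
    producer.send(topic, event)  # asynchronous: the record is queued for sending
    producer.flush()             # block until queued records have been sent
    producer.close()
```

With a broker running, calling `publish({"id": 1, "status": "created"})` would send a single JSON-encoded event to the `orders` topic.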

Kafka Architecture

What's in a Kafka cluster?
  • Broker: a server with a Kafka process running on it. Each broker has its own local storage. A single Kafka cluster usually contains multiple brokers.