Kafka Streams is a powerful stream processing engine, and its stateful operations have allowed us to implement event-driven architectures in a simple, efficient, and productive way. Our use case is real estate listings websites, and at relatively low data volumes (a few million listings) everything worked out of the box. However, when we started scaling, things got more difficult: high latency on every rolling update, topologies eternally in rebalance, write stalls, excessive AWS bills, and even data loss. I will explain the actions we took that helped us scale our topologies to process hundreds of millions of listings: use Kubernetes StatefulSets, tune RocksDB configurations, use Horizontal Pod Autoscaling wisely, activate consumer rack awareness, and more.
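Several of the fixes above boil down to Kafka Streams configuration. As a minimal sketch, the snippet below builds a `Properties` object wiring three of them together: consumer rack awareness via the `consumer.`-prefixed `client.rack`, static group membership (a stable `group.instance.id` per StatefulSet pod, which avoids full rebalances on rolling updates), and a custom `rocksdb.config.setter`. The application id, broker address, availability zone, and the `TunedRocksDBConfig` class name are illustrative assumptions, not values from the talk.

```java
import java.util.Properties;

public class StreamsConfigSketch {

    // podOrdinal would come from the StatefulSet pod name (e.g. listings-0 -> 0).
    public static Properties build(int podOrdinal, String availabilityZone) {
        Properties props = new Properties();
        // Hypothetical application id and bootstrap servers.
        props.put("application.id", "listings-aggregator");
        props.put("bootstrap.servers", "broker:9092");

        // Rack awareness: setting the consumer's client.rack to the pod's AZ
        // lets consumers fetch from follower replicas in the same zone,
        // cutting cross-AZ traffic (and the AWS bill that comes with it).
        props.put("consumer.client.rack", availabilityZone);

        // Static membership: a stable group.instance.id per StatefulSet pod
        // means a restarted pod rejoins as the same member instead of
        // triggering a full consumer-group rebalance on every rolling update.
        props.put("consumer.group.instance.id", "listings-aggregator-" + podOrdinal);

        // Hook for RocksDB tuning (write buffers, compaction, block cache);
        // TunedRocksDBConfig is an assumed class implementing RocksDBConfigSetter.
        props.put("rocksdb.config.setter", "com.example.TunedRocksDBConfig");
        return props;
    }
}
```

Pairing the pod ordinal with a StatefulSet also keeps each instance attached to the same persistent volume, so RocksDB state stores survive restarts instead of being rebuilt from changelog topics.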