Apache Kafkaยฎ๏ธ ๋น„์šฉ ์ ˆ๊ฐ ๋ฐฉ๋ฒ• ๋ฐ ์ตœ์ ์˜ ๋น„์šฉ ์„ค๊ณ„ ์•ˆ๋‚ด ์›จ๋น„๋‚˜ | ์ž์„ธํžˆ ์•Œ์•„๋ณด๋ ค๋ฉด ์ง€๊ธˆ ๋“ฑ๋กํ•˜์„ธ์š”

Presentation

Enhancing Apache Kafka for Large Scale Real-Time Data Pipeline at Tencent

ยซ Kafka Summit APAC 2021

In this session we share our experience of building a real-time data pipelines at Tencent PCG - one that handles 20 trillion daily messages with 700 clusters and 100Gb/s bursting traffic from a single app. We discuss our roadmap of enhancing Kafka to break its limits in terms of scalability, robustness and cost of operation. We first built a proxy layer that aggregates physical clusters in a way agnostic to the clients. While this architecture solves many operational problems, it requires significant development to stay future-proof. With retrospection with our customer and careful study of the ongoing work from the community, we then designed a region federation solution in the broker layer, which allows us to deploy clusters at a much larger scale than previously possible, while at the same time providing better failure recovery and operability. We discuss how we make this development compatible with KIP-500 and KIP-405, and the two KIP (693, 694) that we submitted for discussion.

Chinese Japanese Korean

Related Links

How Confluent Completes Apache Kafka eBook

Leverage a cloud-native service 10x better than Apache Kafka

Confluent Developer Center

Spend less on Kafka with Confluent, come see how