Technology Blog

Data Products, Data Contracts, and Change Data Capture

Change data capture is a popular method to connect database tables to data streams, but it comes with drawbacks. The next evolution of the CDC pattern, first-class data products, provides resilient pipelines that support both real-time and batch processing while isolating upstream systems...

Adam Bellemare

Exploring Apache Flink 1.19: Features, Improvements, and More

Check out all the highlights from the Apache Flink® 1.19 release!

Martijn Visser

Introducing Apache Kafka 3.7

Apache Kafka 3.7 introduces updates to the consumer rebalance protocol, an official Apache Kafka Docker image, JBOD support in KRaft-based clusters, and more!

Stanislav Kozlovski

Real-time full-text search with Luwak and Samza

Apr 13, 2015

This is an edited transcript of a talk given by Alan Woodward and Martin Kleppmann at FOSDEM 2015. Traditionally, search works like this: you have a large corpus of documents, […]

Martin Kleppmann

A Comprehensive REST Proxy for Kafka

Mar 25, 2015

As part of Confluent Platform 1.0 released about a month ago, we included a new Kafka REST Proxy to allow more flexibility for developers and to significantly broaden the number […]

Ewen Cheslack-Postava

Apache Kafka 0.8.2.1 release

Mar 13, 2015

The Apache Kafka community just announced the 0.8.2.1 release. This is a bug fix release and fixes 4 critical issues reported in the 0.8.2.0 release (the full list of […]

Jun Rao

How to Choose the Number of Topics/Partitions in a Kafka Cluster?

Mar 12, 2015

Note: For the latest, check out the blog posts "Apache Kafka® Made Simple: A First Glimpse of a Kafka Without ZooKeeper" and "Apache Kafka Supports 200K Partitions Per Cluster."

Jun Rao

Turning the database inside-out with Apache Samza

Mar 1, 2015

This is an edited and expanded transcript of a talk I gave at Strange Loop 2014. The video recording (embedded below) has been watched over 8,000 times. For those of […]

Martin Kleppmann

Why Avro for Kafka Data?

Feb 25, 2015

If you are getting started with Kafka, one thing you’ll need to do is pick a data format. The most important thing to do is be consistent across your usage. […]

Jay Kreps

Announcing the Confluent Platform 1.0

Feb 25, 2015

We are very excited to announce general availability of Confluent Platform 1.0, a stream data platform powered by Apache Kafka, that enables high-throughput, scalable, reliable and low latency stream data […]

Neha Narkhede

Putting Apache Kafka To Use: A Practical Guide to Building an Event Streaming Platform (Part 2)

Feb 25, 2015

This is the second part of our guide on streaming data and Apache Kafka. In part one I talked about the uses for real-time data streams and explained the concept of […]

Jay Kreps

Putting Apache Kafka To Use: A Practical Guide to Building an Event Streaming Platform (Part 1)

Feb 25, 2015

Data systems have mostly focused on the passive storage of data. Phrases like “data warehouse” or “data lake” or even the ubiquitous “data store” all evoke places data goes to […]

Jay Kreps

Stream Processing, CEP, Event Sourcing, and Data Streaming Explained

Jan 29, 2015

Some people call it stream processing. Others call it event streaming, complex event processing (CEP), or CQRS event sourcing. Sometimes, such buzzwords are just smoke and mirrors, invented by companies […]

Martin Kleppmann

What’s coming in Apache Kafka 0.8.2

Dec 1, 2014

I am very excited to tell you about the forthcoming 0.8.2 release of Apache Kafka. Kafka is a fault-tolerant, low-latency, high-throughput distributed messaging system used in data pipelines at several […]

Neha Narkhede
