Version 3.2.2 of Kafka for JUnit released

Version 3.2.2 of Kafka for JUnit has been released. Please migrate to this patch version of Kafka for JUnit if you experience any issues with embedded Kafka Connect deployments with any of the former releases of the 3.2.x line.

more ...

Version 3.2.1 of Kafka for JUnit released

Version 3.2.1 of Kafka for JUnit has been released. It increases all Kafka dependencies to 3.2.3. If you're using Kafka for JUnit 3.2.0 and experience some odd behavior wrt. Kafka Clients (seemingly lost messages, ...) then please migrate to this patch version of Kafka for JUnit.

more ...


Sampling from data streams

Sampling data from a continuous stream of data is a useful technique to efficiently extrapolate information from a potentially large body of data. There are a couple of sampling strategies in literature that vary in their degree of complexity. I'd like to introduce you to a rather simple sampling strategy that is easy to implement as well as easy to reason about and might take you a long way until you have to go for more advanced solutions. I'm talking about Bernoulli sampling.

more ...