Version 3.2.1 of Kafka for JUnit released

Version 3.2.1 of Kafka for JUnit has been released. It upgrades all Kafka dependencies to 3.2.3. If you are using Kafka for JUnit 3.2.0 and see odd behavior in the Kafka clients (seemingly lost messages, for example), please migrate to this patch release of Kafka for JUnit.


Sampling from data streams

Sampling from a continuous stream of data is a useful technique for efficiently extrapolating information from a potentially large body of data. The literature describes several sampling strategies that vary in complexity. I'd like to introduce you to a rather simple strategy that is easy to implement, easy to reason about, and might take you a long way before you have to reach for a more advanced solution. I'm talking about Bernoulli sampling.
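
To give a feel for how little code the idea needs, here is a minimal sketch in Python. It is not necessarily the implementation discussed in the full article. It assumes the textbook definition of Bernoulli sampling: every element of the stream is kept independently with a fixed probability `p`. The names `bernoulli_sample`, `p` and `rng` are made up for this illustration.

```python
import random
from typing import Iterable, Iterator, Optional, TypeVar

T = TypeVar("T")


def bernoulli_sample(
    stream: Iterable[T],
    p: float,
    rng: Optional[random.Random] = None,
) -> Iterator[T]:
    """Lazily yield each element of `stream` with independent probability `p`.

    Hypothetical helper for illustration only. Because this is a generator,
    the argument check below runs when iteration starts, not at call time.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be within [0, 1]")
    if rng is None:
        rng = random.Random()
    for item in stream:
        # random() returns a float in [0.0, 1.0), so p == 1.0 keeps every
        # element and p == 0.0 keeps none.
        if rng.random() < p:
            yield item
```

Calling `bernoulli_sample(events, 0.1)` on some iterable `events` would keep roughly one element in ten. The sampler holds no state beyond the random number generator and works on unbounded streams, but the size of the resulting sample is itself random: for `n` elements seen you get about `n * p` of them, not exactly that many.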


Hi there! I'm Markus!

I'm an independent freelance IT consultant, a well-known expert in Apache Kafka and Apache Solr, a software architect (iSAQB certified), and a trainer.

How can I support you?