Bloom Filters Mathmania

So, you’ve come across Bloom filters and understand that, despite their probabilistic nature, they are a great fit for your use case. You’ve decided to integrate them into your system design, but you’re unsure about the optimal size and the number of hash functions needed for your…

Using Binomial Distribution to Model Data Durability

Durability requirements influence the choice of data protection mechanisms, such as replication, erasure coding, and RAID parity configurations. Achieving higher durability involves trade-offs between redundancy, storage usage ratio, and computational complexity. Replication achieves durability by creating multiple copies of data, which increases redundancy but reduces the storage usage ratio. In…

Redpanda Highlights

What's Redpanda? Redpanda, a Kafka® replacement for mission critical systems: * 10X Faster * Kafka® API Compatible - no code changes * Easy to use - No Zookeeper®, No JVM - built in C++. One binary. Redpanda is a modern streaming data platform. It essentially is a drop-in replacement of Apache…