Discover more from SUP! Hubert’s Substack
Streaming Updates for da Peeps
____ _ _ ____ _ / ___|| | | | _ \| | \___ \| | | | |_) | | ___) | |_| | __/|_| |____/ \___/|_| (_)
Another few weeks have gone by and a lot has happened in the data streaming / real-time analytical domains. Starting today, I’ll be adding videos of interviews. As part of our research on “Streaming Databases”, Ralph Debusmann and I are interviewing new and existing players in the real-time streaming space. They are very raw because I suck at video editing (and as a host 😉, I’m no Tim Berglund). Subscribe below.
Ralph Debusmann and I interviewed Dolt CEO, Timothy Sehn. Dolt is Git for data, The world's first and only version-controlled SQL database. It’s a new way of thinking about how to share, collaborate, and release data. Watch the interview below.
The streaming ecosystem grows with WarpStream. WarpStream is a Kafka protocol-compatible data streaming platform built directly on top of S3. 🤯
If your workload can tolerate a P99 of ~1s of producer-to-consumer latency, then WarpStream can reduce your total data streaming costs by 5-10x per GiB, with almost zero operational overhead.
SUP Redpanda and RisingWave - Non-JVMs makin’ Friends!
That's a SUPER COOL feature!!! Let me try to understand: so redpandadata's tiered storage data will be written and read using Iceberg format, correct? That essentially means, Redpanda will be evolving to a streaming lake house, correct?
That's cool! RisingWaveLabs may be the query engine - we've evaluated RisingWave's SQL on Kafka performance, and the initial result looks very promising! Will evaluate RisingWave SQL on Redpanda!!
SUP Giannis Polyzos! Flink book coming out soon!!
Giannis gets extremely hands-on with Flink in his book “Stream Processing, Hands On with Apache Flink.” I had the privilege to review the book and I learned a lot!!!
The book is a good reference for devs that are new to Flink. The easier it is to find the answers to their questions the more they will use it as a reference.
Get a preview of it here.
PeerDB is an ETL/ELT tool built for PostgreSQL. It enables you to Stream Query Results from a source Postgres database to a target Postgres database.
Definitely, a lot of action at the edge. Replication with transformation in between Postgres databases.
SUP Michael Drogalis!! Good blog “Real-Time Generative AI”
If you’re trying to find a relationship between streaming and GenAI, this is a good one.
SUP Mihai Budiu!! All Databases are Streaming Databases!!!
SUP Mark! Learn about Rollup Segments in Apache Pinot!
That’s it. I still don’t know how to end a Newsletter.
Hubert’s Substack is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber. Expense it brah.