Oddbean new post about | logout
 The data lakehouse market is rapidly evolving, offering a modular and composable architecture that reduces data movement and costs. Key trends include the rise of Apache Iceberg as the industry standard for data lakehouse tables, with major announcements from companies like Dremio, Snowflake, and AWS. Native streaming pipelines can be built using open-source tools like Kafka Connect, Flink, and Spark Streaming. To address governance challenges, catalogs are gaining importance, with options like Apache Polaris, Gravitino, Nessie, and Unity OSS providing tracking and governing capabilities. Hybrid data lakehouse models are also emerging, with vendors like Minio, Pure Storage, Vast Data, and NetApp offering high-performance storage.

Source: https://dev.to/alexmercedcoder/data-lakehouse-roundup-1-news-and-insights-on-the-lakehouse-218