A Tale of Many Streams: Characterizing a Hybrid Batch-Stream Production Workload in Digazu, a Data Lake supported by Apache Kafka and Flink
Published in DEBS '25: Proceedings of the 19th ACM International Conference on Distributed and Event-based Systems, 2025
Recommended citation: Schmitz, D., Berrewaerts, L., Rosinosky, G., Skhiri, S., & Rivière, E. (2025, June). A Tale of Many Streams: Characterizing a Hybrid Batch-Stream Production Workload in Digazu, a Data Lake supported by Apache Kafka and Flink. In Proceedings of the 19th ACM International Conference on Distributed and Event-based Systems (pp. 188-198). https://dl.acm.org/doi/full/10.1145/3701717.3734462
