Sunday 1:50 p.m.–2:20 p.m.

One Data Pipeline to Rule Them All

Sam Kitajima-Kimbrel

Description

There are myriad data storage systems available for every use case imaginable, but letting application teams choose storage engines independently can lead to duplicated effort and reinvented wheels. This talk will explore how to build a reusable data pipeline based on Kafka that supports multiple applications, datasets, and use cases, including archival, warehousing and analytics, stream and batch processing, and low-latency "hot" storage.