Chris Riccomini

429 days ago

Creating a Data Pipeline with the Kafka Connect API – from Architecture to Operations

Pandora began adopting Apache Kafka™ in 2016 to orient its infrastructure around real-time stream processing analytics. As a data-driven company, we have a several-thousand-node Hadoop cluster with hundreds of Hive tables critical to Pandora’s operational and reporting success.