Chris Riccomini

94 days ago

Hoodie: Uber Engineering’s Incremental Processing Framework on Hadoop

eng.uber.com

With the evolution of storage formats like Apache Parquet and Apache ORC and query engines like Presto and Apache Impala, the Hadoop ecosystem has the potential to become a general-purpose, unified serving layer for workloads that can tolerate latencies of a few minutes.