flume-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Kristoffer Sjögren <sto...@gmail.com>
Subject Parquet buffering capability
Date Sat, 23 Aug 2014 08:51:25 GMT

Does flume have support for buffering/staging avro events locally on disk
and storing them in hdfs as parquet files?

Cloudera CDK explains [1] how to do this method manually but ideally I want
this process directly integrated into the flume runtime.


1. https://github.com/cloudera/cdk-examples/tree/master/dataset-staging

View raw message