flume-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Brock Noland <br...@cloudera.com>
Subject Re: HBase sink
Date Sun, 29 Jul 2012 16:32:20 GMT

You might want to write a class which implements HbaseEventSerializer. To
test, SimpleHbaseEventSerializer might fit your needs. Using timestamps as
keys will create hotspots. As such it's recommended you "bucket" so that
each write is hitting say 1 of 32 buckets. Each agent would be configured
as follows:

...rowPrefix = bucket
...suffix = timestamp


On Sun, Jul 29, 2012 at 11:15 AM, Mohit Anchlia <mohitanchlia@gmail.com>wrote:

> I am looking at example in
> http://flume.apache.org/FlumeUserGuide.html#hbasesinks but this doesn't
> show me how to generate row keys. My row keys are timeseries data.

Apache MRUnit - Unit testing MapReduce - http://incubator.apache.org/mrunit/

View raw message