[bcc flume-user@cloudera.org (deprecated), cc
flume-user@incubator.apache.org]
Brian,
The easiest way is to use the regex decorator to create a new attribute and
use that attribute as to do output bucketing.
http://archive.cloudera.com/cdh/3/flume/UserGuide/index.html#_extractors
Jon.
On Mon, Jul 25, 2011 at 5:50 PM, Brian Tran <briantran86@gmail.com> wrote:
> I want to do output bucketing based on the tailSrcFile metadata value
> set by the tailDir source. However, I only want part of the value for
> the destination path in HDFS.
>
> For example, I have an event with the tailSrcFile value
> "unwanted_prefix_category_name-2011-07-25.log" but only want to use
> "category_name" for output bucketing.
>
> What is the easiest way to do this?
>
--
// Jonathan Hsieh (shay)
// Software Engineer, Cloudera
// jon@cloudera.com
|