flume-user mailing list archives

From Mohammad Tariq <donta...@gmail.com>
Subject Re: Writing to HDFS from multiple HDFS agents (separate machines)
Date Thu, 14 Mar 2013 22:00:26 GMT
Hello sir,

    One idea could be to create sub-directories named after the machines'
hostnames, in case you are getting data from multiple sources. You can
then easily find out which data belongs to which machine.
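
A minimal sketch of how this could look in the agent configuration, assuming the host interceptor and the HDFS sink's %{header} path substitution behave in your Flume version as described in the Flume User Guide. The names agent1, src1, ch1, sink1 and hostint, and the /flume/events base path, are made up for illustration. This is only a fragment: the source and channel types are left out.

```properties
# Hypothetical component names; replace with your own.
agent1.sources = src1
agent1.channels = ch1
agent1.sinks = sink1

# The host interceptor adds a "host" header to every event,
# holding the hostname of the machine this agent runs on.
agent1.sources.src1.channels = ch1
agent1.sources.src1.interceptors = hostint
agent1.sources.src1.interceptors.hostint.type = host
agent1.sources.src1.interceptors.hostint.useIP = false
agent1.sources.src1.interceptors.hostint.hostHeader = host

# The HDFS sink substitutes %{host} with the value of that header,
# so each machine writes into its own sub-directory.
agent1.sinks.sink1.type = hdfs
agent1.sinks.sink1.channel = ch1
agent1.sinks.sink1.hdfs.path = /flume/events/%{host}

# Optional: a fixed per-agent file prefix (assumed naming scheme),
# which also keeps file names distinct across sinks.
agent1.sinks.sink1.hdfs.filePrefix = agent1-sink1
```

Since each machine has its own configuration file, the static filePrefix can simply differ per agent. The %{host} path keeps the configuration itself identical everywhere.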

Warm Regards,

On Fri, Mar 15, 2013 at 3:24 AM, Gary Malouf <malouf.gary@gmail.com> wrote:

> Hi guys,
> I'm new to Flume (and HDFS, for that matter), using the version packaged
> with CDH4 (1.3.0), and was wondering how others keep the file names
> written by each HDFS sink distinct.
> My initial thought is to create a separate sub-directory in HDFS for each
> sink - though I feel like the better way is to somehow prefix each file
> with a unique sink id.  Are there any patterns that others are following
> for this?
> -Gary
