flume-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From David Sinclair <dsincl...@chariotsolutions.com>
Subject HDFS Sink Memory Leak
Date Mon, 11 Nov 2013 15:01:20 GMT
Hi all,

I have been investigating an OutOfMemory error when using the HDFS event
sink. I have determined the problem to be with the

WriterLinkedHashMap sfWriters;

Depending on how you generate your file name/directory path, you can run
out of memory pretty quickly. You need to either set the *idleTimeout* to
some non-zero value or set the number of *maxOpenFiles*.

The map keeps references to BucketWriter around longer than they are
needed. I was able to reproduce this consistently and took a heap dump to
verify that objects being kept around.

I will update this Jira to reflect my findings



View raw message