flume-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Felix.徐 <ygnhz...@gmail.com>
Subject Loading collected files of HDFS sink into Hive
Date Wed, 27 Mar 2013 16:31:24 GMT

I've got 2 problems while trying to load files generated by HDFS sink into

1.I set the HDFS sink with hdfs.fileType=DataStream and a table with
TextFile in Hive, when I try to load data from HDFS sink to Hive, I get the
following exception in Hive:

Failed with exception Wrong file format, pease check the file's format

2.How to pick out the closed files and ignore the files with suffix of .tmp
while loading data to Hive?

View raw message