phoenix-user mailing list archives

From "Bulvik, Noam" <>
Subject RE: high CPU when using bulk loading
Date Wed, 07 Jan 2015 13:29:58 GMT
Only when doing bulk loading, and only during the map phase.

-----Original Message-----
From: Puneet Kumar Ojha []
Received: Wednesday, 07 Jan 2015, 15:03
To: []
Subject: RE: high CPU when using bulk loading

Is the CPU usage 100% all the time OR only while doing bulk loading?

From: Bulvik, Noam []
Sent: Wednesday, January 07, 2015 6:26 PM
Subject: high CPU when using bulk loading


We are tuning our system for bulk loading. We managed to load ~250M records per hour (~96G
of raw input CSV data) on a cluster with 8 nodes. We use the MR bulk loading tool with a
pre-split table and salted keys.

What we currently see is that while the mappers are working we have 100% CPU usage across the
cluster. It was our impression that the mappers would be I/O bound and not so CPU intensive.

Any idea what else we can tune or check?
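
For a rough sense of scale, the figures quoted above can be turned into per-second and per-node rates. This is only a back-of-the-envelope sketch: it assumes "96G" means 96 × 10^9 bytes and that the load is spread evenly over the 8 nodes, neither of which is stated in the message. The function and key names are made up for illustration.

```python
# Figures quoted in the message. The unit interpretation is an assumption.
RECORDS_PER_HOUR = 250_000_000
BYTES_PER_HOUR = 96 * 10**9  # assumes "96G" means 96 * 10^9 bytes
NODES = 8                    # assumes load is spread evenly across nodes


def throughput(records_per_hour, bytes_per_hour, nodes):
    """Convert hourly cluster totals into per-second and per-node rates."""
    seconds = 3600
    return {
        "records_per_sec_cluster": records_per_hour / seconds,
        "records_per_sec_node": records_per_hour / seconds / nodes,
        "mb_per_sec_cluster": bytes_per_hour / seconds / 1e6,
        "mb_per_sec_node": bytes_per_hour / seconds / nodes / 1e6,
        "avg_record_bytes": bytes_per_hour / records_per_hour,
    }


stats = throughput(RECORDS_PER_HOUR, BYTES_PER_HOUR, NODES)
for name, value in stats.items():
    print(f"{name}: {value:,.1f}")
```

Under those assumptions each node reads only a few MB of raw CSV per second, at roughly 384 bytes per record. That would suggest the raw input read rate is not the limiting factor, although it says nothing about the volume written or shuffled.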



