phoenix-user mailing list archives

From Wangwenli <wangwe...@huawei.com>
Subject Re: RE: high CPU when using bulk loading
Date Wed, 07 Jan 2015 13:42:11 GMT
What kind of disks are you using, SAS or SATA? How is the CPU usage split between system and user?
Also, can you use jstack to check what the map tasks are doing?
Are too many map tasks started on one node?
________________________________
Wangwenli
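
One way to answer the system/user question on a Linux node is to sample the aggregate "cpu" line of /proc/stat twice and compare the counters. The following is only a minimal sketch under that assumption: the function name cpu_split is hypothetical, and the two sample lines are made-up illustrative values, not measurements from this cluster.

```python
# Field order of the aggregate "cpu" line in /proc/stat (Linux).
FIELDS = ["user", "nice", "system", "idle", "iowait", "irq", "softirq", "steal"]


def cpu_split(before: str, after: str) -> dict:
    """Return the percentage of CPU time per category between two samples
    of the aggregate 'cpu' line from /proc/stat."""

    def parse(line: str) -> list:
        # Skip the leading "cpu" label and keep only the counters we name.
        return [int(x) for x in line.split()[1:1 + len(FIELDS)]]

    deltas = [b - a for a, b in zip(parse(before), parse(after))]
    total = sum(deltas)
    if total <= 0:
        raise ValueError("samples are identical or out of order")
    return {name: 100.0 * d / total for name, d in zip(FIELDS, deltas)}


# Illustrative (invented) samples taken some seconds apart.
sample_before = "cpu  1000 0 200 8000 100 0 0 0 0 0"
sample_after = "cpu  1900 0 250 8030 120 0 0 0 0 0"

split = cpu_split(sample_before, sample_after)
for name, pct in split.items():
    print(f"{name:8s} {pct:5.1f}%")
```

On a real node the two lines would be read from /proc/stat a few seconds apart while the mappers are running. A high "user" share would point at the map tasks themselves, while a high "system" or "iowait" share would point more towards the kernel or the disks.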

From: Bulvik, Noam <Noam.Bulvik@teoco.com>
Date: 2015-01-07 21:29
To: user@phoenix.apache.org
Subject: RE: high CPU when using bulk loading
Only when doing bulk loading, and only during the map phase.

-----Original Message-----
From: Puneet Kumar Ojha [puneet.kumar@pubmatic.com]
Received: Wednesday, 07 Jan 2015, 15:03
To: user@phoenix.apache.org [user@phoenix.apache.org]
Subject: RE: high CPU when using bulk loading

Is the CPU usage at 100% all the time, or only while doing bulk loading?



From: Bulvik, Noam [mailto:Noam.Bulvik@teoco.com]
Sent: Wednesday, January 07, 2015 6:26 PM
To: user@phoenix.apache.org
Subject: high CPU when using bulk loading


Hi,

We are tuning our system for bulk loading. We managed to load ~250M records per hour (~96G
of raw input CSV data) on a cluster with 8 nodes. We use the MR bulk loading tool with a pre-split
table and a salted key.

What we currently see is that while the mappers are working, we have 100% CPU usage across the
cluster. It was our impression that the mappers would be I/O bound and not so CPU intensive.
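
For a rough sense of scale, the figures above can be turned into per-node rates. This is only a back-of-the-envelope sketch: it assumes "G" means 10^9 bytes and that the load is spread evenly over the 8 nodes, and the function name per_node_rates is hypothetical.

```python
def per_node_rates(records_per_hour: float, bytes_per_hour: float, nodes: int) -> dict:
    """Convert hourly cluster totals into per-node, per-second rates."""
    seconds_per_hour = 3600
    return {
        "records_per_sec_per_node": records_per_hour / seconds_per_hour / nodes,
        "mb_per_sec_per_node": bytes_per_hour / seconds_per_hour / nodes / 1e6,
        "bytes_per_record": bytes_per_hour / records_per_hour,
    }


# Figures quoted in the message: ~250M records and ~96G of CSV per hour, 8 nodes.
# Assumption: 96G is taken as 96e9 bytes.
rates = per_node_rates(records_per_hour=250e6, bytes_per_hour=96e9, nodes=8)

for name, value in rates.items():
    print(f"{name}: {value:.1f}")
```

Under these assumptions each node reads only a few MB of raw input per second, at a few hundred bytes per record. This counts input only, not the shuffle or the files the job writes, so it is not proof either way, but it suggests that reading the CSV alone is unlikely to saturate the disks.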

Any idea what else we can tune or check?


Regards

Noam


