I want to measure the throughput at each datanode by timing each read/write operation. It is hard to read through the many functions involved and work out where this actually happens. Could someone list the sequence of calls made while reading/writing a block of data? I am using version 1.0.1. Alternatively, if there is already an API that measures this at the datanode, I could use that information instead.
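For context, in the 1.0.x source the path is roughly: DataXceiverServer accepts the client socket and spawns a DataXceiver thread, whose run() dispatches on the DataTransferProtocol op code; OP_READ_BLOCK goes to readBlock(), which streams the block through BlockSender.sendBlock(), and OP_WRITE_BLOCK goes to writeBlock(), which receives it through BlockReceiver.receiveBlock(). Below is a minimal sketch of the kind of timing wrapper one could add around those dispatch points; the OpTimer class is hypothetical, not part of Hadoop:

```java
// Hypothetical helper for timing a single block op inside DataXceiver.
// Not part of Hadoop; it only illustrates where a measurement could be
// taken around the readBlock()/writeBlock() dispatch in 1.0.x.
public final class OpTimer {
    private final String op;       // e.g. "readBlock" or "writeBlock"
    private final long startNanos; // start of the operation

    private OpTimer(String op) {
        this.op = op;
        this.startNanos = System.nanoTime();
    }

    public static OpTimer start(String op) {
        return new OpTimer(op);
    }

    /** Call from a finally block once the op completes. */
    public void stop(long bytesTransferred) {
        long elapsedMs = (System.nanoTime() - startNanos) / 1000000L;
        double mbPerSec = elapsedMs > 0
            ? (bytesTransferred / (1024.0 * 1024.0)) / (elapsedMs / 1000.0)
            : 0.0;
        // Inside the datanode you would write to its log instead of stdout.
        System.out.printf("%s: %d bytes in %d ms (%.2f MB/s)%n",
                          op, bytesTransferred, elapsedMs, mbPerSec);
    }
}
```

Note that DataXceiver already records per-op latency into the datanode's metrics (the readBlockOp/writeBlockOp rates; names may differ slightly by minor version), so it is worth checking those counters before patching the source.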

The time for any specific read/write operation on a Hadoop cluster and its datanodes can vary significantly. I don't suppose you are running an industrial-strength management tool like Cloudera on your cluster, are you? Tools like that do track those sorts of metrics and can alert you when they exceed specified limits.

No. I cannot assume any metrics logging system like Ganglia is present. This must work on a "vanilla" Hadoop distribution.
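For what it's worth, even a vanilla Hadoop 1.x datanode exposes its internal metrics over the embedded web server (default port 50075), including the per-operation read/write timings mentioned above, so patching may not be necessary. Here is a rough sketch of polling that endpoint; the plain-text /metrics servlet is standard on 1.x daemons, but the exact metric names (readBlockOp/writeBlockOp is what I'd expect) should be verified against your build:

```java
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.URL;

// Polls a datanode's built-in metrics servlet and prints the lines that
// mention block read/write operations. Assumes the default datanode web
// port (50075) and the plain-text /metrics servlet shipped with Hadoop
// 1.x; inspect the full output first, since metric names can vary by
// minor version.
public class DataNodeMetricsPoller {
    public static void main(String[] args) throws Exception {
        String host = args.length > 0 ? args[0] : "localhost";
        URL url = new URL("http://" + host + ":50075/metrics");
        BufferedReader in = new BufferedReader(
                new InputStreamReader(url.openStream()));
        try {
            String line;
            while ((line = in.readLine()) != null) {
                // Keep only the block op timing lines.
                if (line.contains("readBlockOp") || line.contains("writeBlockOp")) {
                    System.out.println(line);
                }
            }
        } finally {
            in.close();
        }
    }
}
```

Running it as `java DataNodeMetricsPoller <datanode-host>` against each datanode in turn would give you per-node numbers without any external monitoring system.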
