How to calculate the number of DataNodes in Hadoop

————————————–
Data – 100 TB

Replication – 3

HDFS storage required – 100 TB × 3 = 300 TB

Hard drive per node – 24 TB raw (10 TB used in this calculation)

Overhead – 30%

Available per node – 10 TB × (1 − 0.30) = 7 TB

DataNode requirement – 300 TB / 7 TB ≈ 42.9, rounded up to 43 machines
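
The same arithmetic can be written as a small Python sketch, which makes it easy to try other inputs. The function name `datanodes_needed` and its parameter names are made up for illustration; it assumes, as the figures above do, that 10 TB per node is the starting point before the 30% overhead is taken off, and that the result is rounded up to a whole machine.

```python
import math

def datanodes_needed(data_tb, replication, disk_per_node_tb, overhead_percent):
    """Rough DataNode estimate (illustrative helper, not a Hadoop API)."""
    # Total HDFS storage once every block is replicated: 100 * 3 = 300 TB
    hdfs_tb = data_tb * replication
    # Space left on each node after overhead: 10 * 70 / 100 = 7 TB
    available_tb = disk_per_node_tb * (100 - overhead_percent) / 100
    # A fraction of a machine is not possible, so round up
    return math.ceil(hdfs_tb / available_tb)

nodes = datanodes_needed(data_tb=100, replication=3,
                         disk_per_node_tb=10, overhead_percent=30)
print(nodes)  # 43
```

Changing the replication factor or the overhead percentage shows how sensitive the machine count is to each assumption.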
