Data Storage Hadoop

  • Updated 2 years ago
Hello Everyone,

I am new to Hadoop and Hive.
I have a few doubts about how data is stored on the DataNodes.

 1) When data is loaded into HDFS, it is broken into blocks of 64 MB (configurable), and each block is registered with the connected NameNode.
 2) In Hive we have the concepts of partitioning and bucketing, which create multiple sub-folders inside the main folder to bring something like indexing to Hadoop.
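
To make the two points concrete, here is a rough sketch of how I picture them fitting together. It is only my assumption, not something I have confirmed: the partition folders are laid out first, and each file inside them is then split into blocks separately. The table path, file sizes, bucket file names, the replication factor of 3, and the function names are all hypothetical and only for illustration.

```python
import math

BLOCK_SIZE_MB = 64   # block size from point 1 (configurable in HDFS)
REPLICATION = 3      # assumed replication factor, not stated above


def partition_path(table_dir, partitions):
    """Build a Hive-style partition folder, e.g. /warehouse/sales/year=2013."""
    parts = [f"{col}={val}" for col, val in partitions]
    return "/".join([table_dir.rstrip("/")] + parts)


def blocks_for_file(size_mb, block_size_mb=BLOCK_SIZE_MB):
    """Number of blocks one file of the given size would be split into."""
    return math.ceil(size_mb / block_size_mb)


# Hypothetical table: one partition column, two bucket files per partition.
# The values are file sizes in MB.
files = {
    partition_path("/warehouse/sales", [("year", 2013)]): [150, 40],
    partition_path("/warehouse/sales", [("year", 2014)]): [200, 64],
}

for directory, bucket_sizes in files.items():
    for i, size in enumerate(bucket_sizes):
        n = blocks_for_file(size)
        print(f"{directory}/bucket_{i}: {size} MB -> "
              f"{n} block(s), {n * REPLICATION} block replicas in total")
```

In this picture the folders are just paths in the file system namespace, and only the files under them are cut into blocks.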

Do the same folders get replicated across all the DataNodes, and is the data chunked first, with the partitioning and bucketing rules applied afterwards?
Basically, I am not able to relate these two assumptions to each other.
Any article on this would be a big help.

Ritesh Agarwal

Posted 2 years ago
