The usual starting point is the documentation and tutorials on hadoop.apache.org. If you have questions, read the architecture documents carefully.
In particular, please read the following.
HDFS: http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsDesign.html
YARN: http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html
Hadoop itself consists of a distributed file system (HDFS) and a distributed computing system (YARN), and various platforms are built on top of them. Besides the official Hadoop site, the websites of companies such as Cloudera and Hortonworks are good sources of information about the platforms that work with Hadoop.
If you are studying Hadoop for the first time, it is a good idea to work through an introductory book in Korean. However, because open source projects adopt new technologies over time, a solid understanding of HDFS, MapReduce, and YARN is what will help you learn new technologies and platforms as they appear.