在Debian上配置Hadoop的步骤大致可以分为以下几个部分:
sudo apt update
sudo apt upgrade
sudo apt install openjdk-8-jdk
echo "export JAVA_HOME=/usr/lib/jvm/jdk1.8.0_202" >> /etc/profile
echo "export PATH=$JAVA_HOME/bin:$PATH" >> /etc/profile
source /etc/profile
java -version
wget https://downloads.apache.org/hadoop/common/hadoop-3.3.1/hadoop-3.3.1.tar.gz
tar -xzvf hadoop-3.3.1.tar.gz -C /usr/local/
echo "export HADOOP_HOME=/usr/local/hadoop" >> /etc/profile
echo "export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin" >> /etc/profile
source /etc/profile
<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://namenode:9000</value>
</property>
</configuration>
<configuration>
<property>
<name>dfs.replication</name>
<value>3</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<value>/usr/local/hadoop/dfs/name</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>/usr/local/hadoop/dfs/data</value>
</property>
</configuration>
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
</configuration>
ssh-keygen -t rsa
ssh-copy-id root@namenode
ssh-copy-id root@slave1
ssh-copy-id root@slave2
hdfs namenode -format
start-dfs.sh
start-yarn.sh
可以通过Hadoop的Web界面检查集群的状态,例如NameNode的Web界面通常在http://namenode:9000
。
请注意,以上步骤可能会根据具体的Hadoop版本和需求有所不同。建议参考官方文档以获取最准确的配置指南。
亿速云「云服务器」,即开即用、新一代英特尔至强铂金CPU、三副本存储NVMe SSD云盘,价格低至29元/月。点击查看>>
相关推荐:Debian上Hadoop安装步骤是什么