在Ubuntu上安装Hadoop的步骤大致可以分为以下几个部分:
/usr/lib/jvm/java-8-openjdk-amd64
。~/.bashrc
或/etc/profile
文件中:export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
export JRE_HOME=$JAVA_HOME/jre
export CLASSPATH=.:$JAVA_HOME/lib:$JRE_HOME/lib
export PATH=$JAVA_HOME/bin:$PATH
source ~/.bashrc
sudo apt-get install openssh-server
/usr/local/hadoop
:sudo tar -zxvf hadoop-3.3.6.tar.gz -C /usr/local
sudo chown -R hadoop:hadoop /usr/local/hadoop
/usr/local/hadoop/etc/hadoop/hadoop-env.sh
文件,配置JDK路径。/usr/local/hadoop/etc/hadoop/core-site.xml
,配置HDFS默认文件系统和临时目录。/usr/local/hadoop/etc/hadoop/hdfs-site.xml
,配置HDFS副本数和目录。/usr/local/hadoop/etc/hadoop/mapred-site.xml
,配置MapReduce框架。/usr/local/hadoop/etc/hadoop/yarn-site.xml
,配置YARN资源管理器。cd /usr/local/hadoop/sbin
./hdfs namenode -format
./start-dfs.sh
./start-yarn.sh
mkdir -p /usr/local/hadoop/input
cp /usr/local/hadoop/etc/hadoop/*.xml /usr/local/hadoop/input
./bin/hadoop fs -put /home/hadoop/input /user/root/input
./bin/hadoop jar /usr/local/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.6.jar wordcount /user/root/input /user/root/output
cat /usr/local/hadoop/output/*
请注意,以上步骤是基于Hadoop 3.3.6版本的安装指南,不同版本的Hadoop可能会有细微的配置差异。此外,如果是安装完全分布式集群,还需要配置从节点,并确保所有节点之间的网络通信正常。