Hadoop在Linux上的网络配置要点主要包括以下几个方面:
sudo hostnamectl set-hostname hadoop-master
/etc/network/interfaces文件(Debian/Ubuntu)或/etc/sysconfig/network-scripts/ifcfg-eth0文件(CentOS/RHEL),设置静态IP地址。/etc/hosts文件:
添加所有节点的IP地址和主机名,例如:192.168.1.101 hadoop-master
192.168.1.102 hadoop-slave1
192.168.1.103 hadoop-slave2
sudo firewall-cmd --permanent --zone=public --add-port=50010/tcp
sudo firewall-cmd --permanent --zone=public --add-port=50020/tcp
sudo firewall-cmd --permanent --zone=public --add-port=50070/tcp
sudo firewall-cmd --permanent --zone=public --add-port=50075/tcp
sudo firewall-cmd --permanent --zone=public --add-port=50090/tcp
sudo firewall-cmd --permanent --zone=public --add-port=8030/tcp
sudo firewall-cmd --permanent --zone=public --add-port=8031/tcp
sudo firewall-cmd --permanent --zone=public --add-port=8032/tcp
sudo firewall-cmd --permanent --zone=public --add-port=8088/tcp
sudo firewall-cmd --permanent --zone=public --add-port=9000/tcp
sudo firewall-cmd --reload
core-site.xml:<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://hadoop-master:9000</value>
</property>
</configuration>
hdfs-site.xml:<configuration>
<property>
<name>dfs.replication</name>
<value>3</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<value>/path/to/namenode/dir</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>/path/to/datanode/dir</value>
</property>
</configuration>
yarn-site.xml:<configuration>
<property>
<name>yarn.resourcemanager.hostname</name>
<value>hadoop-master</value>
</property>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
</configuration>
mapred-site.xml:<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
ssh-keygen -t rsa
ssh-copy-id hadoop-slave1
ssh-copy-id hadoop-slave2
hdfs namenode -format
start-dfs.sh
start-yarn.sh
log4j.properties文件,调整日志级别以便于调试和监控。通过以上步骤,你可以确保Hadoop在Linux上的网络配置正确无误,从而保证集群的稳定运行和高效性能。