Hadoop单机部署和测试
- 一.单机部署
- 1.安装 JDK(JDK11)
- 2.安装 HADOOP
- 3.测试
一.单机部署
系统版本:cat /etc/anolis-release
1.安装 JDK(JDK11)
#!/bin/bashTOP_PATH=$(pwd)
JAVA_PATH=/usr/local/java
FILE=`ls $TOP_PATH/ | grep jdk | grep .tar.gz | grep -v grep `## 清空历史版本
if [ -d $JAVA_PATH ];thenrm -rf $JAVA_PATH
fi
## 创建安装目录
mkdir -p $JAVA_PATH
## 解压
tar -zxvf $TOP_PATH/$FILE -C $JAVA_PATH
## 设置环境变量 注意路径上的版本信息 jdk-11.0.19
echo "export JAVA_HOME=/usr/local/java/jdk-11.0.19" >> /etc/profile
echo "export CLASSPATH=$JAVA_HOME/lib:$CLASSPATH" >> /etc/profile
echo "export PATH=$JAVA_HOME/bin:$PATH" >> /etc/profile
source /etc/profile
查看版本:java -version
2.安装 HADOOP
下载:Apache Hadoop
#!/bin/bash
TOP_PATH=$(pwd)
HADOOP_PATH=/usr/local/hadoop
FILE=`ls $TOP_PATH/ | grep hadoop | grep .tar.gz | grep -v grep `## 清空历史版本
if [ -d $HADOOP_PATH ];thenrm -rf $HADOOP_PATH
fiHAD
## 创建安装目录
mkdir -p $HADOOP_PATH
## 解压
tar -zxvf $TOP_PATH/$FILE -C $HADOOP_PATH
## 设置环境变量 注意路径上的版本信息 hadoop-3.3.6
echo "export HADOOP_HOME=/usr/local/hadoop/hadoop-3.3.6" >> /etc/profile
echo "export PATH=${PATH}:${HADOOP_HOME}/bin:${HADOOP_HOME}/sbin" >> /etc/profile
source /etc/profile
查看版本:hadoop version
3.测试
## 创建测试目录
mkdir /home/input
## 拷贝测试数据
cp $HADOOP_HOME/etc/hadoop/*.xml /home/input/
cd $HADOOP_HOME/
## 测试命令
./bin/hadoop jar ./share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.6.jar grep /home/input/ /home/output 'dfs[a-z]+'
查看输出结果:cat /home/output/part-r-00000
验证结果正确性:find /home/input -type f -name "*.xml" | xargs grep dfs
创建一个测试文件:touch /home/input/text.xml && echo dfstext >> /home/input/text.xml && rm -rf /home/output
重新统计:./bin/hadoop jar ./share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.6.jar grep /home/input/ /home/output 'dfs[a-z]+'
查看结果:cat /home/output/part-r-00000