Home > Operation and Maintenance > Linux Operation and Maintenance > How to install Hadoop in linux

How to install Hadoop in linux

WBOY
Release: 2023-05-18 20:19:13
forward
4489 people have browsed it

1: Install JDK

1. Execute the following command to download the JDK1.8 installation package.

wget --no-check-certificate https://repo.huaweicloud.com/java/jdk/8u151-b12/jdk-8u151-linux-x64.tar.gz
Copy after login

2. Execute the following command to decompress the downloaded JDK1.8 installation package.

tar -zxvf jdk-8u151-linux-x64.tar.gz
Copy after login

3. Move and rename the JDK package.

mv jdk1.8.0_151/ /usr/java8
Copy after login

4. Configure Java environment variables.

echo 'export JAVA_HOME=/usr/java8' >> /etc/profile
echo 'export PATH=$PATH:$JAVA_HOME/bin' >> /etc/profile
source /etc/profile
Copy after login

5. Check whether Java is successfully installed.

java -version
Copy after login

2: Install Hadoop

Note: To download the Hadoop installation package, you can choose Huawei source (the speed is medium, acceptable, the focus is on the full version), Tsinghua source (3.0.0 or above The version download speed is too slow and there are few versions), Beijing Foreign Studies University source (the download speed is very fast, but there are few versions) - I personally tested it

1. Execute the following command to download Hadoop installation Bag.

wget --no-check-certificate https://repo.huaweicloud.com/apache/hadoop/common/hadoop-3.1.3/hadoop-3.1.3.tar.gz
Copy after login

2. Execute the following command to decompress the Hadoop installation package to /opt/hadoop.

tar -zxvf hadoop-3.1.3.tar.gz -C /opt/
mv /opt/hadoop-3.1.3 /opt/hadoop
Copy after login

3. Execute the following command to configure Hadoop environment variables.

echo 'export HADOOP_HOME=/opt/hadoop/' >> /etc/profile
echo 'export PATH=$PATH:$HADOOP_HOME/bin' >> /etc/profile
echo 'export PATH=$PATH:$HADOOP_HOME/sbin' >> /etc/profile
source /etc/profile
Copy after login

4. Execute the following command to modify the configuration files yarn-env.sh and hadoop-env.sh.

echo "export JAVA_HOME=/usr/java8" >> /opt/hadoop/etc/hadoop/yarn-env.sh
echo "export JAVA_HOME=/usr/java8" >> /opt/hadoop/etc/hadoop/hadoop-env.sh
Copy after login

5. Execute the following command to test whether Hadoop is installed successfully.

hadoop version
Copy after login

If version information is returned, the installation is successful.

3: Configure Hadoop

1. Modify the Hadoop configuration file core-site.xml.

a. Execute the following command to enter the editing page.

vim /opt/hadoop/etc/hadoop/core-site.xml
Copy after login

b. Enter i to enter edit mode. c. Insert the following content into the <configuration></configuration> node.

 <property>
        <name>hadoop.tmp.dir</name>
        <value>file:/opt/hadoop/tmp</value>
        <description>location to store temporary files</description>
    </property>
    <property>
        <name>fs.defaultFS</name>
        <value>hdfs://localhost:9000</value>
    </property>
Copy after login

d. Press the Esc key to exit the editing mode, enter: wq to save and exit.

2. Modify the Hadoop configuration file hdfs-site.xml.

a. Execute the following command to enter the editing page.

vim /opt/hadoop/etc/hadoop/hdfs-site.xml
Copy after login

b. Enter i to enter edit mode. c. Insert the following content into the <configuration></configuration> node.

 <property>
        <name>dfs.replication</name>
        <value>1</value>
    </property>
    <property>
        <name>dfs.namenode.name.dir</name>
        <value>file:/opt/hadoop/tmp/dfs/name</value>
    </property>
    <property>
        <name>dfs.datanode.data.dir</name>
        <value>file:/opt/hadoop/tmp/dfs/data</value>
    </property>
Copy after login

d. Press the Esc key to exit the editing mode, enter: wq to save and exit.

Four: Configure SSH password-free login

1. Execute the following command to create the public key and private key.

ssh-keygen -t rsa
Copy after login

2. Execute the following command to add the public key to the authorized_keys file.

cd ~
cd .ssh
cat id_rsa.pub >> authorized_keys
Copy after login

If an error is reported, perform the following operations and then re-execute the above two commands; if no error is reported, go directly to step five:

Enter the following command in the environment variable Add the following configuration

vi /etc/profile
Copy after login

Then add the following content to it

export HDFS_NAMENODE_USER=root
export HDFS_DATANODE_USER=root
export HDFS_SECONDARYNAMENODE_USER=root
export YARN_RESOURCEMANAGER_USER=root
export YARN_NODEMANAGER_USER=root
Copy after login

Enter the following command to make the changes take effect

source /etc/profile
Copy after login

Five: Start Hadoop

1.Execute the following command to initialize the namenode.

hadoop namenode -format
Copy after login

2.Execute the following commands in sequence to start Hadoop.

start-dfs.sh
Copy after login

If Y/N is selected, select Y; otherwise press Enter directly

start-yarn.sh
Copy after login

3.After successful startup, execute the following command , to view the processes that have been successfully started.

jps
Copy after login

How to install Hadoop in linux

Normally there will be 6 processes;

4.Open the browser to visit http://:8088 and http://:50070. If the following interface is displayed, it means that the Hadoop pseudo-distributed environment is completed.

How to install Hadoop in linux

How to install Hadoop in linux

The above is the detailed content of How to install Hadoop in linux. For more information, please follow other related articles on the PHP Chinese website!

Related labels:
source:yisu.com
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Popular Tutorials
More>
Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template