Installing Cloudera Manager
Installing Cloudera Manager 5.4.1 in VirtualBox/Linux/CentOS
A step by step guide to install Cloudera Manager in VirtualBox for a clean installation
Step 1 - OS Installation:
This is a guide to install Cloudera Manager 5.4.1 in Oracle VirtualBox. The first and fore most step is to install OS on the VirtualBox. In this tutorial, we use CentOS 6.6; Cloudera Manager supports the following operating systems (source cloudera website):
Red Hat Enterprise Linux and CentOS
6.5 in SE Linux mode
Oracle Enterprise Linux with default kernel and Unbreakable Enterprise Kernel, 64-bit
5.6 (UEK R2)
6.4 (UEK R2)
6.5 (UEK R2, UEK R3)
6.6 (UEK R3)
SLES - SUSE Linux Enterprise Server 11, 64-bit. Service Pack 2 or later is required for CDH 5, and Service Pack 1 or later is required for CDH 4. To use the embedded PostgreSQL database that is installed when you follow Installation Path A - Automated Installation by Cloudera Manager, the Updates repository must be active. The SUSE Linux Enterprise Software Development Kit 11 SP1 is required on hosts running the Cloudera Manager Agents.
Debian - Wheezy (7.0 and 7.1), Squeeze (6.0) (deprecated), 64-bit
Ubuntu - Trusty (14.04), Precise (12.04), Lucid (10.04) (deprecated), 64-bit
Debian Squeeze and Ubuntu Lucid are supported only for CDH 4.
Using the same version of the same operating system on all cluster hosts is strongly recommended.
Make sure the OS is installed properly in the VirtualBox before proceeding into the installation of Cloudera Manager.
Step 2 - Disable SELinux:
Selinux value should be changed from ‘enforcing’ to ‘disabled’. If it is ‘enforcing’, the Cloudera Manager will not install and it will ask to disable it when the installation starts. So, the following steps can be used to disable it.
[talend@ClouderaCDH5 ~]$ su –
Password: <Your Root Password>
[root@ClouderaCDH5 ~]$ cat /etc/sysconfig/selinux – Check whether selinux is disabled or not.
[root@ClouderaCDH5 ~]$ vi /etc/sysconfig/selinux – If it is not disabled, open vi editor to edit it.
[root@ClouderaCDH5 ~]$ cat /etc/sysconfig/selinux – Check the value for selinux and Restart.
Step 3 - Disable Firewall:
Disabling the Linux firewall is the next step. Cloudera Manager, CDH components, managed services, and third-party components use the ports listed in the tables that follow. These ports need to be opened on each system. If a firewall is being used, such as iptables, and cannot open all the listed ports, the firewall can be completely disabled to ensure full functionality.
To disable the firewall on each host in your cluster, perform the following steps on each host.
1. Save the existing iptables rule set.
iptables-save > /root/firewall.rules
2. Disable iptables.
o For RHEL, CentOS, Oracle, and Debian:
chkconfig iptables off
Step 4 – Edit hosts:
In the next step, we need to edit the hosts file. Copy the hostname from the following step and edit it in the /etc/hosts file.
[talend@ClouderaCDH5 ~]$ hostname
ClouderaCDH5 – Your Host name will be displayed; Copy this.
[talend@ClouderaCDH5 ~]$ vi /etc/hosts
Paste your hostname in the file entry. Refer the below figure for reference.
Step 5 – Download and Install Cloudera Manager
Follow the below steps to get the installer file from the Cloudera repository and install
[root@ClouderaCDH5 ~]$ wget http://archive.cloudera.com/cm5/installer/5.4.1/cloudera-manager-installer.bin
This will download the installation file to your local directory; Now change the permission to execute the file.
[root@ClouderaCDH5 ~]$ chmod u+x cloudera-manager-installer.bin
Once the permission is changed, your installation file will be changed to green color. You can see through ls command like below. Now it is ready to install the cloudera manager in your CentOS.
Run the file to install Cloudera Manager.
[root@ClouderaCDH5 ~]$ ./cloudera-manager-installer.bin
So once we run the ./cloudera-manager-installer.bin file, a pop up window will be opening up and click next buttons to proceed till the end the installation. At the end, you can see a pop up window open up and a browser will be open with the url ‘http://localhost:7180’; the user name and password are ‘admin/admin’. If your page is not loading with this URL, it means that cloudera scm server is yet not started. So wait for sometime and look at the log in the next section Step 6 - Look at the Log.
You can see the below screen on the successful installation.
Step 6 – Look at the Log file
Once the installation completes, we can check the log file for the server starts. On a successful starting of the Cloudera-SCM-Server, you can see a message ‘Finished constructing repo:’ in your log file. Now the URL localhost:7180 will be opening.
[root@ClouderaCDH5 ~] $ tail -f /var/log/cloudera-scm-server/cloudera-scm-server.log
Step 7 – Cluster Installation
It is time to install the Cluster. Once your login to the http://localhost:7180; Cluster installation wizard will be coming up. Accept the license and click on Continue.
Now choose the edition of Cloudera you want to deploy. In this installation, I have chosen ‘Cloudera Express’.
Specify the hosts for your Cluster; just search with the hostname and select.
Now it’s time to choose the repository. If you want to install the most recent version of Cloudera Manager, then select Use Parcles (Recommended) option. If you want to install a specific release, then choose Use Packages and select the specific release.
If you want to enable single user mode, select this option; otherwise continue.
Now the Cluster Installation begins as below. All required packages and items will be installed in this step.
Once the installation completes, you can see the below screen with green color. Click Continue.
Step 8 – Cluster Setup
There will be multiple options displayed in Cluster Setup. Choose the one you want to have in your environment and Continue.
In the next step, database will be configured for the services you selected earlier. In this installation, database setup on Hive and Oozie Server is required for the service Core with Spark. Cloudera Manager will provide a password by default. Just note it down in a text file; this may be useful to debug if you encounter any connection issues.
When you click on the button Test Connection, you will receive a message ‘Skipped’; this will be cleared after the clean installation. So proceed with Continue.
Now this is the last step in Cluster Setup. This step is called First Run; In this step, the following services and packages will be configured and started in the follwo order.
1. Deploying Client Configuration like hdfs-site.xml, core-site.xml etc
2. Starting HDFS
3. Starting MapReduce and Yarn
4. Starting Spark
5. Starting Hive
6. Starting Oozie Server
7. Starting Hue
The First Run Command will start like this. Status is Running.
And this will end like this. Once all the configured services and packages started, you will see the screen like this. You can see the Status Finished.
This is end of instillation. Click Finish to see the cluster details.
These are all the steps to install Cloudera Manager properly. If you guys face any difficulties in installation, feel free to write to us from the below link; we will help you. You can follow us on the below to get to know more about us.