Ansible and Hadoop!

Dec 9, 2020

Ansible-

Ansible is an open-source IT configuration management (CM) and automation platform, provided by Red Hat. It uses human-readable YAML templates so that users can program repetitive tasks to run automatically, without learning an advanced language.

Hadoop-

Hadoop is an open-source software framework for storing data and running applications on clusters of commodity hardware. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs.

Task 11.1

Creating a Hadoop cluster using an Ansible playbook!

Steps to follow-

1 Check prerequisites-

This includes installing Ansible (pip3 install ansible) and adding the IPs of the namenode and the datanode to the inventory. Then check the connection to both nodes using the Ansible ping command.
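As a rough illustration of the inventory step, here is a minimal sketch of an inventory in Ansible's YAML format. The file name inventory.yml, the group names, the IP addresses and the ansible_user value are all placeholders assumed for this example, so swap in your own.

```yaml
# inventory.yml (hypothetical file name)
# The IPs below are documentation placeholders, not real hosts.
all:
  children:
    namenode:
      hosts:
        192.0.2.10:
    datanode:
      hosts:
        192.0.2.11:
  vars:
    ansible_user: root   # assumed login user; adjust to your setup

# Check connectivity with the ping module:
#   ansible all -i inventory.yml -m ping
```

If the connection works, each host should answer the ping module with "pong".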

2 Create the playbooks-

You need to create two playbooks, one for the namenode and one for the datanode. The two playbooks are nearly identical.

The same basic playbook works for both nodes. You only need to change the node-specific paths, such as the folder created on the root storage for that node. In practice, this means replacing namenode with datanode throughout the playbook.
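To make the idea concrete, here is a minimal sketch of what the namenode playbook might look like. It is not the exact playbook from this task. It assumes Hadoop 2 or later is already installed on the node, and the folder /nn, the config directory /etc/hadoop, the IP and the port are hypothetical values chosen for illustration.

```yaml
# namenode.yml (hypothetical sketch; assumes Hadoop is already installed)
- hosts: namenode
  become: true
  vars:
    node_dir: /nn                   # assumed storage folder; e.g. /dn for the datanode
    hadoop_conf_dir: /etc/hadoop    # assumed location of the Hadoop config files
    namenode_ip: 192.0.2.10         # placeholder IP
    namenode_port: 9001             # placeholder port
  tasks:
    - name: Create the storage folder on the root storage
      ansible.builtin.file:
        path: "{{ node_dir }}"
        state: directory

    - name: Point HDFS at the storage folder
      ansible.builtin.copy:
        dest: "{{ hadoop_conf_dir }}/hdfs-site.xml"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <!-- for the datanode, use dfs.datanode.data.dir instead -->
              <name>dfs.namenode.name.dir</name>
              <value>{{ node_dir }}</value>
            </property>
          </configuration>

    - name: Tell the node where the namenode listens
      ansible.builtin.copy:
        dest: "{{ hadoop_conf_dir }}/core-site.xml"
        content: |
          <?xml version="1.0"?>
          <configuration>
            <property>
              <name>fs.defaultFS</name>
              <value>hdfs://{{ namenode_ip }}:{{ namenode_port }}</value>
            </property>
          </configuration>
```

For the datanode copy, change hosts to datanode, set node_dir to the datanode folder, and swap the HDFS property as noted in the comment. Formatting the namenode and starting the daemons are left out here because the exact commands depend on the Hadoop version. A playbook like this is run with ansible-playbook, passing your inventory file with -i.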