Datanode AMI ID: ami-0c481f8ab5632701f - Image Name: Datanode-image
Namenode AMI ID: ami-0c27685906c173598 - Image Name: Namenode-image
Steps to run the program:
- Create one instance from the Namenode image and three instances from the Datanode image.

On the Namenode:
- Upload your key pair file to ~/.ssh/key.pem
- Create a ~/.ssh/config file with the following content (remove the file first if it already exists). Fill in each HostName with the corresponding instance's address:

    Host namenode
        HostName <namenode address>
        User ec2-user
        IdentityFile ~/.ssh/key.pem
    Host datanode1
        HostName <datanode1 address>
        User ec2-user
        IdentityFile ~/.ssh/key.pem
    Host datanode2
        HostName <datanode2 address>
        User ec2-user
        IdentityFile ~/.ssh/key.pem
    Host datanode3
        HostName <datanode3 address>
        User ec2-user
        IdentityFile ~/.ssh/key.pem
- Run the following two commands to set the required permissions:

    chmod 400 ~/.ssh/key.pem
    chmod 400 ~/.ssh/config
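As an optional sanity check before copying anything (a minimal sketch; the host aliases come from the config file above), confirm that passwordless SSH to every datanode works:

    # Should print each datanode's hostname without prompting for a password.
    for h in datanode1 datanode2 datanode3; do
        ssh -o BatchMode=yes "$h" hostname
    done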
- Run the following commands to transfer the key and config to the datanodes (if the files already exist on a datanode, log in to that node and delete them first):

    scp ~/.ssh/key.pem ~/.ssh/config datanode1:~/.ssh
    scp ~/.ssh/key.pem ~/.ssh/config datanode2:~/.ssh
    scp ~/.ssh/key.pem ~/.ssh/config datanode3:~/.ssh

- Run the following command to start Hadoop:

    $HADOOP_HOME/sbin/start-all.sh
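To confirm that the cluster came up (an optional check, not part of the original steps), list the running daemons and the registered datanodes:

    # On the Namenode, expect daemons such as NameNode and ResourceManager.
    jps
    # Should report 3 live datanodes once they have registered.
    hdfs dfsadmin -report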
- Run the following commands to disable safe mode and delete previous results if they exist:

    hadoop dfsadmin -safemode leave
    hdfs dfs -rm /result/*
    hdfs dfs -rmdir /result
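Equivalently (a sketch using standard HDFS shell options, not part of the original instructions), the cleanup can be made idempotent so it does not error when /result is absent:

    # -test -d returns 0 only if the directory exists; -rm -r removes it and its contents.
    hdfs dfs -test -d /result && hdfs dfs -rm -r /result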
- Upload resume_dataset.csv, ResumeRecommender.jar, and input.txt to the ~ directory. (The Java file needed to build the jar is provided.)
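If you need to rebuild the jar from the provided Java file, something like the following works on a host with Hadoop installed (a sketch; the source file name ResumeRecommender.java is an assumption):

    # Make the JDK compiler tools visible to Hadoop, then compile and package.
    export HADOOP_CLASSPATH=${JAVA_HOME}/lib/tools.jar
    hadoop com.sun.tools.javac.Main ResumeRecommender.java
    jar cf ResumeRecommender.jar ResumeRecommender*.class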
- Use the command below to copy resume_dataset.csv to HDFS if it is not already there:

    hdfs dfs -copyFromLocal /home/ec2-user/resume_dataset.csv /ResumeRecommenderinput
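If the target directory does not exist yet, a guarded version (a sketch; same paths as above) creates it first and then verifies the upload:

    hdfs dfs -test -d /ResumeRecommenderinput || hdfs dfs -mkdir -p /ResumeRecommenderinput
    hdfs dfs -copyFromLocal /home/ec2-user/resume_dataset.csv /ResumeRecommenderinput
    # Should list resume_dataset.csv.
    hdfs dfs -ls /ResumeRecommenderinput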
- Run the command below from the ~ directory to run the program:

    hadoop jar ResumeRecommender.jar ResumeRecommender /ResumeRecommenderinput/resume_dataset.csv /result input.txt

  (The last line of this command's output shows the duration, in milliseconds, that the program took to execute.)
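Since the duration is printed on the last line, it can be convenient to keep a copy of the job output (an optional shell idiom, not part of the original steps):

    hadoop jar ResumeRecommender.jar ResumeRecommender /ResumeRecommenderinput/resume_dataset.csv /result input.txt 2>&1 | tee run.log
    # The reported duration in milliseconds.
    tail -n 1 run.log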
- To see the output:

    hdfs dfs -cat /result/*

  (Before re-running the program, repeat the cleanup step above that disables safe mode and deletes previous results.)
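To keep a local copy of the results (an optional step; the local file names are arbitrary):

    # Concatenate all result files into one local file.
    hdfs dfs -cat /result/* > ~/result.txt
    # Or pull the whole output directory from HDFS.
    hdfs dfs -get /result ~/result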