0% found this document useful (0 votes)

50 views5 pages

Execute Java Programs in MapReduce

The document provides steps to create a Java project in Eclipse called "wordcount" that counts the frequency of words in a file. It involves: 1. Creating a Java project, package, and class in Eclipse for the wordcount program. 2. Adding external JAR files to the project from the hadoop_jars folder to resolve errors. 3. Exporting the project as a JAR file called "wordcount.jar". 4. Running the JAR file on a test text file using Hadoop MapReduce and viewing the output of word counts.

Uploaded by

Namma ooru

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views5 pages

Execute Java Programs in MapReduce

Uploaded by

Namma ooru

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

BIGDATA

Execute Java programs (.JAR files) from MapRedue.

 To make sure our Eclipse is working with the simple program.

Click on Eclipse

To create a java program in eclipse, we need to create a below three objects.

1. Java Project

2. Package

3. Class (where we need to create a java programs)

 Step 1:

Right click on the Package Explorer (Left side Pane)

New -> Java Project

Give the project name as "welcome" and Click Finish.

Note: Now the project "welcome" will be created in the Package Explorer.

1 [email protected]
99520 29030
BIGDATA

 Step 2:

Right click on the project "welcome” New -> Package

Give the package name as "welcome" against the Name

Make sure Source Folder as "welcome\src" and click Finish.

Note: Now the package name "welcome" will be created in the package source folder.

 Step 3:

Right click on the project "welcome" New -> Class

Give the class name as "welcome" against the Name and

make sure Source folder as "welcome\src" and

Package as "welcome" and

make sure Public static void main box checked then click Finish.

Note: Now class "welcome" will be created under the package.

The created class file will be shown like below..

Note: Add the highlighted print statement under void main.

package welcome;

public class welcome {

/**
* @param args
*/
public static void main(String[] args)
{

System.out.println("Welcome");
}

Make sure no error shown in Class page.

To run the file, right click on the class page Run As -> Java Application

2 [email protected]
99520 29030
BIGDATA
Note: If your class file is error free, you will be able to see the "Welcome" on the result pane in the bottom.

Create and Execute .JAR file from MapReduce.

Program description: To count the repeated words from the file

Pre Requisites:

Copy the below files and paste into Cloudera's Home and unzip/Extract the "hadoop jars' file.

Step 1:

Right click on the Package Explorer (Left side Pane)

New -> Java Project

Give the project name as "wordcount" and Click finish.

Note: Now the project " wordcount " will be created in the Package Explorer.

Step 2:

Right click on the project " wordcount " New -> Package

Give the package name as " wordcount " against the Name

Make sure Source Folder as " wordcount \src" and click Finish.

Note: Now the package name " wordcount " will be created in the package source folder.

Step 3:

Right click on the project " wordcount " New -> Class

Give the class name as " wordcount " against the Name and

make sure Source folder as " wordcount \src" and

Package as " wordcount " and

make sure Public static void main box checked then click Finish.

Note: Now class " wordcount " will be created unser the package.

The created class file will be shown like below..

3 [email protected]
99520 29030
BIGDATA
package wordcount;

public class wordcount {

/**
* @param args
*/
public static void main(String[] args) {

Open the file wordcount.java and copy all the contents and
Replace it in Eclipse wordcount.java class file. Please make sure the first line would be " package
wordcount;" and Save it (Ctrl+S).

Note: Now we could see there are lots of red line in the Class script and it is expecting JAR files reference.

To add the reference files, right click on the project "wordcount" from
Package explorer -> Properties ->

select Java Build Path from Left pane and select Libraries from the Right side pane and click on
Add External Jars and browse the folder "hadoop_jars" which we extracted/unzipped in Cloudera's
Home path and select all the 10 supporting .jar files and click Ok to completed.

Note: Make sure all there underlined errors are removed and class became error free.

Now right click on the project "wordcount" from the package explorer
-> Export -> Expand Java -> select JAR File
-> click Next -> select "wordcount" from the left pane and click "Browse"
-> Provide the File name as "wordcount.jar"
-> Provide the Folder path as "/home/cloudera"
-> Next -> Finish.

Note: Now you could see the created jar file "wordcount.jar" created in "/home/cloudera/"

Now we are into execute the program from MapReduce.

Go to "/home/cloudera/" folder then right clicks Open Terminal

4 [email protected]
99520 29030
BIGDATA
[Coudera@Localhost ~]$ pwd

home/Cloudera -- Present Working Directory

Create a test file

$ cat > test.text

this is a test file
this
a
file
Ctrl+D to save

Now test.txt file is created with above mentioned four lines.

$ hadoop fs -ls / --It will list the folders which are avail.

Now place the file into hadoop

$ hadoop fs -put test.txt /tmp

$ hadoop fs -ls /tmp -- Now you cound see the test.txt file which you created is moved to hadoop.

Below command to execute the jar file in MapReduce.

$ hadoop jar wordcount.jar wordcount.wordcount /tmp/test.txt Target

Map Reduce program will run and to get the word count from the test file and stores the result into
Target folder.

$ hadoop fs -ls /user/cloudera/Target

Now you could see the file part-00000

$ hadoop fs -cat /user/cloudera/Target/part-00000

a 2
file 2
is 1
sample 1
this 2

Note: Similarly we need to other jar files as well with MapReduce program.

5 [email protected]
99520 29030

Programming Essentials
No ratings yet
Programming Essentials
5 pages
Iot Notes All Units PDF
No ratings yet
Iot Notes All Units PDF
51 pages
Running A Mapreduce Program On Cloudera Quickstart VM: Requirements
100% (1)
Running A Mapreduce Program On Cloudera Quickstart VM: Requirements
13 pages
Mapreduce Lab
No ratings yet
Mapreduce Lab
36 pages
Activity 2
No ratings yet
Activity 2
31 pages
BDM Lab Manual 2
No ratings yet
BDM Lab Manual 2
4 pages
6 WIBD-Practicals
No ratings yet
6 WIBD-Practicals
19 pages
Java and Project Delivery: E&CE 250 Winter 2002
No ratings yet
Java and Project Delivery: E&CE 250 Winter 2002
21 pages
Packages
No ratings yet
Packages
6 pages
Hands-On Exercises With Big Data: Lab Sheet 1: Getting Started With Mapreduce and Hadoop
No ratings yet
Hands-On Exercises With Big Data: Lab Sheet 1: Getting Started With Mapreduce and Hadoop
14 pages
Developing A Simple Map-Reduce Program For Hadoop: Big Data Course CS6350 Professor: Dr. Latifur Khan
No ratings yet
Developing A Simple Map-Reduce Program For Hadoop: Big Data Course CS6350 Professor: Dr. Latifur Khan
22 pages
Steps: /usr/lib/hadoop-0.20/ Usr/lib/hadoop-0.20/lib
No ratings yet
Steps: /usr/lib/hadoop-0.20/ Usr/lib/hadoop-0.20/lib
4 pages
Running Jar Program
No ratings yet
Running Jar Program
3 pages
[CSC221-2024-02-06]Packages and the Java Module System
No ratings yet
[CSC221-2024-02-06]Packages and the Java Module System
32 pages
Mapreduce Lab
No ratings yet
Mapreduce Lab
36 pages
Go To Cloudera Quickstart VM To Download A Pre-Setup CDH Virtual Machine
No ratings yet
Go To Cloudera Quickstart VM To Download A Pre-Setup CDH Virtual Machine
20 pages
CE 232 - Week 10 - Unit 6 - Java Utilities - JAR Files 2024 Final
No ratings yet
CE 232 - Week 10 - Unit 6 - Java Utilities - JAR Files 2024 Final
37 pages
Intellipaat Hands On Exercises PDF
No ratings yet
Intellipaat Hands On Exercises PDF
49 pages
Homework_Labs_Lecture2
No ratings yet
Homework_Labs_Lecture2
6 pages
Ravinder Big Data 4 PDF
No ratings yet
Ravinder Big Data 4 PDF
15 pages
Word Count Program With MapReduce and Java
No ratings yet
Word Count Program With MapReduce and Java
6 pages
MR YARN - Lab 2 - Cloud - Updated-V2.0
No ratings yet
MR YARN - Lab 2 - Cloud - Updated-V2.0
22 pages
11.file IO Package
No ratings yet
11.file IO Package
5 pages
Packages PDF
No ratings yet
Packages PDF
21 pages
Classpath
No ratings yet
Classpath
3 pages
Guide To Creating and Running A Jar File in Java Baeldung
No ratings yet
Guide To Creating and Running A Jar File in Java Baeldung
7 pages
Abstract Class Interface: Types of Packages: Built-In and User Defined
No ratings yet
Abstract Class Interface: Types of Packages: Built-In and User Defined
31 pages
DSBDA 11
No ratings yet
DSBDA 11
15 pages
Big Data Akshat
No ratings yet
Big Data Akshat
57 pages
Steps To Run A JAVA API On Virtual-Box
No ratings yet
Steps To Run A JAVA API On Virtual-Box
5 pages
WordCount Program Hadoop Task 2
No ratings yet
WordCount Program Hadoop Task 2
7 pages
CS702_Big_Data_Programs
No ratings yet
CS702_Big_Data_Programs
58 pages
The Definitive Guide to Getting Started with OpenCart 2.x
From Everand
The Definitive Guide to Getting Started with OpenCart 2.x
iSenseLabs
No ratings yet
Steps to create jar file and execute word count problem in mapper reducer
No ratings yet
Steps to create jar file and execute word count problem in mapper reducer
5 pages
CS-702 (D) BigData
No ratings yet
CS-702 (D) BigData
61 pages
Word Count
No ratings yet
Word Count
10 pages
Declaring Classes: New5.java Test - Java Test2.java Test3.java
No ratings yet
Declaring Classes: New5.java Test - Java Test2.java Test3.java
11 pages
Cloudera Academic Partnership 3 PDF
0% (1)
Cloudera Academic Partnership 3 PDF
103 pages
Make Bootstrap Themes
From Everand
Make Bootstrap Themes
Bo Feng
No ratings yet
Development PDF
No ratings yet
Development PDF
14 pages
Eclipse
No ratings yet
Eclipse
33 pages
Lecture 4 PDF
No ratings yet
Lecture 4 PDF
38 pages
Cloudera Academic Partnership 4 PDF
No ratings yet
Cloudera Academic Partnership 4 PDF
38 pages
Prerequisites: Single Node Setup Cluster Setup
No ratings yet
Prerequisites: Single Node Setup Cluster Setup
5 pages
Big Data Lab Manual
No ratings yet
Big Data Lab Manual
32 pages
02-Wordcount Mapreduce
No ratings yet
02-Wordcount Mapreduce
5 pages
Practical 2c
No ratings yet
Practical 2c
2 pages
bda lab s
No ratings yet
bda lab s
92 pages
Word Count Program To Demonstrate The Use of Map and Reduce Tasks
No ratings yet
Word Count Program To Demonstrate The Use of Map and Reduce Tasks
5 pages
Labs Lecture2
No ratings yet
Labs Lecture2
6 pages
Map Reduce
No ratings yet
Map Reduce
57 pages
MapReduce Programs
No ratings yet
MapReduce Programs
10 pages
BDA Manual
No ratings yet
BDA Manual
41 pages
DSBDA Lab Manual
No ratings yet
DSBDA Lab Manual
56 pages
Module10-BigData Guide v1.0
No ratings yet
Module10-BigData Guide v1.0
6 pages
Big Data Analytics Lab Manual(BE AI&DS)
No ratings yet
Big Data Analytics Lab Manual(BE AI&DS)
29 pages
Java Programming Manual
No ratings yet
Java Programming Manual
71 pages
Practice 2
No ratings yet
Practice 2
7 pages
Java BC0047 - Software-Engineering-Spring-2013-Assignment
No ratings yet
Java BC0047 - Software-Engineering-Spring-2013-Assignment
7 pages
Chap 18
No ratings yet
Chap 18
11 pages
DSBDA GRP B Print
No ratings yet
DSBDA GRP B Print
21 pages
Haskell from Another Site
From Everand
Haskell from Another Site
Jagoda Górska
No ratings yet
Selenium 3
No ratings yet
Selenium 3
11 pages
Mainframe Questions
No ratings yet
Mainframe Questions
14 pages
Urlfiles
No ratings yet
Urlfiles
1 page
Pig Practicals
No ratings yet
Pig Practicals
4 pages
Kafka Practicals
No ratings yet
Kafka Practicals
3 pages
StdXI Voc TOCA EM 5
No ratings yet
StdXI Voc TOCA EM 5
40 pages
Sai Hive Practicals Phase I
No ratings yet
Sai Hive Practicals Phase I
8 pages
Speed Practice: Type The Following in Double Line Spacing With A Margin of Ten Degrees
No ratings yet
Speed Practice: Type The Following in Double Line Spacing With A Margin of Ten Degrees
35 pages
ATCI Open Demands
No ratings yet
ATCI Open Demands
462 pages
CSDE2530 - Introduction To Computer Security - V2
No ratings yet
CSDE2530 - Introduction To Computer Security - V2
6 pages
DXB2038 RUS 02 B3 - 3 Ericsson Faulty Report - 1
No ratings yet
DXB2038 RUS 02 B3 - 3 Ericsson Faulty Report - 1
1 page
Untitled
No ratings yet
Untitled
60 pages
Ec Ii Unit2
No ratings yet
Ec Ii Unit2
20 pages
PLC XMIT Software Loadable User Manual v4.0
No ratings yet
PLC XMIT Software Loadable User Manual v4.0
196 pages
36 Popular Snap Command Examples in Linux For Beginners - CyberITHub
No ratings yet
36 Popular Snap Command Examples in Linux For Beginners - CyberITHub
1 page
Network Protocols
No ratings yet
Network Protocols
9 pages
Artificial Neural Network Part-2
No ratings yet
Artificial Neural Network Part-2
15 pages
FortiGate Security Study Guide For FortiOS5.6.2
No ratings yet
FortiGate Security Study Guide For FortiOS5.6.2
666 pages
All MCQ
No ratings yet
All MCQ
16 pages
EE328 CourseOutlineOBE SP 2023
No ratings yet
EE328 CourseOutlineOBE SP 2023
5 pages
Kemper Profiler Player Main Manual 11.0
No ratings yet
Kemper Profiler Player Main Manual 11.0
167 pages
Mithun Bhattacharjee: Pune, Maharashtra, India
No ratings yet
Mithun Bhattacharjee: Pune, Maharashtra, India
5 pages
7.2.1.7 Packet Tracer - Configuring Named Standard IPv4 ACLs Instructions - ILM
No ratings yet
7.2.1.7 Packet Tracer - Configuring Named Standard IPv4 ACLs Instructions - ILM
2 pages
CA1 REVISION Question Paper
No ratings yet
CA1 REVISION Question Paper
2 pages
Microsoft: Question & Answers
No ratings yet
Microsoft: Question & Answers
3 pages
GVC-432 Ref: Donald Hearn & M. Pauline Baker ,: Lecture - 4
No ratings yet
GVC-432 Ref: Donald Hearn & M. Pauline Baker ,: Lecture - 4
36 pages
HLTQ
No ratings yet
HLTQ
12 pages
DC Lab Exp-1
No ratings yet
DC Lab Exp-1
14 pages
kioptrix nikto scans
No ratings yet
kioptrix nikto scans
31 pages
Arjun Quiz
No ratings yet
Arjun Quiz
6 pages
ITI COPA Trade Lesson Plan 2024-25
No ratings yet
ITI COPA Trade Lesson Plan 2024-25
106 pages
Delhi Public School, R.K.Puram Computer Science: Entry Controlled Loop Entry Controlled Loop
No ratings yet
Delhi Public School, R.K.Puram Computer Science: Entry Controlled Loop Entry Controlled Loop
6 pages
iPremier Case
No ratings yet
iPremier Case
3 pages
Viva Questions For SQL & Java For STD 12
No ratings yet
Viva Questions For SQL & Java For STD 12
20 pages
18CSC303J DBMS Sample MCQ
No ratings yet
18CSC303J DBMS Sample MCQ
12 pages
SIMATIC20240708
No ratings yet
SIMATIC20240708
8 pages
Infy JAVA SELENIUM
No ratings yet
Infy JAVA SELENIUM
28 pages
The Turing Test
No ratings yet
The Turing Test
21 pages

Uploaded by

Uploaded by

BIGDATA

Execute Java programs (.JAR files) from MapRedue.

 To make sure our Eclipse is working with the simple program.

To create a java program in eclipse, we need to create a below three objects.

3. Class (where we need to create a java programs)

Right click on the Package Explorer (Left side Pane)

New -> Java Project

Give the project name as "welcome" and Click Finish.

Right click on the project "welcome” New -> Package

Give the package name as "welcome" against the Name

Make sure Source Folder as "welcome\src" and click Finish.

Right click on the project "welcome" New -> Class

Give the class name as "welcome" against the Name and

make sure Source folder as "welcome\src" and

Package as "welcome" and

Note: Now class "welcome" will be created under the package.

The created class file will be shown like below..

Note: Add the highlighted print statement under void main.

public class welcome {

Make sure no error shown in Class page.

Create and Execute .JAR file from MapReduce.

Program description: To count the repeated words from the file

Right click on the Package Explorer (Left side Pane)

New -> Java Project

Give the project name as "wordcount" and Click finish.

make sure Source folder as " wordcount \src" and

Package as " wordcount " and

The created class file will be shown like below..

public class wordcount {

Now we are into execute the program from MapReduce.

Go to "/home/cloudera/" folder then right clicks Open Terminal

home/Cloudera -- Present Working Directory

Create a test file

$ cat > test.text

Now test.txt file is created with above mentioned four lines.

Now place the file into hadoop

$ hadoop fs -put test.txt /tmp

Below command to execute the jar file in MapReduce.

$ hadoop jar wordcount.jar wordcount.wordcount /tmp/test.txt Target

$ hadoop fs -ls /user/cloudera/Target

Now you could see the file part-00000

$ hadoop fs -cat /user/cloudera/Target/part-00000

You might also like