In this example I am going to demonstrate how to load a file from the Hadoop Distributed Cache. I am writing a Mapper class and a Driver class: inside the Mapper class we define the input and output key/value types, and inside the Driver class we define the job configuration, including how the cache file is registered.
//Custom helper classes used below (FileUtils, ExtractionUtilHelperResources,
//JSONUtils and the Location bean) are assumed to be available on the classpath.
import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.BytesWritable;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.SequenceFileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;

public class LoadFileDistributedCachePidUtilsMapperDemo {
    //Mapper
    //This Mapper class takes NullWritable keys and BytesWritable values as input,
    //and emits NullWritable keys and Text values as output.
    public static class PidExtractionMapper extends Mapper<NullWritable, BytesWritable, NullWritable, Text> {

        //class-level variables, reused across map() calls
        private NullWritable noKey = NullWritable.get();
        private Configuration conf;
        private Text outputValue = new Text();
        private String fileName;
        private List<String> locpid;
        private String json;
        //The Hadoop framework calls setup() once at the beginning of each task.
        //Here we resolve the "LPIDFILE" entry that the driver placed in the
        //Distributed Cache and load its published ids into memory, so that
        //map() can filter records against them.
        @Override
        public void setup(Context context) throws IOException, InterruptedException {
            this.conf = context.getConfiguration();
            //resolve the local path of the cached file (custom helper)
            fileName = FileUtils.getFilePathFromDistributedCache("LPIDFILE");
            //read the cached file; each line is assumed to hold one published id
            locpid = new ArrayList<String>();
            BufferedReader reader = new BufferedReader(new FileReader(fileName));
            try {
                String line;
                while ((line = reader.readLine()) != null) {
                    locpid.add(line.trim());
                }
            } finally {
                reader.close();
            }
        }
        //This is the mapper logic: deserialize each record, skip it unless it
        //exists and its published id appears in the cached PID list, then write
        //the record out in JSON format via the JSON utility class.
        public void map(NullWritable key, BytesWritable value, Context context) throws IOException, InterruptedException {
            Location location = ExtractionUtilHelperResources.getLocationFromSerializedObject(value);
            //null-check first, then check the id against the cached list
            if (location == null || !locpid.contains(location.getPublishedId())) {
                return;
            }
            String locationPId = location.getPublishedId();
            System.out.println("Mapper Value: " + locationPId);
            json = JSONUtils.getJSONStringFromObject(location);
            //fall back to the raw id if JSON serialization returns null
            if (json == null) {
                json = locationPId;
            }
            outputValue.set(json);
            context.write(noKey, outputValue);
        }
}
    public static void main(String[] args) throws Exception {
        //create the configuration
        Configuration conf = new Configuration();
        //submit a new Hadoop job (Job.getInstance replaces the deprecated new Job(conf, name))
        Job hadoopJob = Job.getInstance(conf, "Extraction Utility Job");
        hadoopJob.setJarByClass(LoadFileDistributedCachePidUtilsMapperDemo.class);
        //mapper class
        hadoopJob.setMapperClass(PidExtractionMapper.class);
        //specify the input and output file formats
        hadoopJob.setInputFormatClass(SequenceFileInputFormat.class);
        hadoopJob.setOutputFormatClass(TextOutputFormat.class);
        //specify the map output key type
        hadoopJob.setMapOutputKeyClass(NullWritable.class);
        //specify the map output value type
        hadoopJob.setMapOutputValueClass(Text.class);
        //map-only job, no reducer tasks
        hadoopJob.setNumReduceTasks(0);
        //first argument specifies the HDFS input location
        FileInputFormat.addInputPath(hadoopJob, new Path(args[0]));
        //second argument specifies the HDFS output location
        FileOutputFormat.setOutputPath(hadoopJob, new Path(args[1]));
        //third argument specifies the HDFS location of the Distributed Cache file (custom helper)
        FileUtils.loadFiletoDC(args[2], "LPIDFILE", hadoopJob.getConfiguration());
        System.exit(hadoopJob.waitForCompletion(true) ? 0 : 1);
    }
}
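
Note that FileUtils is a custom helper class, not part of the Hadoop API, and its implementation is not shown in this post. Below is a minimal sketch of what it might look like, assuming the classic DistributedCache API with the "#" symlink fragment; the method names match the calls above, but the implementation itself is an assumption, not the original utility.

import java.io.IOException;
import java.net.URI;
import java.net.URISyntaxException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.filecache.DistributedCache;

public class FileUtils {

    //register an HDFS file in the Distributed Cache under a symlink name;
    //the "#linkName" fragment tells Hadoop to create a symlink with that
    //name in every task's working directory
    public static void loadFiletoDC(String hdfsPath, String linkName, Configuration conf)
            throws IOException, URISyntaxException {
        DistributedCache.addCacheFile(new URI(hdfsPath + "#" + linkName), conf);
        DistributedCache.createSymlink(conf);
    }

    //with symlinks enabled, the cached file is reachable in the task's
    //current working directory under its link name
    public static String getFilePathFromDistributedCache(String linkName) {
        return linkName;
    }
}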
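
Similarly, JSONUtils is a custom helper not shown here. A minimal sketch, assuming Jackson is on the classpath and matching the null-on-failure contract that map() relies on:

import com.fasterxml.jackson.databind.ObjectMapper;

public class JSONUtils {

    private static final ObjectMapper MAPPER = new ObjectMapper();

    //serialize any bean to a JSON string; return null on failure so the
    //caller can fall back to a default value, as map() does above
    public static String getJSONStringFromObject(Object obj) {
        try {
            return MAPPER.writeValueAsString(obj);
        } catch (Exception e) {
            return null;
        }
    }
}

With the job jar built, the job can then be submitted with three arguments, for example: hadoop jar extraction-demo.jar LoadFileDistributedCachePidUtilsMapperDemo /input/locations /output/pids /cache/lpid.txt (the jar name and paths here are placeholders).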