MongoDB Hadoop Connector Set-up on EC2?

+2 votes

Would anyone here be willing to walk me through setting this stuff up on Amazon Web Services?

I need to run MongoDB on an EC2 instance and connect it to an EMR Hadoop cluster for a project, but I have never used any of this (Mongo, Hadoop, the connector, or AWS) before, so it's a bit overwhelming. So far I have downloaded the connector from GitHub.

I believe I need to run "gradlew jar" to build the jars (I'm not really sure what those do either), but after that I am a bit lost. I have been searching for about a week now, but I can't find a good step-by-step process for this.

Please help.

posted Apr 18, 2016 by Deepti Singh

Similar Questions
+1 vote

I have a Hadoop cluster set up on Amazon EC2. When I try to access the application logs through the web UI, I get "Page can't be displayed".

Cluster configuration: my namenode is mapped to an Elastic IP (static) on EC2. The other datanodes' public IPs change every day because we stop the cluster during non-working hours.

Observation: when I try to view the logs, it picks the private IP of one of the datanodes, and I get "Page can't be displayed".

+1 vote

We are planning to migrate MongoDB from on-premise to AWS, but we are not allowed to use Cloud Manager. With the port enabled, I want to expand the 3-member on-premise replica set to a 5-member replica set (3 on-premise and 2 on AWS).

Can you please tell me whether I can sync the data from on-premise to AWS using SSL? I should not be using SSL within the on-premise replica set; only the sync to the 2 AWS members needs to happen over SSL.

+1 vote

I would like to know if you have any examples or tutorials where I can learn Hadoop MapReduce on MongoDB in Java.

+1 vote

I am trying to set up a Hadoop cluster, and I would like to know how many physical VMs are needed for this. My main interest is to measure the network traffic in the shuffle phase.

What are the basic requirements, for example for the namenode and datanodes?
