Why doesn't Hadoop support updates and appends?

+3 votes
390 views
Why doesn't Hadoop support updates and appends?
posted Nov 29, 2016 by Shyam


1 Answer

0 votes

By default, Hadoop is designed for write-once, read-many access. Hadoop 2.x supports the append operation, but Hadoop 1.x does not.
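
For illustration, here is a minimal Java sketch of an HDFS append using the FileSystem API (the file path /tmp/append-demo.txt is just a placeholder). On a Hadoop 2.x cluster the append() call succeeds; on Hadoop 1.x it is not supported.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class HdfsAppendExample {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);
            Path file = new Path("/tmp/append-demo.txt"); // placeholder path

            // Write-once: create the file if it does not exist yet.
            if (!fs.exists(file)) {
                try (FSDataOutputStream out = fs.create(file)) {
                    out.writeBytes("first line\n");
                }
            }

            // Append to the existing file; supported on Hadoop 2.x, not on 1.x.
            try (FSDataOutputStream out = fs.append(file)) {
                out.writeBytes("appended line\n");
            }

            fs.close();
        }
    }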

answer Dec 1, 2016 by Karthick.c
Similar Questions
+1 vote

I want to know how to install and configure Apache Hadoop, and the programming paradigm for working with it.

+2 votes

I am running a hadoop-2.4.0 cluster. Each datanode has 10 disks, and the directories for those 10 disks are specified in dfs.datanode.data.dir.

A few days ago, I modified dfs.datanode.data.dir on one datanode () to reduce the number of disks, so two disks were excluded from dfs.datanode.data.dir. After the datanode was restarted, I expected the namenode to update the block locations. In other words, I thought the namenode would remove the datanode from the block locations of the blocks that were stored on the excluded disks, but the namenode didn't update the block locations...

In my understanding, a datanode sends a block report to the namenode when it starts, so the namenode should update the block locations immediately.

Is this a bug? Could anyone please explain?
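
Not an answer, but a minimal Java sketch (with a placeholder file path) of how one might ask the namenode which hosts it currently lists for each block of a file, to check whether the excluded-disk replicas are still being reported after the restart.

    import java.util.Arrays;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.BlockLocation;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class BlockLocationCheck {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);

            // Placeholder: pick a file whose blocks lived on the excluded disks.
            Path file = new Path("/user/test/somefile.txt");
            FileStatus status = fs.getFileStatus(file);

            // The namenode answers this from its current block map.
            BlockLocation[] locations = fs.getFileBlockLocations(status, 0, status.getLen());
            for (BlockLocation loc : locations) {
                System.out.println("offset=" + loc.getOffset()
                        + " hosts=" + Arrays.toString(loc.getHosts()));
            }
            fs.close();
        }
    }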

+2 votes

I am trying to run Nutch 2.2.1 on a Hadoop 2-node cluster. My Hadoop cluster is running fine, and I have successfully added the input and output directories to HDFS. But when I run

$HADOOP_HOME/bin/hadoop jar /nutch/apache-nutch-2.2.1.job org.apache.nutch.crawl.Crawler urls -dir crawl -depth 3 -topN 5

I am getting something like:

INFO input.FileInputFormat: Total input paths to process : 0

This, I understand, means that Hadoop cannot locate the input files. The job then ends, for obvious reasons, with a NullPointerException.

Can someone help me out?
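
Not a full answer, but "Total input paths to process : 0" usually means the job's input path resolved to a missing or empty HDFS directory. Below is a minimal Java sketch, assuming the relative path "urls" from the command above, that shows where that path actually resolves and whether it contains any files.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class InputPathCheck {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);

            // A relative path like "urls" resolves against the HDFS home
            // directory of the submitting user (e.g. /user/<name>/urls),
            // not against the local filesystem.
            Path urls = new Path("urls");
            Path resolved = urls.makeQualified(fs.getUri(), fs.getWorkingDirectory());
            System.out.println("Resolved input path: " + resolved);

            if (!fs.exists(resolved)) {
                System.out.println("Input path does not exist in HDFS.");
            } else {
                // The message also appears when the directory exists but is empty.
                FileStatus[] entries = fs.listStatus(resolved);
                System.out.println("Entries found: " + entries.length);
            }
            fs.close();
        }
    }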

...