Sunday, March 6, 2016

Hadoop, Avro experimentation

Followed  but caution: I needed to set the javac CLASSPATH to the following instead:
export CLASSPATH="$HADOOP_HOME/share/hadoop/tools/lib/*"
export CLASSPATH="$HADOOP_HOME/share/hadoop/mapreduce/*:$CLASSPATH"
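A quick sanity check on the wildcard entries, since a wrong HADOOP_HOME makes javac fail with unhelpful "package does not exist" errors. This is only a sketch: the function name list_classpath_jars is made up for this note, and it assumes CLASSPATH uses the "dir/*" form shown above.

```shell
# Hypothetical helper: for each CLASSPATH entry, report how many jars sit behind it.
list_classpath_jars() {
  printf '%s\n' "$CLASSPATH" | tr ':' '\n' | while IFS= read -r entry; do
    case "$entry" in
      */\*) dir=${entry%/\*} ;;   # strip the trailing "/*" wildcard
      *)    dir=$entry ;;
    esac
    if [ -d "$dir" ]; then
      # Count only jars directly inside the directory, as the Java wildcard does.
      count=$(find "$dir" -maxdepth 1 -name '*.jar' | wc -l | tr -d ' ')
      echo "$dir: $count jar(s)"
    else
      echo "$dir: MISSING"
    fi
  done
}
```

Running list_classpath_jars after the two exports should print one line per entry; a MISSING line or a count of 0 would suggest HADOOP_HOME points at the wrong place.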

When I used the jar command it failed at first. This post helped to get it into alternatives:

Used the hadoop fs -put command to copy the test.txt file onto the Hadoop filesystem per
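For the record, the upload step can be sketched roughly as below. The function name put_into_hdfs and the target directory are assumptions for illustration; only test.txt and the hadoop fs -put command come from what I actually ran.

```shell
# Hypothetical helper: copy a local file into an HDFS directory and list the result.
# Usage: put_into_hdfs <local-file> <hdfs-dir>
put_into_hdfs() {
  local_file="$1"
  hdfs_dir="$2"
  # -mkdir -p creates the target directory (and parents) if it does not exist yet.
  hadoop fs -mkdir -p "$hdfs_dir" &&
    hadoop fs -put "$local_file" "$hdfs_dir/" &&
    hadoop fs -ls "$hdfs_dir"
}
```

Something like put_into_hdfs test.txt input would then place the file under a relative HDFS directory named input (an assumed name), which resolves under the current user's HDFS home directory.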

When running the MapReduceAvroWordCount application, got errors similar to those described 
Added yarn.resourcemanager.address per the answer.
The jps command shows that the Hadoop services appear to be running.
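"Appear to be running" can be checked a little more mechanically. The sketch below assumes a single-node setup where NameNode, DataNode, ResourceManager and NodeManager should all show up in jps; the function name missing_hadoop_daemons is made up for this note.

```shell
# Hypothetical helper: print each expected Hadoop daemon that jps does not list.
missing_hadoop_daemons() {
  running=$(jps)
  for daemon in NameNode DataNode ResourceManager NodeManager; do
    # Compare the second jps column exactly, so that "SecondaryNameNode"
    # is not mistaken for "NameNode".
    printf '%s\n' "$running" |
      awk -v d="$daemon" '$2 == d { found = 1 } END { exit !found }' ||
      echo "$daemon"
  done
}
```

Empty output would mean all four daemons are up; any name printed is a service that did not start.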

This time I didn't get the same error, but the job seems stuck.

issue and modifying Hadoop config files to match a single-node installation.
Tests under the single-node Testing section gave the same error as before: INFO ipc.Client: Retrying connect to server:
Reinstalled to a local user dir per the alexjf blog.
Still attempts to connect to
Reinstalled Fedora, set a static IP address, and installed the JDK per
Tried the tests and got an error this time
Attempted to disable IPv6 using  but it resulted in OS instability. Forced a reboot.
Could not log in.
Tried reinstalling the OS.
Installed the JDK and enabled ssh. Tried enabling passwordless ssh per
but it would not work.
Tried the Hadoop installation and test from the alexjf blog. The test failed; pulling up the url shows memory limit exceeded as the cause of failure.
Removed all settings from yarn-site.xml, then stopped and restarted using $HADOOP_PREFIX/sbin/ start etc., but the job seemed to run indefinitely.
Used the first set of instructions to enable passwordless ssh.
Restarted the Hadoop services and it didn't ask for a password this time. The test still fails.
Modified the command to set --num_containers 1 --master_memory 512, and it says completed successfully.
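For reference, the passwordless ssh step mentioned above generally comes down to generating a key pair with an empty passphrase and authorizing it for the same user. A minimal sketch, assuming OpenSSH and an RSA key; the function name setup_passwordless_ssh is made up, and the instructions I followed may differ in detail.

```shell
# Hypothetical helper: create a passphrase-less key if none exists and authorize it.
# The directory defaults to ~/.ssh; pass another one to try this out safely.
setup_passwordless_ssh() {
  ssh_dir="${1:-$HOME/.ssh}"
  key="$ssh_dir/id_rsa"
  auth="$ssh_dir/authorized_keys"
  mkdir -p "$ssh_dir" && chmod 700 "$ssh_dir"
  # Generate the key pair only when there is no existing private key.
  [ -f "$key" ] || ssh-keygen -q -t rsa -N "" -f "$key"
  # Append the public key once; skip if the exact line is already present.
  grep -qxF "$(cat "$key.pub")" "$auth" 2>/dev/null || cat "$key.pub" >> "$auth"
  # sshd tends to ignore authorized_keys files with loose permissions.
  chmod 600 "$auth"
}
```

After running setup_passwordless_ssh, a command such as ssh localhost true should return without a password prompt if sshd is running and accepts key authentication.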

Trying on a Mac now per  and it works.

Back on the Fedora setup; retried MapReduceAvroWordCount and it worked. Next, try writing a schema based on a Java class: how to convert the schema to its JSON representation (an avsc file?) and use that instead of the Java class?
