LinuxQuestions.org

LinuxQuestions.org (/questions/)
-   Other *NIX (https://www.linuxquestions.org/questions/other-%2Anix-55/)
-   -   Error in Hadoop Installation (https://www.linuxquestions.org/questions/other-%2Anix-55/error-in-hadoop-installation-4175509318/)

bbdynamite 06-26-2014 06:45 PM

Error in Hadoop Installation
 
I'm just starting to use Linux and am trying to install Hadoop onto my computer. I downloaded the Hadoop file from a mirror recommended by Apache. The file is hadoop-2.2.0.tar.gz. Every website that I have read says that the command "tar -xzvf hadoop-2.2.0.tar.gz" should be able to unpack the file, but I keep getting the error "Error opening archive: Failed to open 'hadoop-2.2.0.tar.gz' "

I have tried both xzf and xvzf, neither seems to solve the problem.

Was I perhaps supposed to do something before I unpacked the file? What is going on?
(My computer is running OSX 10.9.3)

Note: I just started using this forum so I wasn't sure which category I should have posted this thread under; excuse me if this is the wrong category.

dijetlo 06-27-2014 08:17 AM

Quote:

OSX 10.9.3...
That's apple. Isn't there a guy somewhere who's paid to answer that question?

bbdynamite 06-27-2014 02:30 PM

Do you suggest I call Apple customer service?

schneidz 06-27-2014 02:36 PM

well you paid for support (as part of your initial purchase) but i doubt they will be able to answer a question like that.

judging by the filename the command you are running should work.

i have in the past tried to do programming on apple and stuff like netcat/gcc/... were surprisingly missing.

can you show us the output of the file command to make sure it is indeed a gnu-zip.

maybe the version of tar for apple is missing the functionality to de/compress

bbdynamite 06-27-2014 02:37 PM

Sorry for asking, but what exactly is the output of the file command?

schneidz 06-27-2014 02:41 PM

man file.

just type in
Code:

file <name-of-file>
and copy-pasta the output in your next response.

not to sound snarky but if you dont know the file command, how do you expect to administer a multi-node hadoop cluster with a distributed filesystem (in fact i'm not even convinced hadoop can be installed on mac -- i just looked it on duckduckgo: https://wiki.apache.org/hadoop/Runni...ode_Cluster%29) ?

what are you trying to do. i am not trying to dissuade you but perhaps you are going about it the wrong way ?
Quote:

Originally Posted by bbdynamite (Post 5194635)
I'm just starting to use Linux...
(My computer is running OSX 10.9.3)

a clarification although mac os uses the unix kernel it is not the same as using gnu/linux. the linux kernel is similar but not the same as unix.

bbdynamite 06-27-2014 03:01 PM

No offense taken. This is my first computer science-related job so I'm still trying to find my way.

I'm almost certain that hadoop can be installed on Mac; there are a few websites with instructions on how to do so. I don't doubt that I could be going about it the wrong way though.

Quote:

a clarification although mac os uses the unix kernel it is not the same as using gnu/linux. the linux kernel is similar but not the same as unix.
Thanks, good to know!

bbdynamite 06-27-2014 03:03 PM

Macintosh:~ hadoop$ file hadoop-2.2.0.tar.gz
hadoop-2.2.0.tar.gz: cannot open `hadoop-2.2.0.tar.gz' (No such file or directory)


The file is definitely in my Downloads folder. Should I move it somewhere else?

schneidz 06-27-2014 03:10 PM

maybe you are currenlty not in the rite directory (which would explain why its not finding it).

type in:
Code:

pwd
ls -al

and post the results.

bbdynamite 06-27-2014 03:14 PM

Macintosh:~ hadoop$ pwd
/Users/hadoop
Macintosh:~ hadoop$ ls -al
total 40
drwxr-xr-x+ 16 hadoop staff 544 Jun 25 20:32 .
drwxr-xr-x 7 root admin 238 Jun 25 20:14 ..
-rw------- 1 hadoop staff 3 Jun 25 20:12 .CFUserTextEncoding
-rw-r--r--@ 1 hadoop staff 15364 Jun 25 21:06 .DS_Store
drwx------ 4 hadoop staff 136 Jun 26 23:50 .Trash
-rw------- 1 hadoop staff 404 Jun 26 23:50 .bash_history
drwx------ 5 hadoop staff 170 Jun 25 20:34 .ssh
drwx------ 3 hadoop staff 102 Jun 25 20:18 Applications
drwx------+ 7 hadoop staff 238 Jun 25 21:16 Desktop
drwx------+ 4 hadoop staff 136 Jun 26 18:39 Documents
drwx------+ 8 hadoop staff 272 Jun 25 21:08 Downloads
drwx------@ 45 hadoop staff 1530 Jun 27 14:58 Library
drwx------+ 3 hadoop staff 102 Jun 25 20:12 Movies
drwx------+ 3 hadoop staff 102 Jun 25 20:12 Music
drwx------+ 3 hadoop staff 102 Jun 25 20:12 Pictures
drwxr-xr-x+ 5 hadoop staff 170 Jun 25 20:12 Public


Should I cd to Downloads?

schneidz 06-27-2014 03:16 PM

^ sure.

bbdynamite 06-27-2014 03:37 PM

I think it worked! I didn't get an error this time. Thank you so much!!

In retrospect, this was a really dumb question to post. Sorry about that.

I have 2 more questions:
1. After tar -xzf hadoop-2.2.0.tar.gz the instructions say to type chown -R hadoop hadoop-2.2.0.tar.gz. What does the second command do?
2. The Apache website says to verify the integrity of the downloaded hadoop file using the PGP signature. I didn't do this because I couldn't find the public keys and the asc signature file. Will this be an issue later on when I am configuring or using hadoop?

dijetlo 06-27-2014 05:38 PM

Quote:

After tar -xzf hadoop-2.2.0.tar.gz the instructions say to type chown -R hadoop hadoop-2.2.0.tar.gz. What does the second command do?
It makes a user named hadoop the owner of the archive hadoop-2.2.tar.gz.
Quote:

Will this be an issue later on when I am configuring or using hadoop?
Probably not, it's normally just an md5checksum to insure the source file and destination file are identical, it's normally needed after that.

bbdynamite 06-27-2014 09:48 PM

Thanks!

So now I am supposed to edit the .bashrc file, but I can't locate the .bashrc file. Instructions say I should "update its appropriate configuration files instead of .bashrc" if I don't have a .bashrc file. (Although I'm pretty certain I do have a .bashrc file.)
What are my appropriate configuration files, can you tell?

Macintosh:hadoop hadoop$ cd /Users/hadoop/Downloads/hadoop-2.2.0/etc/hadoop
Macintosh:hadoop hadoop$ ls
capacity-scheduler.xml httpfs-site.xml
configuration.xsl log4j.properties
container-executor.cfg mapred-env.cmd
core-site.xml mapred-env.sh
hadoop-env.cmd mapred-queues.xml.template
hadoop-env.sh mapred-site.xml
hadoop-metrics.properties mapred-site.xml.template
hadoop-metrics2.properties slaves
hadoop-policy.xml ssl-client.xml.example
hdfs-site.xml ssl-server.xml.example
httpfs-env.sh yarn-env.cmd
httpfs-log4j.properties yarn-env.sh
httpfs-signature.secret yarn-site.xml


All times are GMT -5. The time now is 07:27 AM.