
Bigdata, Semantic IoT, Hadoop, NoSQL

A place to organize what has been learned while working on Bigdata, Hadoop ecosystem, and Semantic IoT projects.
It is shared publicly for anyone who needs it. For inquiries, please send an e-mail to gooper@gooper.com.


Misc: Installing Hadoop 2.0.5 on Ubuntu

Site administrator  2013.12.16 22:09  Views: 1882

Source: http://www.spikyjohn.com/cribsheets/20130609_hadoopinstall.html

 

Just the command lines to get Hadoop 2 installed on Ubuntu. These are all cribbed from the source notes below; I am preserving them here for my own benefit so I can quickly repeat what I did. Note that many of these instructions are also in the main Hadoop docs from Apache.

Source material

Use Michael Noll's guide for version 1 and SSH
http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
http://hadoop.apache.org/docs/r1.1.2/single_node_setup.html

Or this one for Hadoop 2
http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html
http://hadoop.apache.org/docs/r2.0.5-alpha/

Create the hadoop user and ssh

sudo apt-get install openssh-server openssh-client

sudo addgroup hadoop
sudo adduser --ingroup hadoop hduser
su - hduser

If you cannot ssh to localhost without a passphrase, execute the following commands:
ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa
cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys

Testing your SSH
ssh localhost
Say yes
#exit
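
If ssh localhost still prompts for a password, the usual culprit is permissions on the .ssh directory. A quick fix worth trying (my addition, not part of the original tutorial):

chmod 700 ~/.ssh
chmod 600 ~/.ssh/authorized_keys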

Get hadoop all set up

As the hduser, after downloading the tar

tar -xvf hadoop-2.0.5-alpha.tar.gz
ln -s hadoop-2.0.5-alpha hadoop
#edit .bashrc
export JAVA_HOME=/usr/lib/jvm/jdk1.7.0_21/
export HADOOP_PREFIX="/home/hduser/hadoop"
export PATH=$PATH:$HADOOP_PREFIX/bin
export PATH=$PATH:$HADOOP_PREFIX/sbin

export HADOOP_MAPRED_HOME=${HADOOP_PREFIX}
export HADOOP_COMMON_HOME=${HADOOP_PREFIX}
export HADOOP_HDFS_HOME=${HADOOP_PREFIX}
export YARN_HOME=${HADOOP_PREFIX}
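
After logging in again (or sourcing .bashrc), a quick sanity check that the environment is wired up correctly (my own check, not from the original post):

echo $HADOOP_PREFIX
hadoop version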

Stolen entirely from JJ, but with paths changed for my Ubuntu.

Stolen from http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html. Please visit his blog for the original.

Log in again so bash picks up the paths above. In Hadoop 2.x the default configuration directory is etc/hadoop under the installation directory (here, $HADOOP_PREFIX/etc/hadoop). We need to modify or create the following property files in that directory.

cd ~
mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/name
mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/data
mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/system
mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/local

Edit core-site.xml with the following contents

<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:8020</value>
<description>The name of the default file system. Either the literal string "local" or a host:port for NDFS.</description>
<final>true</final>
</property>
</configuration>
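
Note: fs.default.name is the Hadoop 1.x property name; it still works in 2.0.5-alpha but is deprecated. If you prefer the newer name, the equivalent entry would be (my addition, not in the original tutorial):

<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:8020</value>
</property>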

Edit hdfs-site.xml with the following contents

<configuration>
<property>
<name>dfs.namenode.name.dir</name>
<value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name</value>
<description>Determines where on the local filesystem the DFS name node
should store the name table. If this is a comma-delimited list of directories then the name table is replicated in all of the
directories, for redundancy. </description>
<final>true</final>
</property>

<property>
<name>dfs.datanode.data.dir</name>
<value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data</value>
<description>Determines where on the local filesystem a DFS data node
should store its blocks. If this is a comma-delimited list of directories, then data will be stored in all named
directories, typically on different devices. Directories that do not exist are ignored.
</description>
<final>true</final>
</property>

<property>
<name>dfs.replication</name>
<value>1</value>
</property>

<property>
<name>dfs.permissions</name>
<value>false</value>
</property>

</configuration>

The paths
file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name and
file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data
are folders on your machine that provide space for the namenode metadata (name table and edits) and the datanode blocks.

The paths should be specified as URIs.
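
Note: dfs.permissions is also a Hadoop 1.x property name; the Hadoop 2 equivalent, if you want to use the current name, would be (my addition, not in the original tutorial):

<property>
<name>dfs.permissions.enabled</name>
<value>false</value>
</property>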
Create a file mapred-site.xml inside the etc/hadoop configuration directory with the following contents

<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>

<property>
<name>mapred.system.dir</name>
<value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system</value>
<final>true</final>
</property>

<property>
<name>mapred.local.dir</name>
<value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local</value>
<final>true</final>
</property>

</configuration>

The paths

file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system and
file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local
are folders on your machine that provide space for the MapReduce system and local working data.

The paths should be specified as URIs.
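
If the distribution ships a mapred-site.xml.template in etc/hadoop (later 2.x tarballs do; I have not verified 2.0.5-alpha), you can also start from that template instead of creating mapred-site.xml from scratch:

cp $HADOOP_PREFIX/etc/hadoop/mapred-site.xml.template $HADOOP_PREFIX/etc/hadoop/mapred-site.xml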

Edit yarn-site.xml with the following contents

<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce.shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
</configuration>
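
The value mapreduce.shuffle matches 2.0.5-alpha; in later Hadoop 2 releases (2.2.0 onwards) the auxiliary service was renamed to mapreduce_shuffle, so if you reuse this file there the property would become (my note, not part of the original tutorial):

<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>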

Format the namenode

# hdfs namenode -format

Say Yes and let it complete the format

Time to start the daemons

# hadoop-daemon.sh start namenode
# hadoop-daemon.sh start datanode

You can also start both of them together by

# start-dfs.sh

Start Yarn Daemons

# yarn-daemon.sh start resourcemanager
# yarn-daemon.sh start nodemanager

You can also start all yarn daemons together by

# start-yarn.sh

Time to check if Daemons have started

Enter the command

# jps
2539 NameNode
2744 NodeManager
3075 Jps
3030 DataNode
2691 ResourceManager
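
With all five daemons showing up in jps, a quick smoke test of HDFS (the paths below are just my examples, not from the original post):

hdfs dfs -mkdir /user
hdfs dfs -mkdir /user/hduser
hdfs dfs -put ~/hadoop/etc/hadoop/core-site.xml /user/hduser/
hdfs dfs -ls /user/hduser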

Time to launch UI

Open http://localhost:8088 in a browser to see the ResourceManager page
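
The NameNode web UI should also be up, typically at localhost:50070 in this release.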

Done :)

Happy Hadooping :)
