Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

기타 ubuntu에 hadoop 2.0.5설치하기

총관리자 2013.12.16 22:09 조회 수 : 4459

출처 : http://www.spikyjohn.com/cribsheets/20130609_hadoopinstall.html

Just the command lines to get hadoop 2 installed on Ubuntu. These are all cribbed from the following source notes, and I am preserving them here for my own benefit so I can quickly repeat what I did. Note many of these instructions are also in the main hadoop docs from apache.

Source material	Use Michael-noll's guide for version 1 & ssh http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/ http://hadoop.apache.org/docs/r1.1.2/single_node_setup.html Or this one for Hadoop 2 http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html http://hadoop.apache.org/docs/r2.0.5-alpha/
Create the hadoop user and ssh	sudo apt-get install openssh-server openssh-client sudo addgroup hadoop sudo adduser --ingroup hadoop hduser su - hduser If you cannot ssh to localhost without a passphrase, execute the following commands: ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys Testing your SSH ssh localhost Say yes #exit
Get hadoop all set up	As the hduser, after downloading the tar tar -xvf hadoop-2.0.5-alpha.tar.gz ln -s hadoop-2.0.5-alpha hadoop #edit .bashrc export JAVA_HOME=/usr/lib/jvm/jdk1.7.0_21/ export HADOOP_PREFIX="/home/hduser/hadoop" export PATH=$PATH:$HADOOP_PREFIX/bin export PATH=$PATH:$HADOOP_PREFIX/sbin export HADOOP_MAPRED_HOME=${HADOOP_PREFIX} export HADOOP_COMMON_HOME=${HADOOP_PREFIX} export HADOOP_HDFS_HOME=${HADOOP_PREFIX} export YARN_HOME=${HADOOP_PREFIX}
Stolen entirely from JJ, but with path changed for my Ubuntu	Stolen from http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html Please click on his blog. Login again so bash has paths above. In Hadoop 2.x version /etc/hadoop is the default conf directory. We need to modify / create following property files in the /etc/hadoop directory cd ~ mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/name;mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/data;mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/system;mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/local Edit core-site.xml with following contents <configuration> <property> <name>fs.default.name</name> <value>hdfs://localhost:8020</value> <description>The name of the default file system. Either the literal string "local" or a host:port for NDFS.</description> <final>true</final> </property> </configuration> Edit hdfs-site.xml with following contents <configuration> <property> <name>dfs.namenode.name.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name</value> <description>Determines where on the local filesystem the DFS name node should store the name table. If this is a comma-delimited list of directories then the name table is replicated in all of the directories, for redundancy. </description> <final>true</final> </property> <property> <name>dfs.datanode.data.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data</value> <description>Determines where on the local filesystem an DFS data node should store its blocks. If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices. Directories that do not exist are ignored. </description> <final>true</final> </property> <property> <name>dfs.replication</name> <value>1</value> </property> <property> <name>dfs.permissions</name> <value>false</value> </property> </configuration> The path file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name AND file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data are some folders in your computer which would give space to store data and name edit files Path should be specified as URI Create a file mapred-site.xml inside /etc/hadoop with following contents <configuration> <property> <name>mapreduce.framework.name</name> <value>yarn</value> </property> <property> <name>mapred.system.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system</value> <final>true</final> </property> <property> <name>mapred.local.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local</value> <final>true</final> </property> </configuration> The path file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system AND file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local are some folders in your computer which would give space to store data Path should be specified as URI Edit yarn-site.xml with following contents <configuration> <property> <name>yarn.nodemanager.aux-services</name> <value>mapreduce.shuffle</value> </property> <property> <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name> <value>org.apache.hadoop.mapred.ShuffleHandler</value> </property> </configuration> Format the namenode # hdfs namenode –format Say Yes and let it complete the format Time to start the daemons # hadoop-daemon.sh start namenode # hadoop-daemon.sh start datanode You can also start both of them together by # start-dfs.sh Start Yarn Daemons # yarn-daemon.sh start resourcemanager # yarn-daemon.sh start nodemanager You can also start all yarn daemons together by # start-yarn.sh Time to check if Daemons have started Enter the command # jps 2539 NameNode 2744 NodeManager 3075 Jps 3030 DataNode 2691 ResourceManager Time to launch UI Open the localhost:8088 to see the Resource Manager page Done :) Happy Hadooping :)

이 게시물을

이 글의 추천인 목록 목록

번호	제목	날짜	조회 수
690	빅데이터 분석을 위한 샘플 빅데이터 파일 다운로드 사이트	2014.04.28	5336
689	Cloudera Hadoop and Spark Developer Certification 준비(참고)	2018.05.16	5309
688	[CDP7.1.7]impala-shell수행시 간헐적으로 "-k requires a valid kerberos ticket but no valid kerberos ticket found." 오류	2023.11.16	5307
687	org.apache.hadoop.hbase.PleaseHoldException: Master is initializing	2013.03.15	5303
686	HBase, BigTable, Cassandra Schema Design	2013.03.15	5294
685	Hive Query Examples from test code (1 of 2)	2014.03.26	5245
684	ping 안될때.. networking restart 날려주면 잘됨..	2014.05.09	5216
683	[Kudu]ERROR: Unable to advance iterator for node with id '2' for Kudu table 'impala::core.pm0_abdasubjct': Network error: recv error from unknown peer: Transport endpoint is not connected (error 107)	2023.03.16	5210
682	kafka broker기동시 brokerId가 달라서 기동에 실패하는 경우 조치방법	2016.05.02	5188
681	의사분산모드에 hadoop설치및 ecosystem 환경 정리	2014.05.29	5170
680	Hive+mysql 설치 및 환경구축하기	2013.03.07	5120
679	[백업] 리눅스 시스템 백업하기 (Linux System Backup) - TAR 사용 시스템 전체 백업	2022.01.19	5110
678	hive job실행시 meta정보를 원격의 mysql에 저장하는 경우 설정방법	2014.05.28	5098
677	HBase 설치하기 – Pseudo-distributed	2013.03.12	5093
676	kudu rebalance수행 command예시	2022.01.17	5072
675	Master rejected startup because clock is out of sync 오류 해결방법	2016.05.03	5067
674	List<Map<String, String>>형태의 데이타에서 중복제거 하는 방법	2016.12.23	5059
673	HiveServer2인증을 PAM을 이용하도록 설정하는 방법	2018.07.21	5043
672	build.gradle을 pom.xml로 변환하는 방법	2016.08.18	5010
671	Cloudera의 API를 이용하여 impala의 실행되었던 쿼리 확인하는 예시	2018.05.03	5002

쓰기 태그

첫 페이지 1 2 3 4 5 6 7 8 9 10 끝 페이지

Cloudera, BigData, Semantic IoT, Hadoop, NoSQL

Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

기타 ubuntu에 hadoop 2.0.5설치하기

댓글 0

Cloudera, BigData, Semantic IoT, Hadoop, NoSQL

Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

기타 ubuntu에 hadoop 2.0.5설치하기

댓글 0

LOGIN