Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

기타 ubuntu에 hadoop 2.0.5설치하기

총관리자 2013.12.16 22:09 조회 수 : 4463

출처 : http://www.spikyjohn.com/cribsheets/20130609_hadoopinstall.html

Just the command lines to get hadoop 2 installed on Ubuntu. These are all cribbed from the following source notes, and I am preserving them here for my own benefit so I can quickly repeat what I did. Note many of these instructions are also in the main hadoop docs from apache.

Source material	Use Michael-noll's guide for version 1 & ssh http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/ http://hadoop.apache.org/docs/r1.1.2/single_node_setup.html Or this one for Hadoop 2 http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html http://hadoop.apache.org/docs/r2.0.5-alpha/
Create the hadoop user and ssh	sudo apt-get install openssh-server openssh-client sudo addgroup hadoop sudo adduser --ingroup hadoop hduser su - hduser If you cannot ssh to localhost without a passphrase, execute the following commands: ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys Testing your SSH ssh localhost Say yes #exit
Get hadoop all set up	As the hduser, after downloading the tar tar -xvf hadoop-2.0.5-alpha.tar.gz ln -s hadoop-2.0.5-alpha hadoop #edit .bashrc export JAVA_HOME=/usr/lib/jvm/jdk1.7.0_21/ export HADOOP_PREFIX="/home/hduser/hadoop" export PATH=$PATH:$HADOOP_PREFIX/bin export PATH=$PATH:$HADOOP_PREFIX/sbin export HADOOP_MAPRED_HOME=${HADOOP_PREFIX} export HADOOP_COMMON_HOME=${HADOOP_PREFIX} export HADOOP_HDFS_HOME=${HADOOP_PREFIX} export YARN_HOME=${HADOOP_PREFIX}
Stolen entirely from JJ, but with path changed for my Ubuntu	Stolen from http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html Please click on his blog. Login again so bash has paths above. In Hadoop 2.x version /etc/hadoop is the default conf directory. We need to modify / create following property files in the /etc/hadoop directory cd ~ mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/name;mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/data;mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/system;mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/local Edit core-site.xml with following contents <configuration> <property> <name>fs.default.name</name> <value>hdfs://localhost:8020</value> <description>The name of the default file system. Either the literal string "local" or a host:port for NDFS.</description> <final>true</final> </property> </configuration> Edit hdfs-site.xml with following contents <configuration> <property> <name>dfs.namenode.name.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name</value> <description>Determines where on the local filesystem the DFS name node should store the name table. If this is a comma-delimited list of directories then the name table is replicated in all of the directories, for redundancy. </description> <final>true</final> </property> <property> <name>dfs.datanode.data.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data</value> <description>Determines where on the local filesystem an DFS data node should store its blocks. If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices. Directories that do not exist are ignored. </description> <final>true</final> </property> <property> <name>dfs.replication</name> <value>1</value> </property> <property> <name>dfs.permissions</name> <value>false</value> </property> </configuration> The path file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name AND file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data are some folders in your computer which would give space to store data and name edit files Path should be specified as URI Create a file mapred-site.xml inside /etc/hadoop with following contents <configuration> <property> <name>mapreduce.framework.name</name> <value>yarn</value> </property> <property> <name>mapred.system.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system</value> <final>true</final> </property> <property> <name>mapred.local.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local</value> <final>true</final> </property> </configuration> The path file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system AND file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local are some folders in your computer which would give space to store data Path should be specified as URI Edit yarn-site.xml with following contents <configuration> <property> <name>yarn.nodemanager.aux-services</name> <value>mapreduce.shuffle</value> </property> <property> <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name> <value>org.apache.hadoop.mapred.ShuffleHandler</value> </property> </configuration> Format the namenode # hdfs namenode –format Say Yes and let it complete the format Time to start the daemons # hadoop-daemon.sh start namenode # hadoop-daemon.sh start datanode You can also start both of them together by # start-dfs.sh Start Yarn Daemons # yarn-daemon.sh start resourcemanager # yarn-daemon.sh start nodemanager You can also start all yarn daemons together by # start-yarn.sh Time to check if Daemons have started Enter the command # jps 2539 NameNode 2744 NodeManager 3075 Jps 3030 DataNode 2691 ResourceManager Time to launch UI Open the localhost:8088 to see the Resource Manager page Done :) Happy Hadooping :)

이 게시물을

이 글의 추천인 목록 목록

번호	제목	날짜	조회 수
730	HAX is not working and emulator runs in emulation mode 메세지가 나오는 경우	2015.05.25	2268
729	운영계 하둡클러스터에 노드 4대를 EdgeNode로 추가하는 방법/절차	2025.01.12	2291
728	여러가지 방법으로 특정 jar파일을 exclude하지 못하는 경우 해당 jar파일을 제외시키는 방법	2016.08.11	2331
727	주문 생성 데이터 예시	2022.04.30	2331
726	S2RDF모듈의 실행부분만 추출하여 별도록 실행하는 방법(draft)	2016.06.14	2340
725	ntp시간 맞추기	2018.09.12	2342
724	Hadoop 2.7.x에서 사용할 수 있는 파일/디렉토리 관련 util성 클래스 파일	2017.09.28	2346
723	호출 url현황	2023.02.21	2349
722	시맨틱 관련 논문 모음 사이트	2017.06.13	2377
721	protege 4.3 다운로드	2015.12.09	2391
720	센서테스트	2015.05.25	2395
719	파일끝에 붙는 ^M 일괄 지우기(linux, unix(AIX)) 혹은 파일내에 있는 ^M지우기	2016.09.24	2403
718	LUBM 개수별 hadoop HDFS data사이즈 정리	2017.04.06	2416
717	Lagom에서 제공하는 Maven을 이용한 Hello프로젝트 자동생성 및 실행	2018.01.19	2416
716	drools를 이용한 로그,rule matching등의 테스트 java프로그램	2016.07.21	2419
715	Apache Kudu에서 동일한 이름의 테이블을 반복적으로 DROP → CREATE → INSERT하는 로직을 2분 간격으로 10회 수행할 때 발생할 수 있는 주요 이슈	2025.01.26	2423
714	shard3가 있는 서버에 문제가 있는 상태에서 solr query를 요청하는 경우 "no servers hosting shard: shard3" 오류가 발생하는 경우 조치사항	2018.01.04	2436
713	Lagom프레임웍에서 제공하는 HelloWorld 테스트를 수행시 [unknown-version]오류가 발생하면서 빌드가 되지 않는 경우 조치사항	2017.12.22	2453
712	fuseki의 endpoint를 이용한 insert, delete하는 sparql예시	2018.02.14	2455
711	https://github.com/Merck/Halyard프로젝트 컴파일및 배포/테스트	2017.01.24	2461

쓰기 태그

첫 페이지 1 2 3 4 5 6 7 8 9 10 끝 페이지

Cloudera, BigData, Semantic IoT, Hadoop, NoSQL

Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

기타 ubuntu에 hadoop 2.0.5설치하기

댓글 0

Cloudera, BigData, Semantic IoT, Hadoop, NoSQL

Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

기타 ubuntu에 hadoop 2.0.5설치하기

댓글 0

LOGIN