Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

기타 ubuntu에 hadoop 2.0.5설치하기

총관리자 2013.12.16 22:09 조회 수 : 4462

출처 : http://www.spikyjohn.com/cribsheets/20130609_hadoopinstall.html

Just the command lines to get hadoop 2 installed on Ubuntu. These are all cribbed from the following source notes, and I am preserving them here for my own benefit so I can quickly repeat what I did. Note many of these instructions are also in the main hadoop docs from apache.

Source material	Use Michael-noll's guide for version 1 & ssh http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/ http://hadoop.apache.org/docs/r1.1.2/single_node_setup.html Or this one for Hadoop 2 http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html http://hadoop.apache.org/docs/r2.0.5-alpha/
Create the hadoop user and ssh	sudo apt-get install openssh-server openssh-client sudo addgroup hadoop sudo adduser --ingroup hadoop hduser su - hduser If you cannot ssh to localhost without a passphrase, execute the following commands: ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys Testing your SSH ssh localhost Say yes #exit
Get hadoop all set up	As the hduser, after downloading the tar tar -xvf hadoop-2.0.5-alpha.tar.gz ln -s hadoop-2.0.5-alpha hadoop #edit .bashrc export JAVA_HOME=/usr/lib/jvm/jdk1.7.0_21/ export HADOOP_PREFIX="/home/hduser/hadoop" export PATH=$PATH:$HADOOP_PREFIX/bin export PATH=$PATH:$HADOOP_PREFIX/sbin export HADOOP_MAPRED_HOME=${HADOOP_PREFIX} export HADOOP_COMMON_HOME=${HADOOP_PREFIX} export HADOOP_HDFS_HOME=${HADOOP_PREFIX} export YARN_HOME=${HADOOP_PREFIX}
Stolen entirely from JJ, but with path changed for my Ubuntu	Stolen from http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html Please click on his blog. Login again so bash has paths above. In Hadoop 2.x version /etc/hadoop is the default conf directory. We need to modify / create following property files in the /etc/hadoop directory cd ~ mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/name;mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/data;mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/system;mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/local Edit core-site.xml with following contents <configuration> <property> <name>fs.default.name</name> <value>hdfs://localhost:8020</value> <description>The name of the default file system. Either the literal string "local" or a host:port for NDFS.</description> <final>true</final> </property> </configuration> Edit hdfs-site.xml with following contents <configuration> <property> <name>dfs.namenode.name.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name</value> <description>Determines where on the local filesystem the DFS name node should store the name table. If this is a comma-delimited list of directories then the name table is replicated in all of the directories, for redundancy. </description> <final>true</final> </property> <property> <name>dfs.datanode.data.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data</value> <description>Determines where on the local filesystem an DFS data node should store its blocks. If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices. Directories that do not exist are ignored. </description> <final>true</final> </property> <property> <name>dfs.replication</name> <value>1</value> </property> <property> <name>dfs.permissions</name> <value>false</value> </property> </configuration> The path file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name AND file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data are some folders in your computer which would give space to store data and name edit files Path should be specified as URI Create a file mapred-site.xml inside /etc/hadoop with following contents <configuration> <property> <name>mapreduce.framework.name</name> <value>yarn</value> </property> <property> <name>mapred.system.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system</value> <final>true</final> </property> <property> <name>mapred.local.dir</name> <value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local</value> <final>true</final> </property> </configuration> The path file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system AND file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local are some folders in your computer which would give space to store data Path should be specified as URI Edit yarn-site.xml with following contents <configuration> <property> <name>yarn.nodemanager.aux-services</name> <value>mapreduce.shuffle</value> </property> <property> <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name> <value>org.apache.hadoop.mapred.ShuffleHandler</value> </property> </configuration> Format the namenode # hdfs namenode –format Say Yes and let it complete the format Time to start the daemons # hadoop-daemon.sh start namenode # hadoop-daemon.sh start datanode You can also start both of them together by # start-dfs.sh Start Yarn Daemons # yarn-daemon.sh start resourcemanager # yarn-daemon.sh start nodemanager You can also start all yarn daemons together by # start-yarn.sh Time to check if Daemons have started Enter the command # jps 2539 NameNode 2744 NodeManager 3075 Jps 3030 DataNode 2691 ResourceManager Time to launch UI Open the localhost:8088 to see the Resource Manager page Done :) Happy Hadooping :)

이 게시물을

이 글의 추천인 목록 목록

번호	제목	날짜	조회 수
50	JobHistory 서버 기동시 HDFS상에 특정 폴더를 생성할 수 없어서 기동하지 못하는 경우 조치	2018.05.29	5652
49	hbase shell 필드 검색 방법	2015.05.24	5711
48	solr 6.2에 한글 형태소 분석기(arirang 6.x) 적용 및 테스트	2017.06.27	5750
47	System Properties Comparison Elasticsearch vs. Hive vs. Jena	2016.03.10	5784
46	"java.net.NoRouteToHostException: 호스트로 갈 루트가 없음" 오류시 확인및 조치할 사항	2016.04.01	5814
45	HBASE Client API : 기본 기능 정리	2013.04.01	5852
44	[CDP7.1.7]impala-shell을 이용하여 kudu table에 insert/update수행시 발생하는 오류(Transport endpoint is not connected (error 107)) 발생시 확인할 내용	2023.11.30	5888
43	java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error: Unable to deserialize reduce input key from...오류해결방법	2015.06.16	5944
42	HBase 설치하기 – Fully-distributed	2013.03.12	6014
41	Hbase Shell 명령 정리	2013.04.01	6036
40	Hadoop Cluster 설치 (Hadoop+Zookeeper+Hbase)	2013.03.07	6099
39	checking for termcap functions library... configure: error: No curses/termcap library found	2013.03.08	6099
38	protege 설명및 사용법	2017.04.04	6132
37	[impala]insert into db명.table명 select a, b from db명.table명 쿼리 수행시 "Memory limit exceeded: Failed to allocate memory for Parquet page index"오류 조치 방법	2023.05.31	6339
36	원보드pc인 bananapi를 이용하여 hadoop 클러스터 구성하기(준비물)	2014.05.29	6432
35	hadoop및 ecosystem에서 사용되는 명령문 정리	2014.05.28	6435
34	다수의 로그 에이전트로 부터 로그를 받아 각각의 파일로 저장하는 방법(interceptor및 multiplexing)	2014.04.04	6439
33	hadoop 2.6.0 기동(에코시스템 포함)및 wordcount 어플리케이션을 이용한 테스트	2015.05.05	6542
32	Last transaction was partial에 따른 Unable to load database on disk오류 발생시 조치사항	2018.08.03	6653
31	import 혹은 export할때 hive파일의 default 구분자는 --input-fields-terminated-by "x01"와 같이 지정해야함	2014.05.20	6701

쓰기 태그

첫 페이지 29 30 31 32 33 34 35 36 37 38 끝 페이지

Cloudera, BigData, Semantic IoT, Hadoop, NoSQL

Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

기타 ubuntu에 hadoop 2.0.5설치하기

댓글 0

Cloudera, BigData, Semantic IoT, Hadoop, NoSQL

Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

기타 ubuntu에 hadoop 2.0.5설치하기

댓글 0

LOGIN