Bigdata, Hadoop ecosystem, Semantic IoT등의 프로젝트를 진행중에 습득한 내용을 정리하는 곳입니다.
필요한 분을 위해서 공개하고 있습니다. 문의사항은 gooper@gooper.com로 메일을 보내주세요.

hive hive에서 생성된 external table에서 hbase의 table에 값 insert하기

총관리자 2014.04.11 14:04 조회 수 : 1748

1. hive table(file을 바라보고 있으며 hbase table(아래의 hbase_mytable)에 값을 넣기 위한 src table) 을 external로 table 생성

CREATE EXTERNAL TABLE IF NOT EXISTS external_file
     (
     FOO STRING,
     BAR STRING
     )
     COMMENT 'TEST TABLE OF EMP_IP_TABLE'
     ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
     STORED AS TEXTFILE LOCATION '/data';

* /data에 들어 있는 파일 내용

hadoop@bigdata-host:~/hive/conf$ hadoop fs -cat /data/external_file.txt
a,b
a1,b1
a2,b2
a3,b3

2.hive table( hbase table을 바라보는 테이블)생성

CREATE EXTERNAL TABLE hbase_mytable(table_id string, foo string, bar string)
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,cf:foo,cf:bar")
TBLPROPERTIES("hbase.table.name" = "mytable");

3. hive기동시 아래와 같이 jar를 포함해준다.

adoop@bigdata-host:~/hive/bin$ hive --auxpath /home/hadoop/hive/lib/hbase-0.94.6.1.jar,/home/hadoop/hive/lib/zookeeper-3.4.3.jar,/home/hadoop/hive/lib/hive-hbase-handler-0.11.0.jar,/home/hadoop/hive/lib/guava-11.0.2.jar,/home/hadoop/hive/lib/hive-contrib-0.11.0.jar -hiveconf hbase.master=localhost:60000

4. hive에 들어가서.. table을 생성한 후 hbase table에 입력 실행결과......

hive> insert into table hbase_mytable select foo, foo, bar from external_file;
Total MapReduce jobs = 1
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Starting Job = job_201404111158_0008, Tracking URL = http://localhost:50030/jobdetails.jsp?jobid=job_201404111158_0008
Kill Command = /home/hadoop/hadoop-1.2.1/libexec/../bin/hadoop job -kill job_201404111158_0008
Hadoop job information for Stage-0: number of mappers: 1; number of reducers: 0
2014-04-11 13:56:49,322 Stage-0 map = 0%, reduce = 0%
2014-04-11 13:56:55,482 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:56:56,500 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:56:57,518 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:56:58,541 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:56:59,561 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:57:00,587 Stage-0 map = 100%, reduce = 100%, Cumulative CPU 1.74 sec
MapReduce Total cumulative CPU time: 1 seconds 740 msec
Ended Job = job_201404111158_0008
4 Rows loaded to hbase_mytable
MapReduce Jobs Launched:
Job 0: Map: 1 Cumulative CPU: 1.74 sec HDFS Read: 220 HDFS Write: 0 SUCCESS
Total MapReduce CPU Time Spent: 1 seconds 740 msec
OK
Time taken: 34.565 seconds

5. hbase_mytable의 값 확인(기존에 있던 값하고 새로 추가된 값이 같이 보인다.)

hive> select * from hbase_mytable;
OK
2.5 1.3 NULL
a a b
a1 a1 b1
a2 a2 b2
a3 a3 b3
second 3 NULL
third NULL 3.14159
Time taken: 1.315 seconds, Fetched: 7 row(s)

6. hbase shell에서 확인

hbase(main):001:0> scan 'mytable'
ROW                        COLUMN+CELL
2.5                       column=cf:foo, timestamp=1397112248576, value=1.3
a                         column=cf:bar, timestamp=1397192214568, value=b
a                         column=cf:foo, timestamp=1397192214568, value=a
a1                        column=cf:bar, timestamp=1397192214568, value=b1
a1                        column=cf:foo, timestamp=1397192214568, value=a1
a2                        column=cf:bar, timestamp=1397192214568, value=b2
a2                        column=cf:foo, timestamp=1397192214568, value=a2
a3                        column=cf:bar, timestamp=1397192214568, value=b3
a3                        column=cf:foo, timestamp=1397192214568, value=a3
first                     column=cf:message, timestamp=1397109873612, value=hellp Hbase
second                    column=cf:foo, timestamp=1397112803662, value=3
second2                   column=cf:foo2, timestamp=1397112883691, value=3
third                     column=cf:bar, timestamp=1397109940598, value=3.14159
9 row(s) in 1.8090 seconds

이 게시물을

번호	제목	글쓴이	날짜	조회 수
399	banana pi에 hive 0.13.1+mysql(metastore)설치	총관리자	2014.09.09	2406
398	Cacti로 Hadoop 모니터링 하기	구퍼	2013.03.12	2367
397	AIX 7.1에 MariaDB 10.2 소스 설치	총관리자	2016.09.24	2364
396	jupyter, zeppelin, rstudio를 이용하여 spark cluster에 job를 실행시키기 위한 정보	총관리자	2018.04.13	2335
395	kafka broker기동시 brokerId가 달라서 기동에 실패하는 경우 조치방법	총관리자	2016.05.02	2331
394	hadoop설치시 오류	총관리자	2013.12.18	2313
393	메이븐 (maven) 설치 및 이클립스 연동하기	구퍼	2013.03.06	2280
392	hadoop설치시 참고사항	구퍼	2013.03.08	2131
391	W/F수행후 Logs not available for 1. Aggregation may not to complete. 표시되며 로그내용이 보이지 않은 경우	총관리자	2020.05.08	2110
390	hbase에 필요한 jar들	구퍼	2013.04.01	2100
389	Hive java connection 설정	구퍼	2013.04.01	2013
388	Hadoop 설치 및 시작하기	구퍼	2013.03.06	1951
387	hbase shell 필드 검색 방법	총관리자	2015.05.24	1900
386	VisualVM 1.3.9을 이용한 spark-submit JVM 모니터링을 위한 설정및 spark-submit실행 옵션	총관리자	2016.10.28	1891
385	Hadoop wordcount 소스 작성	구퍼	2013.03.06	1888
384	Spark 2.1.1 clustering(5대) 설치(YARN기반)	총관리자	2016.04.22	1882
383	hadoop 2.6.0에 sqoop2 (1.99.5) server및 client설치 == fail	총관리자	2015.06.11	1770
»	hive에서 생성된 external table에서 hbase의 table에 값 insert하기	총관리자	2014.04.11	1748
381	java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error: Unable to deserialize reduce input key from...오류해결방법	총관리자	2015.06.16	1726
380	access=WRITE, inode="staging":ubuntu:supergroup:rwxr-xr-x 오류	총관리자	2014.07.05	1719

쓰기 태그

첫 페이지 1 2 3 4 5 6 7 8 9 10 끝 페이지

A personal place to organize information learned during the development of such Hadoop, Hive, Hbase, Semantic IoT, etc.
We are open to the required minutes. Please send inquiries to gooper@gooper.com.

Bigdata, Semantic IoT, Hadoop, NoSQL

Bigdata, Hadoop ecosystem, Semantic IoT등의 프로젝트를 진행중에 습득한 내용을 정리하는 곳입니다.
필요한 분을 위해서 공개하고 있습니다. 문의사항은 gooper@gooper.com로 메일을 보내주세요.

hive hive에서 생성된 external table에서 hbase의 table에 값 insert하기

댓글 0

A personal place to organize information learned during the development of such Hadoop, Hive, Hbase, Semantic IoT, etc.
We are open to the required minutes. Please send inquiries to gooper@gooper.com.

Bigdata, Semantic IoT, Hadoop, NoSQL

Bigdata, Hadoop ecosystem, Semantic IoT등의 프로젝트를 진행중에 습득한 내용을 정리하는 곳입니다. 필요한 분을 위해서 공개하고 있습니다. 문의사항은 gooper@gooper.com로 메일을 보내주세요.

hive hive에서 생성된 external table에서 hbase의 table에 값 insert하기

댓글 0

A personal place to organize information learned during the development of such Hadoop, Hive, Hbase, Semantic IoT, etc. We are open to the required minutes. Please send inquiries to gooper@gooper.com.

LOGIN

Bigdata, Hadoop ecosystem, Semantic IoT등의 프로젝트를 진행중에 습득한 내용을 정리하는 곳입니다.
필요한 분을 위해서 공개하고 있습니다. 문의사항은 gooper@gooper.com로 메일을 보내주세요.

A personal place to organize information learned during the development of such Hadoop, Hive, Hbase, Semantic IoT, etc.
We are open to the required minutes. Please send inquiries to gooper@gooper.com.