Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

hive hive에서 생성된 external table에서 hbase의 table에 값 insert하기

총관리자 2014.04.11 14:04 조회 수 : 4627

1. hive table(file을 바라보고 있으며 hbase table(아래의 hbase_mytable)에 값을 넣기 위한 src table) 을 external로 table 생성

CREATE EXTERNAL TABLE IF NOT EXISTS external_file
     (
     FOO STRING,
     BAR STRING
     )
     COMMENT 'TEST TABLE OF EMP_IP_TABLE'
     ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
     STORED AS TEXTFILE LOCATION '/data';

* /data에 들어 있는 파일 내용

hadoop@bigdata-host:~/hive/conf$ hadoop fs -cat /data/external_file.txt
a,b
a1,b1
a2,b2
a3,b3

2.hive table( hbase table을 바라보는 테이블)생성

CREATE EXTERNAL TABLE hbase_mytable(table_id string, foo string, bar string)
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,cf:foo,cf:bar")
TBLPROPERTIES("hbase.table.name" = "mytable");

3. hive기동시 아래와 같이 jar를 포함해준다.

adoop@bigdata-host:~/hive/bin$ hive --auxpath /home/hadoop/hive/lib/hbase-0.94.6.1.jar,/home/hadoop/hive/lib/zookeeper-3.4.3.jar,/home/hadoop/hive/lib/hive-hbase-handler-0.11.0.jar,/home/hadoop/hive/lib/guava-11.0.2.jar,/home/hadoop/hive/lib/hive-contrib-0.11.0.jar -hiveconf hbase.master=localhost:60000

4. hive에 들어가서.. table을 생성한 후 hbase table에 입력 실행결과......

hive> insert into table hbase_mytable select foo, foo, bar from external_file;
Total MapReduce jobs = 1
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Starting Job = job_201404111158_0008, Tracking URL = http://localhost:50030/jobdetails.jsp?jobid=job_201404111158_0008
Kill Command = /home/hadoop/hadoop-1.2.1/libexec/../bin/hadoop job -kill job_201404111158_0008
Hadoop job information for Stage-0: number of mappers: 1; number of reducers: 0
2014-04-11 13:56:49,322 Stage-0 map = 0%, reduce = 0%
2014-04-11 13:56:55,482 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:56:56,500 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:56:57,518 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:56:58,541 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:56:59,561 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.74 sec
2014-04-11 13:57:00,587 Stage-0 map = 100%, reduce = 100%, Cumulative CPU 1.74 sec
MapReduce Total cumulative CPU time: 1 seconds 740 msec
Ended Job = job_201404111158_0008
4 Rows loaded to hbase_mytable
MapReduce Jobs Launched:
Job 0: Map: 1 Cumulative CPU: 1.74 sec HDFS Read: 220 HDFS Write: 0 SUCCESS
Total MapReduce CPU Time Spent: 1 seconds 740 msec
OK
Time taken: 34.565 seconds

5. hbase_mytable의 값 확인(기존에 있던 값하고 새로 추가된 값이 같이 보인다.)

hive> select * from hbase_mytable;
OK
2.5 1.3 NULL
a a b
a1 a1 b1
a2 a2 b2
a3 a3 b3
second 3 NULL
third NULL 3.14159
Time taken: 1.315 seconds, Fetched: 7 row(s)

6. hbase shell에서 확인

hbase(main):001:0> scan 'mytable'
ROW                        COLUMN+CELL
2.5                       column=cf:foo, timestamp=1397112248576, value=1.3
a                         column=cf:bar, timestamp=1397192214568, value=b
a                         column=cf:foo, timestamp=1397192214568, value=a
a1                        column=cf:bar, timestamp=1397192214568, value=b1
a1                        column=cf:foo, timestamp=1397192214568, value=a1
a2                        column=cf:bar, timestamp=1397192214568, value=b2
a2                        column=cf:foo, timestamp=1397192214568, value=a2
a3                        column=cf:bar, timestamp=1397192214568, value=b3
a3                        column=cf:foo, timestamp=1397192214568, value=a3
first                     column=cf:message, timestamp=1397109873612, value=hellp Hbase
second                    column=cf:foo, timestamp=1397112803662, value=3
second2                   column=cf:foo2, timestamp=1397112883691, value=3
third                     column=cf:bar, timestamp=1397109940598, value=3.14159
9 row(s) in 1.8090 seconds

이 게시물을

이 글의 추천인 목록 목록

번호	제목	날짜	조회 수
27	Caused by: java.net.URISyntaxException: Relative path in absolute URI: ${system:java.io.tmpdir%7D/$%7Bsystem:user.name%7D오류발생시 조치사항	2016.06.03	3429
26	hive 2.0.1 설치및 mariadb로 metastore 설정	2016.06.03	9020
25	CDH 5.4.4 버전에서 hive on tez (0.7.0)설치하기	2016.01.14	3370
24	Tracking URL = N/A 가발생하는 경우 - 환경설정값을 잘못설정하는 경우에 발생함	2015.06.17	4595
23	java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error: Unable to deserialize reduce input key from...오류해결방법	2015.06.16	5943
22	hive 0.13.1 설치 + meta정보는 postgresql 9.3에 저장	2015.04.30	4065
21	lateral view 예제	2014.09.18	3903
20	banana pi에 hive 0.13.1+mysql(metastore)설치	2014.09.09	4856
19	FAILED: IllegalStateException Variable substitution depth too large: 40 오류발생시 조치사항	2014.08.19	4584
18	hive job실행시 meta정보를 원격의 mysql에 저장하는 경우 설정방법	2014.05.28	5098
17	hive query에서 mapreduce돌리지 않고 select하는 방법	2014.05.23	4429
16	hiverserver2기동시 connection refused가 발생하는 경우 조치방법	2014.05.22	4641
15	hive에서 insert overwrite directory.. 로 하면 default column구분자는 'SOH'혹은 't'가 됨	2014.05.20	3590
14	dual table만들기	2014.05.16	4243
13	insert hbase by hive ... error occured after 5 hours..HMaster가 뜨지 않는 장애에 대한 복구 방법	2014.04.29	9666
12	index생성, 삭제, 활용	2014.04.25	4488
11	unique한 값 생성	2014.04.25	4447
10	sequence한 번호 생성방법	2014.04.25	4523
9	json serde사용법	2014.04.17	4535
8	json 값 다루기	2014.04.17	4217

쓰기 태그

첫 페이지 1 2 3 끝 페이지

Cloudera, BigData, Semantic IoT, Hadoop, NoSQL

Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

hive hive에서 생성된 external table에서 hbase의 table에 값 insert하기

댓글 0

Cloudera, BigData, Semantic IoT, Hadoop, NoSQL

Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.

hive hive에서 생성된 external table에서 hbase의 table에 값 insert하기

댓글 0

LOGIN