메뉴 건너뛰기

Cloudera, BigData, Semantic IoT, Hadoop, NoSQL

Cloudera CDH/CDP 및 Hadoop EcoSystem, Semantic IoT등의 개발/운영 기술을 정리합니다. gooper@gooper.com로 문의 주세요.


Hive가 "GROUP BY"에서 최적화하면서 내부적으로 컬럼을 추가하여 reducer로 보내는데 이 컬럼은 실존하지 않는 컬럼으로 문제가 발생할 수 있음
hive 0.14.0에서 default가 disabled되었는데 이전 버전에서는 default가 enabled로 되어 있어서 문제가 발생하면
set hive.optimize.sort.dynamic.partition=false;
를 추가하여 준다.

-----오류내용----
2015-06-15 14:12:01,345 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1434344795088_0005_r_000000 Task Transitioned from SCHEDULED to RUNNING
2015-06-15 14:12:02,327 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerRequestor: getResources() for application_1434344795088_0005: ask=1 release= 0 newContainers=0 finishedContainers=0 resourcelimit=<memory:0, vCores:0> knownNMs=5
2015-06-15 14:12:03,067 INFO [Socket Reader #1 for port 47209] SecurityLogger.org.apache.hadoop.ipc.Server: Auth successful for job_1434344795088_0005 (auth:SIMPLE)
2015-06-15 14:12:03,079 INFO [IPC Server handler 2 on 47209] org.apache.hadoop.mapred.TaskAttemptListenerImpl: JVM with ID : jvm_1434344795088_0005_r_000003 asked for a task
2015-06-15 14:12:03,079 INFO [IPC Server handler 2 on 47209] org.apache.hadoop.mapred.TaskAttemptListenerImpl: JVM with ID: jvm_1434344795088_0005_r_000003 given task: attempt_1434344795088_0005_r_000000_0
2015-06-15 14:12:04,185 INFO [IPC Server handler 1 on 47209] org.apache.hadoop.mapred.TaskAttemptListenerImpl: MapCompletionEvents request from attempt_1434344795088_0005_r_000000_0. startIndex 0 maxEvents 10000
2015-06-15 14:12:04,363 INFO [IPC Server handler 2 on 47209] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1434344795088_0005_r_000000_0 is : 0.0
2015-06-15 14:12:04,382 INFO [IPC Server handler 6 on 47209] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1434344795088_0005_r_000000_0 is : 0.0
2015-06-15 14:12:04,769 INFO [IPC Server handler 7 on 47209] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Progress of TaskAttempt attempt_1434344795088_0005_r_000000_0 is : 0.0
2015-06-15 14:12:04,772 FATAL [IPC Server handler 9 on 47209] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Task: attempt_1434344795088_0005_r_000000_0 - exited : java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error: Unable to deserialize reduce input key from x1x50x48x49x53x45x48x54x45x49x53x32x49x52x58x49x49x58x52x53x0x1x50x48x49x53x45x48x54x45x48x51x0x1x50x48x49x53x45x48x54x45x48x51x32x49x53x58x49x57x58x48x49x46x0x1x50x50x48x46x57x48x46x50x49x54x46x55x51x0x1x104x116x116x112x58x47x47x50x50x48x46x57x48x46x50x0x1x71x69x84x0x1x187x154x203x138x255 with properties {columns=reducesinkkey0,reducesinkkey1,reducesinkkey2,reducesinkkey3,reducesinkkey4,reducesinkkey5,reducesinkkey6, serialization.lib=org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe, serialization.sort.order=+++++++, columns.types=string,string,string,string,string,string,string}
	at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:283)
	at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:444)
	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:392)
	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:415)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614)
	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error: Unable to deserialize reduce input key from x1x50x48x49x53x45x48x54x45x49x53x32x49x52x58x49x49x58x52x53x0x1x50x48x49x53x45x48x54x45x48x51x0x1x50x48x49x53x45x48x54x45x48x51x32x49x53x58x49x57x58x48x49x46x0x1x50x50x48x46x57x48x46x50x49x54x46x55x51x0x1x104x116x116x112x58x47x47x50x50x48x46x57x48x46x50x0x1x71x69x84x0x1x187x154x203x138x255 with properties {columns=reducesinkkey0,reducesinkkey1,reducesinkkey2,reducesinkkey3,reducesinkkey4,reducesinkkey5,reducesinkkey6, serialization.lib=org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe, serialization.sort.order=+++++++, columns.types=string,string,string,string,string,string,string}
	at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:222)
	... 7 more
Caused by: org.apache.hadoop.hive.serde2.SerDeException: java.io.EOFException
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:189)
	at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:220)
	... 7 more
Caused by: java.io.EOFException
	at org.apache.hadoop.hive.serde2.binarysortable.InputByteBuffer.read(InputByteBuffer.java:54)
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserializeText(BinarySortableSerDe.java:550)
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:285)
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:185)
	... 8 more

2015-06-15 14:12:04,772 INFO [IPC Server handler 9 on 47209] org.apache.hadoop.mapred.TaskAttemptListenerImpl: Diagnostics report from attempt_1434344795088_0005_r_000000_0: Error: java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error: Unable to deserialize reduce input key from x1x50x48x49x53x45x48x54x45x49x53x32x49x52x58x49x49x58x52x53x0x1x50x48x49x53x45x48x54x45x48x51x0x1x50x48x49x53x45x48x54x45x48x51x32x49x53x58x49x57x58x48x49x46x0x1x50x50x48x46x57x48x46x50x49x54x46x55x51x0x1x104x116x116x112x58x47x47x50x50x48x46x57x48x46x50x0x1x71x69x84x0x1x187x154x203x138x255 with properties {columns=reducesinkkey0,reducesinkkey1,reducesinkkey2,reducesinkkey3,reducesinkkey4,reducesinkkey5,reducesinkkey6, serialization.lib=org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe, serialization.sort.order=+++++++, columns.types=string,string,string,string,string,string,string}
	at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:283)
	at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:444)
	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:392)
	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:415)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614)
	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error: Unable to deserialize reduce input key from x1x50x48x49x53x45x48x54x45x49x53x32x49x52x58x49x49x58x52x53x0x1x50x48x49x53x45x48x54x45x48x51x0x1x50x48x49x53x45x48x54x45x48x51x32x49x53x58x49x57x58x48x49x46x0x1x50x50x48x46x57x48x46x50x49x54x46x55x51x0x1x104x116x116x112x58x47x47x50x50x48x46x57x48x46x50x0x1x71x69x84x0x1x187x154x203x138x255 with properties {columns=reducesinkkey0,reducesinkkey1,reducesinkkey2,reducesinkkey3,reducesinkkey4,reducesinkkey5,reducesinkkey6, serialization.lib=org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe, serialization.sort.order=+++++++, columns.types=string,string,string,string,string,string,string}
	at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:222)
	... 7 more
Caused by: org.apache.hadoop.hive.serde2.SerDeException: java.io.EOFException
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:189)
	at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:220)
	... 7 more
Caused by: java.io.EOFException
	at org.apache.hadoop.hive.serde2.binarysortable.InputByteBuffer.read(InputByteBuffer.java:54)
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserializeText(BinarySortableSerDe.java:550)
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:285)
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:185)
	... 8 more

2015-06-15 14:12:04,773 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: Diagnostics report from attempt_1434344795088_0005_r_000000_0: Error: java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error: Unable to deserialize reduce input key from x1x50x48x49x53x45x48x54x45x49x53x32x49x52x58x49x49x58x52x53x0x1x50x48x49x53x45x48x54x45x48x51x0x1x50x48x49x53x45x48x54x45x48x51x32x49x53x58x49x57x58x48x49x46x0x1x50x50x48x46x57x48x46x50x49x54x46x55x51x0x1x104x116x116x112x58x47x47x50x50x48x46x57x48x46x50x0x1x71x69x84x0x1x187x154x203x138x255 with properties {columns=reducesinkkey0,reducesinkkey1,reducesinkkey2,reducesinkkey3,reducesinkkey4,reducesinkkey5,reducesinkkey6, serialization.lib=org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe, serialization.sort.order=+++++++, columns.types=string,string,string,string,string,string,string}
	at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:283)
	at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:444)
	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:392)
	at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:415)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614)
	at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163)
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error: Unable to deserialize reduce input key from x1x50x48x49x53x45x48x54x45x49x53x32x49x52x58x49x49x58x52x53x0x1x50x48x49x53x45x48x54x45x48x51x0x1x50x48x49x53x45x48x54x45x48x51x32x49x53x58x49x57x58x48x49x46x0x1x50x50x48x46x57x48x46x50x49x54x46x55x51x0x1x104x116x116x112x58x47x47x50x50x48x46x57x48x46x50x0x1x71x69x84x0x1x187x154x203x138x255 with properties {columns=reducesinkkey0,reducesinkkey1,reducesinkkey2,reducesinkkey3,reducesinkkey4,reducesinkkey5,reducesinkkey6, serialization.lib=org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe, serialization.sort.order=+++++++, columns.types=string,string,string,string,string,string,string}
	at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:222)
	... 7 more
Caused by: org.apache.hadoop.hive.serde2.SerDeException: java.io.EOFException
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:189)
	at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.reduce(ExecReducer.java:220)
	... 7 more
Caused by: java.io.EOFException
	at org.apache.hadoop.hive.serde2.binarysortable.InputByteBuffer.read(InputByteBuffer.java:54)
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserializeText(BinarySortableSerDe.java:550)
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:285)
	at org.apache.hadoop.hive.serde2.binarysortable.BinarySortableSerDe.deserialize(BinarySortableSerDe.java:185)
	... 8 more
번호 제목 날짜 조회 수
270 [번역] solr 검색 엔진 튜토리얼 2014.10.07 3633
269 hive기동시 Caused by: java.net.URISyntaxException: Relative path in absolute URI: ${system:java.io.tmpdir%7D/$%7Bsystem:user.name%7D 오류 발생시 조치사항 2016.09.25 3631
268 jena/fuseki 3.4.0 설치 2017.07.25 3630
267 federated query 예제 2017.01.19 3627
266 crypto관련 기생충 박멸 스크립트 2018.05.11 3626
265 producer / consumer구현시 설정 옵션 설명 2016.10.19 3621
264 프로그래밍 언어별 딥러닝 라이브러리 정리 file 2016.10.05 3615
263 python3.5에서 numpy버젼에 따른 문제점을 조치하는 방법및 pymysql import할때 오류 발생시 조치사항 2017.09.28 3614
262 jena jar파일실행시 org.apache.jena.tdb.TDB.init에서 java.lang.NullPointerException발생시 조치사항 2016.08.19 3611
261 Embedded PostgreSql설정을 외부의 MariaDB로변경하기 [1] 2018.05.22 3607
260 git 초기화(Windows에서 Git Bash사용) 2016.11.17 3607
259 update를 많이 하면 heap memory가 많이 소진되고 최종적으로 OOM가 발생하는데 이에 대한 설명 2017.04.10 3606
258 이미지 관리 오픈소스 목록 2018.03.11 3600
257 maven을 이용하여 Hello world 서비스 자동 생성시 HelloServiceImpl.java에서 사용하는 getMessage() 와 getName() 이 정의되지 않은 오류가 발생시 조치방법 2018.01.19 3599
256 RDF4J의 rdf4j-server.war가 제공하는 RESTFul API를 이용하여 repository에 CRUD테스트 2017.08.30 3592
255 hive에서 insert overwrite directory.. 로 하면 default column구분자는 'SOH'혹은 't'가 됨 2014.05.20 3590
254 CDH 5.14.2 설치중 agent설치에서 실패하는 경우 확인/조치 2018.05.22 3576
253 spark notebook 0.7.0설치및 설정 2016.11.14 3573
252 spark submit용 jar파일을 만드는 sbt 용 build.sbt설정 파일(참고용) 2016.08.19 3561
251 kafkaWordCount.scala의 producer와 consumer 클래스를 이용하여 kafka를 이용한 word count 테스트 하기 2016.08.02 3561
위로