Notes on development and operations for Cloudera CDH/CDP, the Hadoop ecosystem, Semantic IoT, and related technologies. For inquiries, contact gooper@gooper.com.

Using countByWindow in Spark with Scala (example)

Administrator 2018.03.08 14:26 Views: 4422

import org.apache.spark.SparkContext
import org.apache.spark.streaming.StreamingContext
import org.apache.spark.streaming.Seconds

object StreamingLogsMB {

  def main(args: Array[String]): Unit = {
    if (args.length < 2) {
      System.err.println("Usage: stubs.StreamingLogsMB <hostname> <port>")
      System.exit(1)
    }

    // Get hostname and port of data source from application arguments
    val hostname = args(0)
    val port = args(1).toInt

    // Create a Spark Context
    val sc = new SparkContext()

    // Set log level to ERROR to avoid distracting extra output
    sc.setLogLevel("ERROR")

    // Configure the Streaming Context with a 1 second batch duration
    val ssc = new StreamingContext(sc, Seconds(1))

    // Create a DStream of log data from the server and port specified
    val logs = ssc.socketTextStream(hostname, port)

    // countByWindow reduces incrementally (with an inverse function), which requires checkpointing
    ssc.checkpoint("logcheckpt")

    // Every 2 seconds, print the number of lines received over the last 5 seconds
    logs.countByWindow(Seconds(5), Seconds(2)).print()

    ssc.start()
    ssc.awaitTermination()
  }
}
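
To make the window arithmetic concrete, here is a minimal plain-Scala sketch (no Spark required) of roughly what countByWindow(Seconds(5), Seconds(2)) produces on top of 1-second batches. The names WindowCountSketch and windowedCounts and the sample batch sizes are hypothetical and exist only for illustration; this is not how Spark implements the operation internally.

```scala
// Plain-Scala sketch of sliding-window counting over fixed-size batches.
// WindowCountSketch, windowedCounts, and the sample data are hypothetical.
object WindowCountSketch {

  // batchCounts(i) is the number of records in the i-th 1-second batch.
  // A result is emitted every `slide` batches and sums the last `window`
  // batches (fewer at the start, before a full window has arrived).
  def windowedCounts(batchCounts: Seq[Long], window: Int, slide: Int): Seq[Long] =
    (slide to batchCounts.length by slide).map { end =>
      batchCounts.slice(math.max(0, end - window), end).sum
    }

  def main(args: Array[String]): Unit = {
    // Eight 1-second batches with 3, 1, 4, 1, 5, 9, 2, 6 log lines each
    val batches = Seq(3L, 1L, 4L, 1L, 5L, 9L, 2L, 6L)
    // Window of 5 batches, sliding every 2 batches: counts 4, 9, 20, 23
    println(windowedCounts(batches, window = 5, slide = 2))
  }
}
```

Because the window (5 seconds) is longer than the slide (2 seconds), consecutive results overlap and the same log line is counted in more than one window. In the streaming job, both durations must also be multiples of the 1-second batch interval.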
