학술논문

Apache Spark: A Unified Engine for Big Data Processing.
Document Type
Article
Source
Communications of the ACM. Nov2016, Vol. 59 Issue 11, p56-65. 10p. 3 Color Photographs, 3 Diagrams, 4 Charts, 3 Graphs.
Subject
*Big data
*Open source products
*Software frameworks
*Distributed computing
Image processing software
Language
ISSN
0001-0782
Abstract
The article discusses the open source computing framework, Apache Spark, which unifies streaming, batch, and interactive big data workloads to unlock new applications. Topics include Spark's use of an RDD programming model, the use of Spark in diverse applications such as batch processing and image processing, and the additional cost of Spark over other specialized systems due to fault tolerance.