학술논문
Apache Spark: A Unified Engine for Big Data Processing.
Document Type
Article
Author
Source
Subject
*Big data
*Open source products
*Software frameworks
*Distributed computing
Image processing software
*
*
*
Language
ISSN
0001-0782
Abstract
The article discusses the open source computing framework, Apache Spark, which unifies streaming, batch, and interactive big data workloads to unlock new applications. Topics include Spark's use of an RDD programming model, the use of Spark in diverse applications such as batch processing and image processing, and the additional cost of Spark over other specialized systems due to fault tolerance.