학술논문

MapReduce research on warehousing of big data
Document Type
Conference
Source
2017 40th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO) Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2017 40th International Convention on. :1361-1366 May, 2017
Subject
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Photonics and Electrooptics
Power, Energy and Industry Applications
Signal Processing and Analysis
Big Data
Data warehouses
NoSQL databases
Warehousing
Structured Query Language
Google
Language
Abstract
The growth of social networks and affordability of various sensing devices has lead to a huge increase of both human and non-human entities that are interconnected via various networks, mostly Internet. All of these entities generate large amounts of various data, and BI analysts have realized that such data contain knowledge that can no longer be ignored. However, traditional support for extraction of knowledge from mostly transactional data - data warehouse - can no longer cope with large amounts of fast incoming various, unstructured data - big data - and is facing a paradigm shift. Big data analytics has become a very active research area in the last few years, as well as the research of underlying data organization that would enhance it, which could be addressed as big data warehousing. One research direction is enhancing data warehouse with new paradigms that have proven to be successful at handling big data. Most popular of them is the MapReduce paradigm. This paper provides an overview on research and attempts to incorporate MapReduce with data warehouse in order to empower it for handling of big data.