학술논문

Fault-tolerant real-time analytics with distributed Oracle Database In-memory
Document Type
Conference
Source
2016 IEEE 32nd International Conference on Data Engineering (ICDE) Data Engineering (ICDE), 2016 IEEE 32nd International Conference on. :1298-1309 May, 2016
Subject
Computing and Processing
Computer architecture
Servers
Distributed databases
Context
Real-time systems
Fault tolerance
Fault tolerant systems
real-time analytics
OLTAP
Oracle Database In-memory
distributed architecture
high availability
distributed in-memory fault tolerant analytics
Language
Abstract
Modern data management systems are required to address new breeds of OLTAP applications. These applications demand real time analytical insights over massive data volumes not only on dedicated data warehouses but also on live mainstream production environments where data gets continuously ingested and modified. Oracle introduced the Database In-memory Option (DBIM) in 2014 as a unique dual row and column format architecture aimed to address the emerging space of mixed OLTAP applications along with traditional OLAP workloads. The architecture allows both the row format and the column format to be maintained simultaneously with strict transactional consistency. While the row format is persisted in underlying storage, the column format is maintained purely in-memory without incurring additional logging overheads in OLTP. Maintenance of columnar data purely in memory creates the need for distributed data management architectures. Performance of analytics incurs severe regressions in single server architectures during server failures as it takes non-trivial time to recover and rebuild terabytes of in-memory columnar format. A distributed and distribution aware architecture therefore becomes necessary to provide real time high availability of the columnar format for glitch-free in-memory analytic query execution across server failures and additions, besides providing scale out of capacity and compute to address real time throughput requirements over large volumes of in-memory data. In this paper, we will present the high availability aspects of the distributed architecture of Oracle DBIM that includes extremely scaled out application transparent column format duplication mechanism, distributed query execution on duplicated in-memory columnar format, and several scenarios of fault tolerant analytic query execution across the in-memory column format at various stages of redistribution of columnar data during cluster topology changes.