Parametric I/O Model

Problem

Over the past few decades, storage and main-memory devices have evolved drastically, offering extremely fast read/write access, increased storage, lower cost, lower latency and energy consumption, and higher density. Consequently, these devices (e.g. Solid State Drive aka SSD) do not share some properties (i.e. read/write asymmetry, internal parallelism) with the classical rotating hard disk drives (HDD). These properties are crucial because only by utilizing these properties, the full benefits of the modern devices can be achieved.

The traditional I/O model (EM Model) was developed based on the properties of HDDs, and has not evolved accordingly while it is still widely used. The EM Model is based on the assumption that read and write accesses have the same cost (symmetric cost) whereas most modern flash devices and emerging non-volatile memory (NVM) technologies show asymmetric cost - writes are significantly slower compared to reads. In addition, most of the contemporary storage devices provide some degree of access concurrency which can not be modeled with the EM Model because it considers only one I/O at a time. The EM Model can not capture these key properties (read-write asymmetry, access concurrency) of modern memory technologies, thus unable to comprehend the full benefits of contemporary devices.

Algorithms

New buffer pool page eviction algorithms:

For more details on each algorithm, click on the buttons below:

Parametric I/O Model in Action

Comparison of Algorithms

Workload
Buffer size
Disk size
Read Percentage
%
# of Operations

Maximum: 100,000
Asymmetry
Skewness
%
on data
%

Base Algorithm
Chart type


The runtime of COW algorithms are faster than the baseline with very little or no increase in buffer miss and total writes. The general trend is that COW(n) and COW-X(n) perform better than COW(n,E) and COW-X(n,E) because the formers retain more pages in the bufferpool.


Variations in Read/Write Ratio

Base Algorithm
Chart type


Write-intensive workload has more gains from the proposed algorithms because the algorithms have to write more pages back to disk, hence the benefits from concurrent writing are pronounced.


Variations in Buffer Pool Ratio

Base Algorithm
Chart type


COW algorithms performs significantly well under heavy memory pressure because smaller bufferpool results in higher writes.


For more details, check our paper!

The Need for a New I/O Model

In Proceedings of the Annual Conference on Innovative Data Systems Research (CIDR), 2021 (abstract)

Meet The Team

Tarikul Islam Papon

PhD Researcher

Subin (Rachael) Kim

Intern

Manos Athanassoulis

Assistant Professor