Computational Storage
Data movement between storage and compute is a major bottleneck in data-driven applications. By executing compute kernels on the storage device instead of moving the data through the memory hierarchy to the CPU cache, throughput can be increased and energy consumption reduced. This application architecture can also lower total cost of ownership for data-intensive workloads such as genomics and data analytics.
Status
At CRSS we are researching a computational storage device interface simulator that allows a user application to offload many concurrent compute tasks to the device. The simulator provides a platform for further developing the interface between user applications, device drivers, and computational storage devices. Given the current lack of readily available hardware designs, it lets research in these areas progress in parallel: we can explore different models for using this type of hardware, including possible constraints imposed by the NVMe specification as well as multiple approaches to offloading compute tasks to the device.
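As a rough illustration of what such an offload interface might expose, the sketch below shows one plausible descriptor for a single offloaded compute task. The field names and layout are illustrative assumptions only; they are not drawn from the NVMe specification or from any particular device design.

```c
#include <stdint.h>

/*
 * Hypothetical descriptor for one offloaded compute task.
 * All fields are assumptions made for illustration.
 */
struct cs_task_desc {
    uint32_t kernel_id;   /* which pre-registered compute kernel to run */
    uint32_t flags;       /* e.g. synchronous vs. asynchronous completion */
    uint64_t input_slba;  /* starting LBA of the input data on the device */
    uint32_t input_nlb;   /* number of logical blocks to read as input */
    uint32_t arg_len;     /* length of an opaque kernel-argument blob */
    uint64_t result_addr; /* host DMA address for the (small) result */
    uint32_t result_len;  /* size of the result buffer */
    uint32_t tag;         /* caller-chosen tag to match completions */
};
```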
The simulator is built on the QEMU machine emulator. Running the Intel SPDK userspace NVMe driver atop the emulated QEMU device gives high-throughput access to the PCIe bus, and the NVMe I/O queueing system allows thousands of requests to be in flight simultaneously, letting applications exploit the high degree of parallelism inherent in many data-driven workloads. On top of the simulator we are developing an application framework for compute kernel offload, which gives us the opportunity to explore the different interfaces and synchronization mechanisms available for connecting a user application to the device. By exploring these approaches to device interface design, we aim to give application developers a straightforward way to make the program modifications needed to port existing data-driven applications to the computational storage interface, reducing the engineering effort required to realize these performance and efficiency gains. The system also lets us evaluate the scalability of kernel offloading techniques and the computational cost of synchronization between the host and the storage accelerator.
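The sketch below shows, under stated assumptions, how a batch of offload requests could be issued through the SPDK NVMe driver and completed by polling a queue pair. The SPDK calls used (spdk_nvme_ctrlr_cmd_io_raw, spdk_nvme_qpair_process_completions, spdk_dma_zmalloc) are part of the public SPDK API; the vendor-specific opcode 0xC0 and the use of cdw10/cdw11 to carry a kernel identifier and a task argument are assumptions for illustration only, not a description of our actual device interface.

```c
#include <stdio.h>
#include <spdk/env.h>
#include <spdk/nvme.h>

static void
offload_done(void *cb_arg, const struct spdk_nvme_cpl *cpl)
{
	unsigned *outstanding = cb_arg;

	if (spdk_nvme_cpl_is_error(cpl)) {
		fprintf(stderr, "offload command failed\n");
	}
	(*outstanding)--;  /* one fewer request in flight */
}

/* Submit `count` hypothetical offload requests on one qpair, then poll until all complete. */
static int
submit_offload_batch(struct spdk_nvme_ctrlr *ctrlr, struct spdk_nvme_qpair *qpair,
		     uint32_t nsid, unsigned count)
{
	unsigned outstanding = 0;

	for (unsigned i = 0; i < count; i++) {
		/* Per-request DMA-able result buffer (leaked here for brevity). */
		void *result = spdk_dma_zmalloc(4096, 0x1000, NULL);
		if (result == NULL) {
			return -1;
		}

		struct spdk_nvme_cmd cmd = {0};
		cmd.opc = 0xC0;   /* assumed vendor-specific "execute kernel" opcode */
		cmd.nsid = nsid;
		cmd.cdw10 = 42;   /* assumed: kernel id */
		cmd.cdw11 = i;    /* assumed: per-task argument */

		int rc = spdk_nvme_ctrlr_cmd_io_raw(ctrlr, qpair, &cmd, result, 4096,
						    offload_done, &outstanding);
		if (rc != 0) {
			spdk_dma_free(result);
			return rc;
		}
		outstanding++;
	}

	/* Busy-poll the completion queue; multiple qpairs could be polled in parallel. */
	while (outstanding > 0) {
		spdk_nvme_qpair_process_completions(qpair, 0);
	}
	return 0;
}
```

Polling-based completion, as SPDK uses, trades host CPU cycles for low latency; this is one of the host-device synchronization costs the simulator lets us measure against interrupt-driven or batched alternatives.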