AEQ: Alleviating Performance Interference through Intra-Queue and Inter-Queue I/O Isolation for NVMe-over-Fabrics
Wenhao Gu, Xuchao Xie, Dezun Dong, Weixia Xu
NVMe over Fabrics (NVMeoF) is a next-generation storage disaggregation protocol tailored for high-performance data centers that combine NVMe SSDs with interconnected networks. Its performance advantages stem from using multiple deep queue pairs to transfer protocol data units (PDUs) in parallel. However, as NVMe SSDs and network infrastructure continue to advance, achieving near-ideal performance in NVMeoF systems becomes increasingly challenging. This paper investigates key performance limitations of the native NVMeoF queue design, focusing on the unified read-write communication pipeline and on intra-queue and inter-queue I/O performance interference. To address these challenges, we propose AEQ, a co-designed NVMeoF queue pair architecture that improves the efficiency of data and command transmission for remote storage access. Specifically, we introduce AQ-RI, a switchable queue design that processes read and write requests differently; EQ-RI, an end-to-end packet processing system with dedicated I/O pipelines for distinct application types; and AQ-PI, a PDU isolation mechanism that ensures intra-queue I/O isolation. Experimental results show that AEQ significantly improves high-concurrency performance in NVMeoF storage networks, while also improving quality of service for both latency and throughput.