Part of the Technology photoes in this website are created by rawpixel.com - www.freepik.com

Revolutionizing Data-Intensive Applications: Unleashing the Power of CXL Memory Chassis and FAMFS

477

In today's data-intensive application environment, the demand for high-performance and scalable memory solutions has become crucial. To address the insufficient DDR memory resources and performance limitations of traditional systems, H3 Platform has introduced the industry's first CXL 2.0 memory chassis, in collaboration with Micron, showcased the FAMFS shared-memory file system at the AIC booth during FMS 2024. This innovative memory-sharing solution is designed to provide memory pooling and sharing capabilities for database, machine learning, and artificial intelligence workloads.

 

Solution Architecture 


Our CXL Memory Sharing solution significantly enhances memory capacity, achieving up to 5.5TB of bandwidth per x16 PCIe lane through the disaggregation and sharing of memory resources. This breakthrough not only offers a 10x increase in memory capacity but also enables CXL memory to be directly exposed as a file system to applications without altering the existing application architecture. 

In this setup, CXL memory modules are aggregated into a CXL Memory Chassis, supporting up to 22 E3.S modules with a total capacity of 5.5TB. These modules are connected to 4 hosts via PCIe Gen5 x16 lanes. 

On the host side, the FAMFS file system mounts these CXL memory modules as DAX devices, allowing applications like RocksDB and other in-memory databases to perform read and write operations directly on CXL memory modules. RocksDB can seamlessly utilize these DAX devices and run the file system on them without any modifications, demonstrating compatibility between CXL memory and RocksDB. Moreover, the seamless integration of Linux/KVM with FAMFS further enhances the application of CXL memory in RocksDB, making this integrated solution highly effective for modern data-intensive workloads.

 

Benchmark Results 

The provided benchmark chart clearly illustrates the performance of RocksDB under different storage solutions, particularly comparing the performance differences between using XFS-SSD and shared CXL RAM (FAMFS) in random read operations.

 

Chart Analysis:

Operations Per Second (Ops/Sec): The upper chart shows the number of operations per second in random read operations as the database size approaches and exceeds the system RAM capacity. When using XFS-SSD fully cached in system RAM, performance drops sharply as the database size surpasses the system RAM capacity, reducing operations from nearly 200,000 to almost zero. In contrast, the FAMFS solution using shared CXL RAM maintains stable performance even when the database size exceeds the system RAM capacity. 

P99 Latency: The lower chart shows the P99 latency trends in random read operations. When using XFS-SSD fully cached in system RAM, latency increases dramatically as the database size surpasses the system RAM capacity, rising from hundreds of microseconds to tens of thousands. In contrast, the FAMFS solution using shared CXL RAM maintains low latency even when the database size exceeds the system RAM capacity, demonstrating exceptional performance stability.

These benchmark results highlight the advantages of combining CXL memory chassis with FAMFS in handling RocksDB workloads. Compared to traditional XFS-SSD solutions, the FAMFS solution using shared CXL RAM can maintain stable operational performance and low latency even when the database exceeds system RAM capacity, making it an ideal choice for data-intensive applications.

 

Potential Benefits

These results not only demonstrate the system's flexibility and stability under different data loads but also underscore the potential and practical value of CXL technology in enhancing database application performance. For application scenarios requiring high performance under heavy load conditions, the combination of CXL memory chassis and FAMFS is undoubtedly a great solution. 

Integrating CXL chassis and FAMFS enhances performance and delivers significant total cost of ownership (TCO) benefits. By freeing up system RAM originally allocated for RocksDB execution, system resources can be utilized more efficiently for broader general-purpose computing. Additionally, RocksDB data in CXL memory can be shared and pooled across other RocksDB nodes or virtual machines, further increasing system flexibility. The higher CXL capacity also reduces data transfers across storage networks, thereby lowering system power consumption.

 

Extended Applications

This solution is also applicable to various other data-intensive applications such as Apache Arrow, Jupyter, NumPy, Pandas, Dask, PyTorch, DuckDB, Apache Spark, Ray, Velox, and more, fully demonstrating its potential in high-performance computing and data processing scenarios.

 

The joint solution by H3 Platform and Micron, through introducing the CXL 2.0 memory chassis and FAMFS file system, has successfully addressed the insufficient DDR memory resources and performance challenges faced by traditional systems in data-intensive applications. This innovative technology significantly enhances memory capacity and performance while achieving compatibility with existing applications through simple integration. Particularly in handling workloads such as RocksDB in in-memory databases, the combination of CXL memory and FAMFS showcases exceptional performance and stability, making it an indispensable solution for future data-intensive applications.

Product InfoCXL Memory Pooling SolutionCXL Memory Sharing Architecture

category : CXL
tags :