Falcon 6048

Overview

GPU-Managed Data Access

The Falcon 6048 uses a GPU-accelerated memory/storage architecture that expands effective GPU capacity with high-IOPS NVMe SSDs. By moving data between GPU HBM and NVMe storage over a high-speed PCIe fabric, the system maximizes compute utilization and maintains ultra-low latency for massive-scale AI and HPC workloads.

Breakthrough Performance: More than 200M IOPS with Gen6 Fabric

Powered by two Broadcom PCIe Gen6 switches and 44 E1.S NVMe SSDs, the Falcon 6048 achieves up to 200 million IOPS, delivering ultra-low latency and massive parallel throughput for AI training, inference, prediction, and large-scale data analytics.
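As a rough sanity check on the headline figures quoted above, the short sketch below divides the aggregate IOPS across the quoted drive count and converts it to bandwidth for a given I/O size. The 512-byte I/O size used in the example run is an assumption for illustration only; the I/O size behind the 200 million figure is not stated here.

```python
# Back-of-the-envelope arithmetic on the headline numbers quoted above.
AGGREGATE_IOPS = 200_000_000  # "up to 200 million IOPS" (from the text)
SSD_COUNT = 44                # E1.S NVMe SSDs in the quoted configuration


def per_drive_iops(total_iops: int, drives: int) -> float:
    """Average IOPS each drive must sustain to reach the aggregate figure."""
    return total_iops / drives


def bandwidth_gb_per_s(total_iops: int, io_size_bytes: int) -> float:
    """Aggregate bandwidth in GB/s (decimal) for a given I/O size."""
    return total_iops * io_size_bytes / 1e9


if __name__ == "__main__":
    # ASSUMPTION: 512-byte I/Os; the source does not specify the I/O size.
    assumed_io_size = 512
    print(f"per-drive IOPS: {per_drive_iops(AGGREGATE_IOPS, SSD_COUNT):,.0f}")
    print(f"bandwidth at {assumed_io_size} B: "
          f"{bandwidth_gb_per_s(AGGREGATE_IOPS, assumed_io_size):.1f} GB/s")
```

Changing `assumed_io_size` shows how strongly the implied bandwidth depends on the I/O size, which is why an IOPS figure is best read together with its test conditions.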

Built for cuVS and Graph Neural Network (GNN) Acceleration

Optimized for NVIDIA cuVS and the WholeGraph framework, the Falcon 6048 delivers significant performance gains for vector database operations and GNN workloads, with native GPU acceleration for the entire embedding-to-retrieval pipeline. Combined with support for large-scale GNN training, this lets developers build highly responsive AI systems capable of handling massive, unstructured datasets.

Management Interface	Redfish®, RESTful API, GUI and CLI
System Management	Dashboard for SSD utilization, performance, and other information; predictive health monitoring; role-based authentication and access control
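Because the management interface is listed as Redfish, a client would typically start from the standard Redfish service root (`/redfish/v1/`) and follow `@odata.id` links through collections. The following is a minimal sketch of that pattern using only the Python standard library. The host name `bmc.example.com`, the sample payload, and the use of HTTP Basic authentication are assumptions for illustration; which resources and authentication modes this product exposes is not stated here.

```python
import base64
import json
import urllib.request

REDFISH_ROOT = "/redfish/v1/"  # service root path defined by the Redfish standard


def redfish_url(host: str, path: str = REDFISH_ROOT) -> str:
    """Build an HTTPS URL for a Redfish resource on the given BMC host."""
    return f"https://{host}/{path.lstrip('/')}"


def member_paths(collection: dict) -> list[str]:
    """Return the @odata.id link of every member in a Redfish collection."""
    return [member["@odata.id"] for member in collection.get("Members", [])]


def fetch(host: str, path: str, user: str, password: str) -> dict:
    """GET a Redfish resource with HTTP Basic auth (assumed to be enabled)."""
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    request = urllib.request.Request(
        redfish_url(host, path),
        headers={"Authorization": f"Basic {token}", "Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.load(response)


if __name__ == "__main__":
    # Hypothetical payload shaped like a Redfish collection; not captured
    # from a real Falcon 6048.
    sample = {"Members": [{"@odata.id": "/redfish/v1/Chassis/1"}]}
    print(redfish_url("bmc.example.com", "/redfish/v1/Chassis"))
    print(member_paths(sample))
```

The example run only exercises the URL and parsing helpers, so it works without a BMC on the network; `fetch` is the piece that would talk to real hardware, and a BMC with a self-signed certificate would additionally need an appropriate TLS context.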

Model Name	Falcon 6048
BMC	Aspeed AST2600 Advanced PCIe Graphics & Remote Management Processor
CPU	Single Intel® Gen6 Xeon® Scalable Processor (Granite Rapids CPU)
PCIe Switch	Broadcom PEX 90144 PCIe 6.0 Switch
Memory	Eight (8) DDR5-6400 RDIMM slots; maximum capacity: 1TB
NVMe SSD	PCIe fabric: up to forty-eight (48) E1.S 15mm NVMe SSDs; CPU: two (2) U.2 (PCIe Gen5)
PCIe Slot	GPU slots (full-height, full-length, double-width): two (2) PCIe 6.0 x16 slots (up to 650W GPU); network card slots (half-height, half-length, single-width): up to four (4) PCIe 6.0 x16 slots (up to 75W NIC)
Power	3200W (3+1 redundant) power supply; 80+ Titanium
Fan	Ten (10) 60x56mm fans; hot-swappable
LAN	One (1) RJ45 GbE connector for the dedicated BMC management port; two (2) RJ45 1GbE connectors
Environmental Spec.	Storage temperature: -10°C (14°F) to 60°C (140°F); operating temperature: 0°C (32°F) to 35°C (95°F); storage/operating humidity: 5% to 95% (non-condensing)
Dimensions	3U; 129.6 (H) x 438 (W) x 850 (D) mm

Accelerator	NVIDIA H200, H100; NVIDIA RTX Pro 6000
Network Cards	NVIDIA ConnectX-7, ConnectX-8, ConnectX-9
