Overview

Meet Quantum Machines’ Open Acceleration Stack, a modular framework for hybridization of quantum and classical compute resources. It connects QM’s hybrid controller directly with classical accelerators (CPUs, GPUs, and more). The large selection of high-bandwidth integrations and time-bounded communications allows the Open Acceleration Stack to provide the necessary amount of classical compute resources and response latencies needed by your quantum systems. It closes the loop in microseconds, enabling quantum error correction, rapid calibrations, machine-learning routines, adaptive experiments, and paving the way to classically-accelerated quantum supercomputing.

main background
OPNIC QM
OPNIC

QM’ low-latency interface PCIe card handling data marshaling between hybrid controllers and any classical accelerators in microseconds

OPNIC empty
V80
FPGAs

Integrating dedicated FPGAs for decoding, such as Riverlane Deltaflow and AMD FPGA Decoder.

V80
GPUs

GPUs allow efficient machine learning algorithms for qubit calibrations and drift corrections. Ready with full NVQLink compliance

V80
V80
Superchips

When both CPU and GPU resources are needed, superchips are the solution. Integrate today NVIDIA GraceHopper, Grace Blackwell, and more

V80
V80
V80
CPUs

Integration of X86 Intel CPUs and AMD CPUs with 2-3 µs latency for all classical processing needs within quantum workflows

V80
V80
OPX1000

QM’s flagship hybrid controller

Highest channel density, Cutting-edge analog front-end, Adaptive control at any scale

V80
V80
V80
QM's Open Acceleration Stack
OPNIC
QECi Interface
FPGA DECODERS
NVIDIA Workstation GPU
RTX6000 Blackwell Max-Q
NVIDIA Data Center GPU
RTX6000 Blackwell Server
NVIDIA Superchips
GB200 NVL-4
CPU Intel XEON
CPU AMD EPYC
OPX1000
HPC
EXTERNAL DECODER

Mode:

Full server configuration

Powering Quantum Innovation in Leading Labs Worldwide

An heterogeneous architecture for hybrid quantum-classical supercomputers

The Open Acceleration Stack is a modular framework designed to tightly couple classical compute resources to QPUs with bounded-latency integration, so classical acceleration becomes part of the quantum runtime. At its core is the OPX1000 Hybrid Controller, powered by the Pulse Processing Unit. QM’s OPNIC couples the controller to any server, with CPUs, GPUs, FPGAs, and more, from a variety of vendors and any compute power. Boost your quantum system with classical acceleration for real-time calibration of qubit parameters and quantum error correction decoding at scale.

  • NVQLink-compliant, ready for quantum supercomputing.
  • Accelerators-QPU integration with roundtrip latency
of 2-4 µs.
  • Supports CUDA, CUDA-Q, C++, Python, & custom device calls.
  • 4-5U rack-mountable hot-swappable system, ready for HPC.
QM's open acceleration stack and its components

Improving quantum workflows: online calibrations with GPU/CPU reinforcement learning

Integrating quantum systems with classical resources is especially valuable for online calibrations, where powerful classical accelerators make it possible to use more computationally demanding methods. With implementations of QM’s Open Acceleration Stack, calibration routines run as a closed loop inside the experiment: the system coordinates shots, streams readout to the classical accelerator, evaluates optimizers or reinforcement-learning (RL) policies, then updates control parameters before the next shot. Connecting an OPX1000 hybrid controller with an NVIDIA GraceHopper, and considering a 3-qubits GHZ routine, we show the online RL optimizes 6 pulse parameters consistently exceeding the best pre-calibrated baseline, and correcting drifts that static routines miss.

The payoff is practical: fewer manual retunes, faster bring-up, higher and steadier fidelities across long runs, and a template that scales to large QPUs and more complex sequences. This demonstrates how tight GPU-CPU-QPU integration transforms calibration from a one-off, time-consuming step into continuous real-time optimization: clear evidence of the dawn of quantum-accelerated supercomputing.

  • 6-parameter tune-up with RL
on classical superchip, updated in real time
  • Few-µs runtimes, maximizing correction
bandwidth and QPU utilization
  • Consistently superior fidelity
and stability compared to baseline

Integrating classical resources for quantum error correction decoding at scale

Quantum error correction only works if the decoding speed keeps up with the error rate, and as QPUs scale up, the required classical computation becomes increasingly resource-intensive. QM’s Open Acceleration Stack allows QEC decoding to run on high-performance classical accelerators, so the heavy lifting happens on a powerful CPU-GPU server. The quantum system controller runs stabilizer rounds on the QPU, and streams error syndromes (measurement results) to the server. A separate kernel (e.g. CUDA kernel) decodes syndromes and computes corrections, sending the information back to the controller, where feed-forward operations close the loop. Between an OPX1000 and an NVIDIA GraceHopper, the communication roundtrip latency is kept below 4 µs, allowing the decoding time budget to remain below the 10-20 µs limit for useful decoding [Kurman, et al. arXiv:2412.00289 (2024)]. The result of using the Open Acceleration Stack to integrate acceleration servers into quantum sequences is real-time QEC that scales: larger codes, deeper rounds, and higher-throughput experiments without rewriting the stack.

Protocols can be written in CUDA-Q: a single, unified hybrid programming language that defines a quantum circuit, data streaming, device calls, and the quantum feedback to apply corrections. This keeps real-time quantum orchestration, decoding, and control in one place, aligned and fully compatible with NVIDIA NVQLink’s open model for bounded-latency quantum-classical integration. As codes grow and decoders evolve, classically accelerated QEC decoding offers the scale and flexibility needed for fault-tolerant quantum computing.

Key Features

The first system to integrate classical accelerators in quantum sequences, and demonstrate QEC workflows and AI-based calibrations on GPU.

Microsecond-level controller-accelerator roundtrip latencies (2-4 μs).

Natively supports programming in CUDA, CUDA-Q, C++, Python, and custom device calls.

Designed for any scale, from cost-efficient servers to cutting-edge data centers.

Benefits

Run the Most Advanced 
Hybrid Algorithms

The only solution with demonstrated support for quantum error correction and reinforcement-learning-based calibrations

Effortless Scalability

Add OPX1000 controllers as your QPU grows, and add accelerator servers to meet increasing computational demands

HPC & Data Center Ready

HPC-QC integration from the hardware quantum control layer and up, designed for data center reliability
with redundancies and hot swappable critical parts

Compatible with 
All Qubit Technologies

The Open Acceleration Stack supports superconducting qubits, spin qubits, neutral atoms, trapped ions, defect centers, photonic qubits and more

Wide Language Support

The Open Acceleration Stack natively supports programming in CUDA, CUDA-Q, C++, Python, and custom device calls, for co-development of hybrid workflows.