Introducing NVIDIA Dynamo, A Low-Latency Distributed Inference Framework for Scaling Reasoning AI Models

NVIDIA Corporation · March 18, 2025, 6:35 p.m.
Summary
NVIDIA has announced NVIDIA Dynamo, a new open-source framework for low-latency distributed inference, aimed at scaling reasoning AI models efficiently. The announcement was made at GTC 2025, and the framework promises high throughput and low latency for AI inference workloads.