Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 1

NVIDIA Corporation · July 27, 2021, 8:49 p.m.
Most CUDA developers are familiar with the cudaMalloc and cudaFree API functions for allocating and freeing GPU-accessible memory. However, these API functions have a long-standing limitation: they aren’t stream ordered. In this post, we introduce new API functions, cudaMallocAsync and cudaFreeAsync, that make memory allocation and deallocation stream-ordered operations. In part …
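To make "stream ordered" concrete, here is a minimal sketch of how an allocation, a kernel launch, and a free can all be enqueued on one stream. It assumes CUDA 11.2 or later and a device that supports memory pools. The kernel `scale` and the helper `stream_ordered_roundtrip` are hypothetical names made up for this illustration; only the `cuda*` runtime calls come from the CUDA API.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical kernel for illustration: multiplies each element by `factor`.
__global__ void scale(float* data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

// Hypothetical helper: allocates, computes, and frees on a single stream.
// Returns true if every call succeeded and the results are as expected.
bool stream_ordered_roundtrip(int n) {
    std::vector<float> host(n, 1.0f);
    const size_t bytes = static_cast<size_t>(n) * sizeof(float);

    cudaStream_t stream;
    if (cudaStreamCreate(&stream) != cudaSuccess) return false;

    float* dev = nullptr;
    // The allocation is enqueued on `stream`; work submitted to the same
    // stream afterwards can use the pointer safely.
    bool ok = cudaMallocAsync(reinterpret_cast<void**>(&dev), bytes, stream) == cudaSuccess;
    if (ok) {
        ok = cudaMemcpyAsync(dev, host.data(), bytes,
                             cudaMemcpyHostToDevice, stream) == cudaSuccess;
        if (ok) {
            const int threads = 256;
            const int blocks = (n + threads - 1) / threads;
            scale<<<blocks, threads, 0, stream>>>(dev, n, 2.0f);
            ok = cudaGetLastError() == cudaSuccess;
        }
        if (ok) {
            ok = cudaMemcpyAsync(host.data(), dev, bytes,
                                 cudaMemcpyDeviceToHost, stream) == cudaSuccess;
        }
        // The free is also enqueued on `stream`, so it takes effect only
        // after the preceding copy has completed in stream order.
        ok = (cudaFreeAsync(dev, stream) == cudaSuccess) && ok;
    }

    // Wait for all enqueued work before reading `host` on the CPU.
    ok = (cudaStreamSynchronize(stream) == cudaSuccess) && ok;
    cudaStreamDestroy(stream);

    if (!ok) return false;
    for (float v : host) {
        if (v != 2.0f) return false;
    }
    return true;
}
```

Calling `stream_ordered_roundtrip(1 << 20)` from a `main` function and compiling with `nvcc` should exercise the whole sequence. Note that nothing between the allocation and the final `cudaStreamSynchronize` forces the host to wait for the device, whereas `cudaFree` would typically synchronize the device at that point.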