The largest independent dev blog feed.

We surface the best developer writing from thousands of independent blogs, updated daily. The open web is worth fighting for.

Learn more

Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 1

1 · NVIDIA Corporation · July 27, 2021, 8:49 p.m.

AI / Deep Learning HPC memory allocation

Summary

Most CUDA developers are familiar with the cudaMalloc and cudaFree API functions to allocate GPU accessible memory. However, there has long been an obstacle with these API functions: they aren’t stream ordered. In this post, we introduce new API functions, cudaMallocAsync and cudaFreeAsync, that enable memory allocation and deallocation to be stream-ordered operations. In part … Continued...

Read full post on developer.nvidia.com →

AUTHOR