Speeding Up Deep Learning Inference Using NVIDIA TensorRT (Updated)

NVIDIA Corporation · July 20, 2021, 1:11 p.m.
This post was updated on July 20, 2021, to reflect NVIDIA TensorRT 8.0 updates. NVIDIA TensorRT is an SDK for deep learning inference. It provides APIs and parsers to import trained models from all major deep learning frameworks, then generates optimized runtime engines that can be deployed in the datacenter as well as in automotive and embedded environments. …