Recommended videos
CUDA course.
NVidia CUDA tutorial. Duration: 11 episodes.
Fundamentals of GPU Architecture
High-performance computing is now dominated by general-purpose graphics processing unit (GPGPU) oriented computations. How can we leverage our knowledge of C++ to program the GPU?
Overview of each generation of CUDA hardware from Tesla through Ampere