Deep Learning Performance Software Engineer — Nvidia | cvGO!
Nvidia · China, Shanghai · Office
### About the Role Optimize deep learning framework performance on NVIDIA GPUs (Hopper, Blackwell). Work at the intersection of algorithms, systems programming, and hardware. ### Responsibilities - Optimize CUDA kernels and DL libraries (cuDNN, TensorRT) for maximum throughput. - Develop low-level G