Deep Learning Performance Architect — Nvidia | cvGO!
Nvidia · 2 Locations · Office
### About the Role

Optimize deep learning model performance on Nvidia GPUs, bridging algorithms, systems software, and hardware.

### Responsibilities

- Profile DL models to identify performance bottlenecks.
- Develop and deploy CUDA kernel, library (cuDNN, cuBLAS), and framework (PyTorch, TensorFlow