We are looking for a kernel-focused engineer to lead efforts in writing, porting, and optimizing GPU kernels used in inference workloads.... As part of the inference team, you'll be responsible for unlocking every last FLOP from our GPUs by designing kernels, tuning...
and optimize CUDA-based systems and kernels to maximize performance across our fleet. Partner with researchers to integrate novel... across research, infra, and product teams. Mentor engineers on GPU performance, CUDA development, and distributed inference...
Engineer for Kernel Libraries, you will write high-performance kernels for the training and inference workloads. You will work..., or other AI accelerator architectures Have experience writing and optimizing compute kernels with CUDA or similar languages Are familiar...
in software/hardware co-design Deep understanding of GPU and/or other AI accelerators Experience with CUDA or a related..., active learning, among other topics. The Scaling team builds robust and scalable software to support our research efforts...