specific problems. Software Engineer, Systems ML - Frameworks / Compilers / Kernels Responsibilities Development of SW... machine learning compiler frameworks and will help in driving next generation hardware software codesign for AI domain...
job responsibilities As a ML Compiler Engineer II on the Neuron Compiler Automated Reasoning Group, you will develop and maintain tooling... domains, including Large Language and Vision, originating from leading frameworks such as PyTorch, TensorFlow, and JAX...
hardware-software boundary, our engineers craft high-performance kernels for ML functions, ensuring every FLOP counts... of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML...
hardware-software boundary, our engineers craft high-performance kernels for ML functions, ensuring every FLOP counts... of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML...
fast. Senior Staff Software Engineer, High Performance GPU Inference Systems Mission: Push the limits... scalable, low-latency runtime systems that coordinate thousands of GPUs across tightly integrated, software-defined...
Inference benchmarks on the newest NVIDIA GPUs. Productionize inference systems with uncompromised software quality... computing principles. Hands-on experience with ML frameworks (e.g., PyTorch) and inference engines (e.g., vLLM and SGLang...
We are now looking for a Senior Deep Learning Software Engineer, FlashInfer. NVIDIA has been transforming computer... in the inference systems software stack! We build innovative AI systems software to accelerate for AI inference. As a member...
frameworks for AMD GPUs. Your expertise will be critical in enhancing GPU kernels, deep learning models, and training/inference... and optimize frameworks like TensorFlow and PyTorch for AMD GPUs in open-source repositories. Develop GPU Kernels: Create...
for all key AI services. The team has been developing AI frameworks to accelerate Meta's DL/ML workloads on the specialized MTIA... of Software stack with one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development...
, tensor core optimization ML Compilers & Frameworks: PyTorch/JAX internals, torch.compile, XLA, custom operators Performance... breakthrough innovations in GPU performance and systems engineering. As a GPU Performance Engineer, you'll architect and implement...
algorithms for ML/AI compilers, kernels and HW features to improve mappings of ML/AI workloads on existing and future HW... Engineering, Software Engineering, Systems Engineering, or related work experience. OR Master's degree in Computer Science...