We're now seeking a Senior AI Software Engineer for our LLM Inference Performance Analysis and Optimization team...! NVIDIA leads the generative AI revolution. We're now seeking an experienced AI Software Engineer to optimize LLM inference...
opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM... and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer...
NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key... pipelines. What you'll be doing: Performance optimization, analysis, and tuning of DL models in various domains like LLM...
THE ROLE: AMD is looking for an influential software engineer who is passionate about improving the performance of key... very latest hardware and software technology. THE PERSON: As a Senior Staff Software Developer, you will be at the...
We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its... with a LLM framework or a DL compiler in inference, deployment, algorithms, or implementation Prior experience with performance...
or research experience in deep learning. Strong software design skills, including debugging, performance analysis, and test... focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference...
We are looking for a Software Test Development Engineer in NVIDIA’s Deep Learning SWQA team. The position is in NVIDIA... and measure the performance of NVIDIA’s Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech...
focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference... architecture search, and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning...
engineer who is passionate about open-source and excited to create our next-generation post-training software stack... intersection of computer-architecture, libraries, frameworks, AI applications and the entire software stack. Performance tuning...
in Python programming, software design, debugging, performance analysis, test design and documentation. Consistent record... bottlenecks, pipelining, and multiprocessing) and demonstrated excellence in related performance analysis and tuning. Prior...
programming, software design, debugging, performance analysis, test design and documentation. Consistent record of working..., pipelining, and multiprocessing) and demonstrated excellence in related performance analysis and tuning. Expertise...
and deployment environments (e.g. TRTLLM, vLLM, SGLang). Proficient in Python programming, software design, debugging, performance... excellence in related performance analysis and tuning. Expertise in distributed computing, model parallelism, and mixed...