Keywords: Senior AI Software Engineer, LLM Inference Performance Analysis, Location: Santa Clara, CA

Senior AI Software Engineer, LLM Inference Performance Analysis

We're now seeking a Senior AI Software Engineer, in our LLM Inference Performance Analysis and Optimization team...! NVIDIA leads the generative AI revolution. We're now seeking an experienced AI Software Engineer to optimize LLM inference...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 02 Nov 2025

Senior Deep Learning Software Engineer, Inference and Model Optimization

opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM... and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 18 Oct 2025

Senior Deep Learning Software Engineer, Inference

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key... pipelines. What you'll be doing: Performance optimization, analysis, and tuning of DL models in various domains like LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 07 Sep 2025

Senior Deep Learning Software Engineer, Inference

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key... pipelines. What you'll be doing: Performance optimization, analysis, and tuning of DL models in various domains like LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 05 Sep 2025

Senior Staff Software Development Engineer- GPU, LLM, AI

THE ROLE: AMD is looking for an influential software engineer who is passionate about improving the performance of key... very latest hardware and software technology. THE PERSON: As a Senior Staff Software Developer, you will be at the...

Posted Date: 14 Sep 2025

Senior Software Development Engineer, TensorRT-LLM

We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its... with an LLM framework or a DL compiler in inference, deployment, algorithms, or implementation. Prior experience with performance...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 02 Nov 2025

Senior GenAI Algorithms Engineer — Model Optimizations for Inference

or research experience in deep learning. Strong software design skills, including debugging, performance analysis, and test... focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Sep 2025

Senior Staff Software Development Engineer- GPU/AI/ML

THE ROLE: AMD is looking for an influential software engineer who is passionate about improving the performance of key... very latest hardware and software technology. THE PERSON: As a Senior Staff Software Developer, you will be at the...

Posted Date: 21 Sep 2025

Senior Software Test Development Engineer, SDET - Deep Learning

We are looking for a Software Test development engineer in NVIDIA’s Deep Learning SWQA team. The position is in NVIDIA... and measure the performance of NVIDIA’s Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 08 Nov 2025

Senior GenAI Algorithms Engineer — Post-Training Optimizations

focuses on optimizing generative AI models such as large language models (LLM) and diffusion models for maximal inference... architecture search, and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 19 Sep 2025

Senior Research Engineer

engineer who is passionate about open-source and excited to create our next-generation post-training software stack... intersection of computer-architecture, libraries, frameworks, AI applications and the entire software stack. Performance tuning...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 11 Oct 2025

Senior Deep Learning Algorithm Engineer

in Python programming, software design, debugging, performance analysis, test design and documentation. Consistent record... bottlenecks, pipelining, and multiprocessing) and demonstrated excellence in related performance analysis and tuning. Prior...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 26 Oct 2025

Senior Deep Learning Algorithm Engineer

programming, software design, debugging, performance analysis, test design and documentation. Consistent record of working..., pipelining, and multiprocessing) and demonstrated excellence in related performance analysis and tuning. Expertise...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 15 Oct 2025

Senior Deep Learning Algorithm Engineer, Training Framework

and deployment environments (e.g. TRTLLM, vLLM, SGLang). Proficient in Python programming, software design, debugging, performance... excellence in related performance analysis and tuning. Expertise in distributed computing, model parallelism, and mixed...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 21 Aug 2025