Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Generative AI Inference Engineer, Location: USA

Page: 1

Generative AI Inference Engineer

Generative AI Inference Engineer Remote About the role: We are seeking passionate Machine Learning Engineers... to join our Inference team, focusing on the creative applications of generative AI models. The ideal candidate will have substantial...

Company: Stability AI
Location: USA
Posted Date: 19 Nov 2025

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

NVIDIA is at the forefront of the generative AI revolution. We are looking for a Software Engineer, Performance... Analysis, and Optimization for LLM Inference, to join our performance engineering team. In this role, you will focus...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 16 Jan 2026
Salary: $124000 - 195500 per year

Large Model Inference Acceleration Engineer

generative AI models. Responsibilities - Design and optimize large model inference pipelines for low-latency, high-throughput... creation and consumption on TikTok and serve billions of users. We are seeking an experienced AI model optimization engineer...

Company: TikTok
Location: San Jose, CA
Posted Date: 14 Nov 2025

Neural Rendering Research Inference Engineer – Advanced Graphics Programs

and generative ai applications. THE ROLE: AMD is looking for a strategic research inference engineer who is passionate... your career. THE PERSON: We are seeking an exceptional Neural Rendering Research Inference Engineer - Advanced Graphics...

Posted Date: 09 Nov 2025

Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range... models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side...

Company: Amazon
Location: Seattle, WA
Posted Date: 01 Jan 2026

Software Engineer II - AI/ML, AWS Neuron, LLM Inference, AI/ML, AWS Neuron, Model Inference

inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range... models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side...

Company: Amazon
Location: Cupertino, CA
Posted Date: 14 Dec 2025

Senior Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range... models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side...

Company: Amazon
Location: Cupertino, CA
Posted Date: 11 Dec 2025

Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range... models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side...

Company: Amazon
Location: Cupertino, CA
Posted Date: 11 Dec 2025

Software Development Engineer, AI/ML, AWS Neuron, Model Inference

inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range... models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side...

Company: Amazon
Location: Cupertino, CA
Posted Date: 21 Nov 2025

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your work will involve... inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range...

Company: Amazon
Location: Cupertino, CA
Posted Date: 06 Nov 2025

Solutions Architect, Inference Deployments

AI Engineer or similar. Active contributions to Kubernetes SIGs or AI inference projects (e.g., KServe, Dynamo, SGLang...We’re forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA’s GPU...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 23 Nov 2025

Senior Python Engineer - Generative AI

impact. The Technology Community Office is in search of a Senior Engineer - Generative AI to expand the team... products in public cloud. Knowledge of Generative AI and Large Language Model inference. Knowledge of AI tools and frameworks...

Company: American Express
Location: Phoenix, AZ
Posted Date: 15 Jan 2026
Salary: $123000 - 215250 per year

Generative AI Engineer - New York, NY

Job Title: Generative AI Engineer Location: New York, NY Domain: Financial Duration: Long Term Contract... Looking for W2 Candidates. No C2C Job Summary: We are seeking a highly skilled and innovative Generative AI Engineer to lead the...

Company: TechniPros
Location: New York City, NY
Posted Date: 15 Jan 2026

Generative AI Engineer

technology and services Position: Generative AI Engineer Location: Atlanta, GA Duration: 6 Months Job Type: Temporary...: We are looking for a Generative AI Engineer to design, build, and scale an enterprise AI-powered semantic search platform for API discovery...

Company: TekWissen
Location: Atlanta, GA
Posted Date: 15 Jan 2026

Staff Research Engineer - Generative Video

Research Engineer (Generative Video), you'll help bring Canva's next wave of AI-powered video creation to life - turning... and product engineering teams to shape the end-to-end generative video stack - from data and training, to evaluation, to inference...

Company: Canva
Location: San Francisco, CA
Posted Date: 01 Jan 2026

AI Engineer, Generative AI Agents

. Job Summary: LG Ads is seeking a highly skilled and motivated AI Engineer specializing in Generative AI Agents... development of the LLM Service Hub, ensuring consistent model access and inference for various AI applications. Utilize...

Company: LG Ad Solutions
Location: Denver, CO
Posted Date: 15 Nov 2025

Senior Generative AI Software Engineer

for evaluation, inference, or release. Prior work cleaning up sophisticated generative model codebases—adding tests, improving...At NVIDIA, we're not just building the future, we're generating it! Our Cosmos generative AI engineering team...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 21 Dec 2025

Sr. Cloud Hardware Dev Engineer (AWS Generative AI & ML Servers), AWS Generative AI & ML Servers

Description Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the... cloud for AI training and inference? Want to do industry leading work delivering continuous price performance improvements...

Company: Amazon
Location: Seattle, WA
Posted Date: 13 Dec 2025

Cloud Hardware Dev Engineer (AWS Generative AI & ML Servers), AWS Generative AI & ML Servers

Description Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the... cloud for AI training and inference? Want to do industry leading work delivering continuous price performance improvements...

Company: Amazon
Location: Seattle, WA
Posted Date: 11 Dec 2025

Cloud Hardware Dev Engineer (AWS Generative AI & ML Servers), AWS Generative AI & ML Servers

Description Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the... cloud for AI training and inference? Want to do industry leading work delivering continuous price performance improvements...

Company: Amazon
Location: Seattle, WA
Posted Date: 11 Dec 2025