Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Generative AI Inference Engineer, Location: USA

Page: 10

AI/ML Engineer

Preferred Qualifications: Experience with predictive modeling and generative AI/LLMs, including RAG systems Familiarity... with LLM inference servers (e.g., vLLM, Ollama) Hands-on benchmarking/Test & Evaluation of AI systems Data visualization...

Posted Date: 23 Dec 2025

Senior Deep Learning Algorithm Engineer

outstanding engineers to join our team and help shape the future of LLM inference. Our team is dedicated to pushing the.... We constantly reflect on how to improve these systems, developing new inference algorithms and protocols, improving existing models...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 20 Dec 2025

Software Development Engineer III, Annapurna Labs

to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the AWS portfolio... and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost in the...

Company: Amazon
Location: New York City, NY
Posted Date: 19 Dec 2025

MTS - Site Reliability Engineer

Reliability & Availability: Ensure uptime, resiliency, and fault tolerance of AI model training and inference systems... orchestration. Knowledge of CI/CD pipelines for Inference and ML model deployment. Hands-on experience with public cloud platforms...

Company: Microsoft
Location: Redmond, WA
Posted Date: 18 Dec 2025

Software Development Engineer III, Annapurna Labs

to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the AWS portfolio... and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost in the...

Company: Amazon
Location: New York City, NY
Posted Date: 15 Dec 2025

Senior Machine Learning Engineer

experience with PyTorch or Jax. Experience with designing, optimizing, and monitoring generative modeling training workflows.... Familiarity with profiling training and/or inference code, identifying performance bottlenecks, and mitigating them. Comfort...

Company: Microsoft
Location: Cambridge, MA
Posted Date: 12 Dec 2025

Senior Machine Learning Engineer II - AI Special Projects

with AI. We are tackling some of the company's biggest growth bets, including generative recommendation systems, hyper-personalized marketing... development of personalized content and recommendation systems by leveraging a mix of generative and traditional machine learning...

Company: Instacart
Location: USA
Posted Date: 11 Dec 2025

Backend Engineer | Multimodal AI Systems

that can handle the extreme complexity of generative AI, from managing inference pipelines to building the infrastructure... intelligence layer that powers our autonomous agentic workflows and massive-scale inference. This role involves designing systems...

Company: Luma AI
Location: Palo Alto, CA
Posted Date: 07 Dec 2025

iOS Engineer | Mobile Architecture & Strategy

our generative media platform on iOS. This is a hands-on leadership role where you will define the technical strategy for our mobile... infrastructure, ensuring it can support massive-scale inference and complex agentic workflows. What You Will Build Distributed...

Company: Luma AI
Location: Palo Alto, CA
Posted Date: 07 Dec 2025

Sr. Software Development Engineer, Annapurna Labs

from various other teams including training, inference and runtime. Collaborate with the runtime team to ensure timely release...'s growing suite of generative AI services and other cloud computing offerings across the AWS portfolio. About AWS Amazon Web...

Company: Amazon
Location: Austin, TX
Posted Date: 05 Dec 2025

Data Center GPU Performance Engineer – Product

direct experience in developing or deploying large scale GPU based AI applications, like LLMs, for training and inference... Ability to quickly develop intuitive, first-principles based models of Generative AI workload performance using GPU and system...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 05 Dec 2025

Staff Machine Learning Engineer - End to End Autonomy

in generative AI for robotics and/or autonomous driving, such as large e2e behavior models, foundation models, world models..., ablation studies, evaluation, deployment, inference optimization Knowledge of debugging and profiling deep neural networks...

Posted Date: 04 Dec 2025

Senior Software Engineer, Real-Time AI and Rendering - Holoscan SDK

observation, and beyond. Wherever high-throughput sensor processing, AI inference, and visualization meet, Holoscan provides the... visualization-spanning surgery, robotics, manufacturing, and scientific discovery. Generative AI is becoming a central force...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 04 Dec 2025

Software Development Engineer, ML Systems, Annapurna Labs

to and experience with Amazon's growing suite of generative AI services and other cloud computing offerings across the AWS portfolio... and Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost in the...

Company: Amazon
Location: New York City, NY
Posted Date: 28 Nov 2025

Senior Software Engineer, Machine Learning

on advanced analytics such as change detection, object detection, and emerging generative AI capabilities. This role is a blend... Collaborating with adjacent ML and software engineering teams to ensure seamless integration of ML pre-processing and inference...

Company: Planet Labs
Location: USA
Posted Date: 20 Nov 2025
Salary: $153000 - 191300 per year

2026 AI/ML Intern - Machine Learning Engineer Intern

customers. At Adobe Firefly, we build foundation generative models for image, video, and other modalities that power the suite... pioneering image and video foundation models powered by data, training, and inference infrastructures. All 2025 Adobe interns...

Company: Adobe
Location: San Jose, CA
Posted Date: 15 Nov 2025

Senior Software Engineer, Computer Vision (Knowledge Distillation)

as large model inference and multi-machine multi-card deployment. Our work enhances user experience by powering diverse... engineering systems for generative AI tasks, including but not limited to model training and optimization, model deployment...

Company: TikTok
Location: San Jose, CA
Posted Date: 14 Nov 2025

Principal Engineer, AI Agents

, orchestration frameworks, and generative AI integration. Proven track record of building production-grade AI/ML inference... solutions across multi-cloud and on-prem environments, integrating agentic AI, generative AI, open data formats, and real-time...

Company: Teradata
Location: San Diego, CA
Posted Date: 13 Nov 2025

Sr. System Development Engineer, High-Performance Accelerator Servers for AI/ML

Description Do you want to shape the future of Generative AI at AWS? Join the team building the foundation of the... world's most advanced cloud for AI training and inference - where multi-billion-parameter models come to life at scale...

Company: Amazon
Location: Seattle, WA
Posted Date: 06 Nov 2025

Sr Hardware Development Engineer, High Performance AI & ML Servers

Description Do you want to shape the future of Generative AI at AWS? Join the team building the foundation of the... world's most advanced cloud for AI training and inference - where multi-billion-parameter models come to life at scale...

Company: Amazon
Location: Seattle, WA
Posted Date: 06 Nov 2025