Keywords: Software Engineer, Inference Deployment, Location: USA

Software Engineer, Inference Deployment

systems that make inference deployment continuous and unattended. As a Software Engineer on the Launch Engineering team... is to make inference deployment boring and unattended. Anthropic serves Claude to millions of users across GPUs, TPUs...

Company: Anthropic
Location: USA
Posted Date: 07 Feb 2026

Senior Deep Learning Software Engineer, Inference and Model Optimization

and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer... to develop and scale up our automated inference and deployment solution. As part of the team, you will be instrumental in pushing...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 23 Jan 2026

Senior Software Development Engineer - LLM Kernel & Inference Systems

Optimize the full inference stack—from model execution graphs and runtimes to scheduling, batching, and deployment. * Open...) inference and kernel optimization for AMD GPUs. You will play a critical role in advancing high-performance LLM serving...

Posted Date: 20 Dec 2025

Senior Software Engineer - vLLM Inference

and software. As an ML Ops engineer, you will work closely with our technical and research teams to manage training and deployment... What you will bring 2+ years of experience in MLOps, DevOps, Automation and modern Software Deployment practices Experience evaluating...

Company: Red Hat
Location: Boston, MA
Posted Date: 07 Dec 2025

Senior II Software Engineer Lead - Akamai Inference Cloud (Remote)

Job Category: Software Engineer Job Description: Do you thrive on technical leadership and building cutting-edge... performance, compliance, and economics. Partner with the best As a Senior II Software Engineer Lead, you will be responsible...

Company: Akamai
Location: Cambridge, MA
Posted Date: 21 Nov 2025

Principal Software Engineer - Akamai Inference Cloud (Remote)

Job Category: Software Engineer Job Description: Do you thrive on solving complex technical challenges..., and economics. Partner with the best As a Principal Software Engineer, you will serve as a technical leader and architect...

Company: Akamai
Location: Cambridge, MA
Posted Date: 21 Nov 2025

Staff + Senior Software Engineer, Cloud Inference

Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP... integration and intelligent request routing to inference execution, capacity management, and day-to-day operations...

Company: Anthropic
Location: USA
Posted Date: 05 Feb 2026

Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Description The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used... inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range...

Company: Amazon
Location: Cupertino, CA
Posted Date: 30 Jan 2026

Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference

Description The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used... inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range...

Company: Amazon
Location: Seattle, WA
Posted Date: 01 Jan 2026

Software Engineer II - AI/ML, AWS Neuron, LLM Inference, AI/ML, AWS Neuron, Model Inference

Description The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used... inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range...

Company: Amazon
Location: Cupertino, CA
Posted Date: 14 Dec 2025

Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Description The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used... inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range...

Company: Amazon
Location: Cupertino, CA
Posted Date: 21 Nov 2025

Forward Deployed Engineer, AI Inference (vLLM and Kubernetes)

as a Forward Deployed Engineer. In this role, you will not just build software; you will be the bridge between our cutting-edge... inference platform ( , and ) and our customers' most critical production environments. You will interface directly with the...

Company: Red Hat
Location: Massachusetts
Posted Date: 25 Jan 2026

AI Inference Engineer

Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the model deployment for efficient inference; [3... software and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and endpoint devices...

Company: quadric, Inc
Location: Burlingame, CA
Posted Date: 24 Dec 2025

Large Model Inference Acceleration Engineer

, including deep learning model training, optimization, deployment and applications. We provide AI capabilities to empower content... creation and consumption on TikTok and serve billions of users. We are seeking an experienced AI model optimization engineer...

Company: TikTok
Location: San Jose, CA
Posted Date: 14 Nov 2025

AI Models GPU deployment software Engineer

your career. THE ROLE: AMD is looking for a software engineer who is passionate about expanding AI models on AMD GPUs... industry specialists and will work with the very latest hardware and software technology. THE PERSON: Strong technical...

Location: Austin, TX
Posted Date: 04 Jan 2026

Software Engineer - Python

Genesis10 is currently seeking a Software Engineer – Python with our client in the financial industry located... for scalable model serving Coordinate with deployment and support teams for successful releases into testing and production...

Company: Genesis10
Location: Charlotte, NC
Posted Date: 08 Feb 2026

Principal Software Engineer- GenAI

latest information – and . TITLE: Principal Software Engineer - GenAI WHAT YOU’LL DO: As a Principal Software Engineer... and implement core AI services, including model orchestration, prompt engineering frameworks, and inference pipelines. Collaborate...

Location: USA
Posted Date: 08 Feb 2026

Software Developer/Engineer (LLM / Meta Llama 3 / Mistral / Mixtral / Python)

Trigyn has a long-term contract opportunity for Software Developer/Engineer with our direct client - a major utility... 3 and Mistral / Mixtral in on-prem or private environments Strong proficiency in Python for LLM inference, prompt...

Posted Date: 08 Feb 2026