Job Search Results

Large Model Inference Acceleration Engineer

with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal... generative AI models. Responsibilities - Design and optimize large model inference pipelines for low-latency, high-throughput...

Company: TikTok

Location: San Jose, CA

Posted Date: 14 Nov 2025

Large Model Training Acceleration Engineer

with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal... creation and consumption on TikTok and serve billions of users. We are seeking an experienced AI model optimization engineer...

Company: TikTok

Location: San Jose, CA

Posted Date: 14 Nov 2025

Sr. Research Engineer/Scientist(all levels), Efficient Models

, but are not limited to, distillation frameworks, model acceleration, hardware-efficient inference, and their applications... and implementing efficient models for large-scale generative AI, with a particular emphasis on large model distillation and compression...

Company: TikTok

Location: San Jose, CA

Posted Date: 27 Jan 2026

Software Engineer - Model Serving Infrastructure - USDS

company. Currently, we are looking for Machine Learning Engineer - Model Serving Infrastructure to join our team to support... and advance that mission. - Responsible for the design and implementation of distributed inference infrastructure for feeds, ads...

Company: TikTok

Location: San Jose, CA

Posted Date: 08 Jan 2026

Research Engineer/Scientist(all levels), Efficient Models

, but are not limited to, distillation frameworks, model acceleration, hardware-efficient inference, and their applications... and implementing efficient models for large-scale generative AI, with a particular emphasis on large model distillation and compression...

Company: TikTok

Location: San Jose, CA

Posted Date: 14 Nov 2025

Machine Learning Engineer - Trae USDS

to date with and apply cutting-edge techniques in large model optimization and inference acceleration. Qualifications: Minimum Qualifications... our hybrid work model, and the specific requirements may change at any time. We are seeking a Machine Learning Engineer...

Company: TikTok

Location: San Jose, CA

Posted Date: 19 Nov 2025

Software Engineer - TikTok AI Search Infrastructure

paradigms - Deploy and optimize text/multimodal LLMs, including inference acceleration, model alignment during training... for large-scale ML infra and online/offline distributed systems, enabling AI to realize its potential value for billions...

Company: TikTok

Location: San Jose, CA

Posted Date: 22 Jan 2026

GPU/AI Application System Software Engineer Intern (System Technologies and Engineering) - 2026 Summer (BS/MS)

benchmark tools and performance optimization of AI workloads specifically tailored for large-scale LLM training and inference... Hardware Acceleration (e.g., GPU/TPU/RDMA) or ML for Systems, and Distributed Storage. - Experience in AI model development...

Company: TikTok

Location: San Jose, CA

Posted Date: 02 Dec 2025

Senior Fellow, ML Optimization

and latest trend in inference and training optimization. Hands-on experience in mapping model architecture to low-level software... model architecture, especially SoTA models, distributed inference and deployment at scale is crucial. KEY RESPONSIBILITIES...

Company: Advanced Micro Devices

Location: San Jose, CA

Posted Date: 14 Jan 2026

Senior Director, ML Workload Performance

performance and optimization team across various frameworks and model architectures. This is a highly visible role with large... trend in inference and training. Experience in mapping model architecture to low-level software, hardware and understanding...

Company: Advanced Micro Devices

Location: San Jose, CA

Posted Date: 07 Dec 2025

Keywords: Large Model Inference Acceleration Engineer, Location: San Jose, CA
