Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Director, Reinforcement Learning (RLHF), Location: USA

Page: 1

Product Director - Generative AI Services

-on experience building or using LLM solutions. Experience in any of Supervised Fine-Tuning (SFT), and Reinforcement learning... Director in the Generative AI Services, you lead innovation through the development of products and features that delight...

Company: JPMorgan Chase
Location: New York City, NY
Posted Date: 13 Dec 2025

Director of Product Management, GenAI

Our Generative AI Data Engine trains the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement...Sr. / Director of Product, GenAI Data Engine Location: San Francisco or New York (Hybrid) Scale At Scale...

Posted Date: 06 Dec 2025

Senior Software Engineer, Billing Platform

Our Generative AI Data Engine powers the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement... Learning with Human Feedback), human data generation, model evaluation, safety, and alignment. The data we are producing...

Posted Date: 15 Nov 2025

Forward Deployed Engineer, GenAI

Our Generative AI Data Engine powers the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement... Learning with Human Feedback), human data generation, model evaluation, safety, and alignment. The data we produce...

Posted Date: 04 Nov 2025

Engineering Manager, AgentOps

Our Generative AI Data Engine powers the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement... Learning with Human Feedback), human data generation, model evaluation, safety, and alignment. The data we are producing...

Posted Date: 02 Nov 2025

Senior Software Engineer, GenAI

Our Generative AI Data Engine powers the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement... Learning with Human Feedback), human data generation, model evaluation, safety, and alignment. The data we are producing...

Posted Date: 30 Oct 2025

Software Engineer, Gen AI

Our Generative AI Data Engine powers the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement... Learning with Human Feedback), human data generation, model evaluation, safety, and alignment. The data we are producing...

Posted Date: 30 Oct 2025
Salary: $156000 - 195000 per year

Senior Software Engineer

useful, these models need human eval and reinforcement learning through human feedback (RLHF) during pre-training, fine-tuning... through world-class RLHF, human data generation, model evaluation, safety, and alignment. The data we are producing...

Company: Scale AI
Location: San Francisco, CA
Posted Date: 26 Oct 2025

Engineering Manager, Pay & Incentives

Our Generative AI Data Engine powers the world's most advanced LLMs and generative models through world-class RLHF (Reinforcement... Learning with Human Feedback), human data generation, model evaluation, safety, and alignment. The data we are producing...

Posted Date: 24 Oct 2025

Sr Engineering Program Manager, Evaluation - Special Projects

with reinforcement learning from human feedback (RLHF) and preference optimization. Expertise in statistical analysis, A/B testing... at Director/VP level. Strong ability to navigate ambiguity and lead teams through uncertainty while maintaining program momentum...

Company: Apple
Location: Cupertino, CA
Posted Date: 08 Oct 2025