Find your dream job now!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Multimodal Reinforcement Learning Post-Training Algorithm Expert, Location: Singapore

Page: 1

Multimodal Reinforcement Learning Post-Training Algorithm Expert

and evolution trends of post-training algorithms for multimodal large models (e.g., RLHF, DPO, Curriculum Reinforcement Learning..., with a deep understanding of multimodal large models and the reinforcement learning post-training technology stack Core...

Apply Now

Company: Tencent

Location: Singapore

Posted Date: 29 Nov 2025