from a real user perspective to create natural, high-quality data that reflects how people actually use AI in practice. Review..., compare, and rank responses generated by large language models, focusing on usefulness, clarity, and reasoning quality...