with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems... and scoring logic to evaluate agent actions Analyze agent logs, failure modes, and decision paths Work with code repositories...