behavioral tests and migration stubs for Rust targets.
- Designed and implemented black-box functional test frameworks (pytest + uv..., Python 3.10) for native CLI projects.
- Built reproducible Docker evaluation images and CI pipelines for agent evaluation...