
Good point. To keep the regression tests reliable as the app evolves, we run a reliability cascade. First, we generate deterministic Playwright scripts from the codebase and execute them. If execution fails, we fall back to the DOM and ARIA tree. If that still fails, we fall back to vision agents that verify what the user actually sees before flagging a drift in application behavior.

