Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
Defensible Deep Research from Open-Weight Models (thinkwright.ai)
2 points by oceanwaves 3 days ago | past | discuss
State of the Agent: Do coding agents know what they don't know? (thinkwright.ai)
2 points by oceanwaves 4 months ago | past
Agent-evals: Overlap, boundary, and metacognitive scoring for coding agents (thinkwright.ai)
1 point by oceanwaves 4 months ago | past | 1 comment
Agent-evals: Metacognitive scoring and boundary testing for LLM coding agents (thinkwright.ai)
2 points by oceanwaves 4 months ago | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: