Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think your example reflects well on oss-20b, not poorly. It (may) show that they've been successful in separating reasoning from knowledge. You don't _want_ your small reasoning model to waste weights memorizing minutiae.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: