Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I'm sure they did but there's no reason to believe they pretrained it on chess anymore than 4 so there's some speculation the post training processes mess things up. Turbo instruct does not go through RLHF for instance.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: