Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Does the benchmark reflect your opinion on 3.7? I've been using 3.7 via Cursor and it's noticeably worse than 3.5. I've heard using the standalone model works fine, didn't get a chance to try it yet though.


personal anecdote - claude code is the best llm devx i've had.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: