Does the benchmark reflect your opinion on 3.7? I've been using 3.7 via Cursor a... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		pawelduda 10 months ago \| parent \| context \| favorite \| on: GPT-4.5 Does the benchmark reflect your opinion on 3.7? I've been using 3.7 via Cursor and it's noticeably worse than 3.5. I've heard using the standalone model works fine, didn't get a chance to try it yet though.

jasonjmcghee 10 months ago [–]

personal anecdote - claude code is the best llm devx i've had.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact