Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I've had even better results using the dense 27B model -- less looping and churning on problems


Which dense model are you referring to? The dense model isn’t finetuned for code instruction according to the model card.


https://huggingface.co/Qwen/Qwen3.5-27B

I wasn't aware of that, which page mentions that?


Yeah the page you linked even shows the benchmarks in coding for this model, so I'd be curious where that claim comes from




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: