Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It’s probably something like deepseek’s native sparse attention with content based granularity. They might not be publishing anything because it’s not such a strong value proposition and doing so would lead to commentary that would tank their investment opportunities.


Or maybe because giving it away would tank their investment opportunities.


There's ways and means. Pushing something out in the sub-30B range would gain them mindshare and they could keep bigger models to themselves. I can't see any indication of what size their model is though.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: