
I think people generally don't understand what autoregressive methods are capable of (i.e., basically anything an alternative method can do), and worse, they tend to mentally treat them as equivalent to a model with a context length of 1.
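To make the distinction concrete, here is a rough sketch (my own toy example, not taken from any particular model or library). It builds a small joint distribution over three bits where the third bit is the XOR of the first two. It then compares a chain-rule factorization that conditions on the full prefix against one that only sees the previous token. The function names and the `context` parameter are made up for illustration. This only shows what the factorization can represent in principle; it says nothing about what a trained model actually learns.

```python
from itertools import product

# Toy joint distribution over 3 bits: x3 = x1 XOR x2, with x1 and x2 uniform.
joint = {seq: 0.0 for seq in product((0, 1), repeat=3)}
for a, b in product((0, 1), repeat=2):
    joint[(a, b, a ^ b)] = 0.25


def marginal(positions, values):
    """Probability that the given positions take the given values."""
    return sum(
        p for seq, p in joint.items()
        if all(seq[i] == v for i, v in zip(positions, values))
    )


def conditional(target_pos, target_val, cond_positions, cond_values):
    """p(x[target_pos] = target_val | x[cond_positions] = cond_values)."""
    denom = marginal(cond_positions, cond_values)
    if denom == 0.0:
        return 0.0
    num = marginal(list(cond_positions) + [target_pos],
                   list(cond_values) + [target_val])
    return num / denom


def sequence_prob(seq, context):
    """Chain-rule product where each step sees at most `context` earlier tokens."""
    prob = 1.0
    for t, x in enumerate(seq):
        positions = list(range(max(0, t - context), t))
        prob *= conditional(t, x, positions, [seq[i] for i in positions])
    return prob


# Full-prefix autoregressive factorization vs. a context-1 (first-order Markov) one.
full = {seq: sequence_prob(seq, context=len(seq)) for seq in joint}
markov = {seq: sequence_prob(seq, context=1) for seq in joint}

for seq in joint:
    print(seq, joint[seq], full[seq], markov[seq])
```

The full-prefix version reproduces the joint exactly, since that is just the chain rule of probability. The context-1 version cannot see both earlier bits at once, so it cannot capture the XOR dependency and ends up spreading probability evenly over all eight sequences.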

