Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You cannot use any prior, let alone literally any regularizer, and say it would work almost just as well.

A standard normal prior centered at 0 and one centered at 42 can give very different results.



i said almost - that's code for "obviously i'm not talking about pathological regularizers"


Well, in that case minimizing the (negative) loglikelihood seems principled but you could minimize literally any loss function and it would work almost just as well.


Lol agreed!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: