Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

A few months ago I made a (theoretically) infinitely learning geo-guessing model that updated the policy with each user guess: https://geospot.sdan.io/

Hoping to implement a simple RL loop here and optimize whats generated by the LLM to create the perfect slop machine :)



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: