Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Fine-tuning is expensive and slow compared to prompt engineering, for making changes to a production system.

You can develop validate and push a new prompt in hours.



You need to include the prompt in every query, which makes it very expensive


The prompt is kv-cached, it's precomputed.


Good point, but it still increases the compute of all subsequent tokens




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: