
We do enable forcing these sequences of tokens in guidance, and find that it significantly speeds up structured generation. There are tricky alignment issues in making sure you pick the right sequence of tokens, but you can often approximate this well by using the model's native tokenizer. Some details are in this old blog post: https://guidance.readthedocs.io/en/latest/example_notebooks/...
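
To make the idea concrete, here is a minimal toy sketch of token forcing. It is not guidance's actual implementation or API: the vocabulary, `tokenize`, and `fill_template` are all hypothetical names invented for illustration, and the "model" is a scripted stub. Fixed parts of the output are tokenized up front and appended without calling the model at all; the model is only consulted for the free-form value. The greedy longest-match tokenizer stands in for the model's native tokenizer, which is the proxy for picking a token sequence the model would plausibly have produced itself.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical toy vocabulary; a real setup would use the model's own tokenizer.
VOCAB = ['{"', 'name', '":', ' "', '"', '}', '{', 'n', 'a', 'm', 'e', ':', ' ']


def tokenize(text: str) -> List[str]:
    """Greedy longest-match tokenizer over VOCAB (stand-in for a native tokenizer)."""
    tokens = []
    i = 0
    while i < len(text):
        candidates = [t for t in VOCAB if text.startswith(t, i)]
        if not candidates:
            raise ValueError(f"cannot tokenize {text[i:]!r}")
        best = max(candidates, key=len)
        tokens.append(best)
        i += len(best)
    return tokens


def fill_template(
    segments: List[Optional[str]],
    sample_token: Callable[[List[str]], str],
    stop: str = '"',
) -> Tuple[List[str], int]:
    """Fill a template where str segments are forced and None segments are generated.

    Returns the full token list and the number of model calls made.
    """
    tokens: List[str] = []
    model_calls = 0
    for seg in segments:
        if seg is not None:
            # Forced span: the text is already known, so no model call is needed.
            tokens.extend(tokenize(seg))
            continue
        # Free span: ask the model one token at a time until it emits `stop`.
        while True:
            tok = sample_token(tokens)
            model_calls += 1
            if tok == stop:
                break  # the stop token is dropped; the next forced span supplies it
            tokens.append(tok)
    return tokens, model_calls


# Scripted stub standing in for a real model.
scripted = iter(['Ada', '"'])
tokens, calls = fill_template(['{"name": "', None, '"}'], lambda prefix: next(scripted))
print("".join(tokens), "model calls:", calls, "of", len(tokens), "tokens")
```

In this toy run only two of the output's seven tokens cost a model call; the rest are forced. The alignment problem shows up at the boundaries: the forced prefix here ends in the token ` "`, and a real model might have preferred a different split around that point, which is why the tokenizer is only a proxy.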

