I’ll be interested to see benchmarks. My expectation is that accuracy will take a hit on mid- to long-context prompts: I’d bet that the heavy use of JSON in fine-tuning will end up hurting the quality of a terser (less reasoning space) novel encoding.
I've only looked at one model (gpt-4.1-nano) so far. I'm hoping to run similar tests on other models, but it gets harder to discern statistically significant differences with stronger models, since their accuracy tends to be high across the board.
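To give a sense of the significance check I mean: with pass/fail tallies per format, a Fisher exact test is enough. Rough sketch below (the counts are made up, not real results):

```python
# Sketch: is a JSON-vs-TOON accuracy gap statistically significant?
# The tallies below are placeholders, not real benchmark numbers.
from scipy.stats import fisher_exact

json_correct, json_total = 183, 200   # hypothetical pass/fail counts
toon_correct, toon_total = 171, 200

table = [
    [json_correct, json_total - json_correct],
    [toon_correct, toon_total - toon_correct],
]
odds_ratio, p_value = fisher_exact(table, alternative="two-sided")
print(f"odds ratio={odds_ratio:.2f}, p={p_value:.3f}")
# With strong models both formats saturate near 100%, the gap shrinks,
# and p stays large -- which is exactly why significance is hard to
# discern on better models.
```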
Do you mean the [0] Token Benchmarks section? I only see token count numbers.
Which doesn't address the question: do LLMs understand TOON as well as they understand JSON? It's quite likely that most LLMs don't interpret this notation the way they do JSON. So benchmarks on, say, data-processing tasks would be warranted; something like the sketch below.
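A sketch only: the TOON rendering is hand-written from my reading of the spec, and gpt-4.1-nano is just the model mentioned upthread.

```python
# Sketch of a data-processing benchmark: same table as JSON vs TOON,
# same question, compare exact-match accuracy over repeated trials.
import json
from openai import OpenAI

client = OpenAI()

records = [
    {"id": 1, "name": "Alice", "score": 91},
    {"id": 2, "name": "Bob", "score": 78},
    {"id": 3, "name": "Carol", "score": 85},
]

json_doc = json.dumps({"users": records})
# TOON tabular form, written by hand from my reading of the spec.
toon_doc = "users[3]{id,name,score}:\n" + "\n".join(
    f"  {r['id']},{r['name']},{r['score']}" for r in records
)

QUESTION = "Which user has the highest score? Reply with the name only."

def ask(doc: str) -> str:
    resp = client.chat.completions.create(
        model="gpt-4.1-nano",  # the model tested upthread
        messages=[{"role": "user", "content": f"{doc}\n\n{QUESTION}"}],
    )
    return resp.choices[0].message.content.strip()

for label, doc in [("json", json_doc), ("toon", toon_doc)]:
    hits = sum(ask(doc) == "Alice" for _ in range(20))
    print(f"{label}: {hits}/20 correct")
```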
I would assume the next iterations/fine-tuned variants of current models will reach accuracy on TOON similar to what they already have on JSON.
The current models unfortunately do not have TOON in their training set, so they would probably require additional input tokens to grok the notation, and even then they likely won't match the accuracy they get with JSON.
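A rough illustration of that overhead with tiktoken; the format legend and the TOON rendering here are my own guesses, not anything official:

```python
# Count the fixed input-token cost of explaining TOON up front,
# against the tokens the terser encoding saves on the payload.
import json
import tiktoken

enc = tiktoken.get_encoding("o200k_base")  # GPT-4o-family tokenizer

records = [{"id": i, "name": f"user{i}", "score": 50 + i} for i in range(100)]
json_doc = json.dumps({"users": records})
toon_doc = "users[100]{id,name,score}:\n" + "\n".join(
    f"  {r['id']},{r['name']},{r['score']}" for r in records
)
# My own wording for a minimal TOON explainer prepended to the prompt.
legend = (
    "The data below is in TOON: `key[N]{fields}:` declares a table of "
    "N rows, and each following indented line is one comma-separated row.\n"
)

print("json:        ", len(enc.encode(json_doc)))
print("toon:        ", len(enc.encode(toon_doc)))
print("toon+legend: ", len(enc.encode(legend + toon_doc)))
# The legend is a fixed cost, so it amortizes on large payloads,
# but it eats into the savings on small ones.
```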
That said: I like the idea!