It should be benchmarked against something like RULER[1] 1: https://github.com/h... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		smusamashah on Aug 29, 2024 \| parent \| context \| favorite \| on: 100M Token Context Windows It should be benchmarked against something like RULER[1] 1: https://github.com/hsiehjackson/RULER (RULER: What’s the Real Context Size of Your Long-Context Language Models)

ipsum2 on Aug 29, 2024 [–]

> To incorporate this, we ask the model to complete a chain of hashes instead (as recently proposed by RULER):

They did mention it but didn't provide concrete benchmarks

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact