Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
MathArena: Evaluating LLMs on uncontaminated math questions (matharena.ai)
2 points by GaggiX 6 days ago | past | discuss
New open source model achieves same score as GPT 5.2 High on AIME2026 I (matharena.ai)
3 points by mh3467 9 days ago | past | 2 comments
MathArena Apex: Unconquered Final-Answer Problems (matharena.ai)
2 points by frozenseven 4 months ago | past
Evaluating publicly available LLMs on IMO 2025 (matharena.ai)
79 points by hardmaru 7 months ago | past | 89 comments
Not Even Bronze: Evaluating LLMs on 2025 International Math Olympiad (matharena.ai)
3 points by amichail 7 months ago | past | 2 comments
IMO 2025 LLM results are in (matharena.ai)
5 points by arberavdullahu 7 months ago | past | 1 comment
Not Even Bronze? Evaluating LLMs on 2025 International Math Olympiad (matharena.ai)
1 point by EvgeniyZh 7 months ago | past | 1 comment
Gemini 2.5 gets 24.4% on MathArena USAMO beating previous top score of 4.7% (matharena.ai)
54 points by alphabetting 10 months ago | past | 10 comments
OpenAI o3-mini scores 78% on yesterday's AIME 2025 math competition (matharena.ai)
3 points by bmislav on Feb 7, 2025 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: