The results for GPT-4.5 are in for the Kagi LLM benchmark too. It does crush our b...

mjirv · 2025-02-28T02:21:20 1740709280

Do you have results for GPT-4? I’d be very interested in seeing the lift here from their last “big one”.

wendyshu · 2025-02-28T02:12:48 1740708768

Why don't you have Grok?

mhh__ · 2025-02-28T03:43:06 1740714186

No API for Grok 3 might be why.

theodorthe5 · 2025-02-27T21:31:40 1740691900

If Gemini 2 is the top in your benchmark, make sure to re-check your benchmark.

shawabawa3 · 2025-02-27T21:47:39 1740692859

Gemini 2 Pro is actually very impressive (maybe not for coding, I haven't used it for that).

Flash is pretty garbage but cheap

istjohn · 2025-02-27T22:11:09 1740694269

Gemini 2.0 Pro is quite good.

aoeusnth1 · 2025-02-28T16:09:44 1740758984

Gemini 2 pro is pretty strong actually.