pheeney 11 months ago | on: Mistral Small 3
What models would you recommend for basic classification if you don't need a 24B parameter one?
josh-sematic 11 months ago
You might find this comparison chart helpful:
https://www.airtrain.ai/blog/how-15-top-llms-perform-on-clas...
Note: from October; also I work at Airtrain
elorant 11 months ago
I’m using Llama-3 8B to classify HTML files. It’s surprisingly good, and I run it on an RTX 4060 Ti at 8-bit quantization. No complaints so far.
Beretta_Vexee 11 months ago
There's no alternative to testing with your own data. The majority of our data is in French, and our benchmark results differ greatly from public benchmarks, which are generally based on English documents.
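As a rough illustration of testing on your own data, here is a minimal sketch of an evaluation loop. Everything in it is an assumption made for illustration: `evaluate`, `keyword_classifier`, the labels, and the French sample texts are hypothetical, and the keyword rule only stands in for whatever model call you actually use.

```python
from collections import Counter
from typing import Callable, Iterable


def evaluate(classify: Callable[[str], str],
             examples: Iterable[tuple[str, str]]) -> dict:
    """Return overall and per-label accuracy on (text, expected_label) pairs."""
    total = Counter()    # number of examples seen per expected label
    correct = Counter()  # number of correct predictions per expected label
    for text, expected in examples:
        total[expected] += 1
        if classify(text) == expected:
            correct[expected] += 1
    n = sum(total.values())
    return {
        "accuracy": sum(correct.values()) / n if n else 0.0,
        "per_label": {label: correct[label] / total[label] for label in total},
    }


# Hypothetical stand-in for a real model call (e.g. a locally hosted LLM).
def keyword_classifier(text: str) -> str:
    return "invoice" if "facture" in text.lower() else "other"


# Hypothetical hand-labeled samples drawn from your own documents.
samples = [
    ("Facture n° 42 pour la commande", "invoice"),
    ("Compte rendu de réunion", "other"),
    ("Veuillez trouver la note d'honoraires", "invoice"),
]

report = evaluate(keyword_classifier, samples)
print(report)
```

Swapping `keyword_classifier` for each candidate model and rerunning on the same labeled set gives a like-for-like comparison on your own documents rather than on a public benchmark.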