Is there a crowd-sourced sentiment score for models? I know all these scores are...

hnfong · 2025-08-06T18:20:03 1754504403

Besides the LM Arena Leaderboard mentioned by a sibling comment, if go to the r/LocalLlama/ subreddit, you can very unscientifically get a rough sentiment of the performance of the models by reading the comments (and maybe even check the upvotes). I think the crowd's knee-jerk reaction is unreliable though, but that's what you asked for.

NitpickLawyer · 2025-08-06T19:36:11 1754508971

Not anymore tho. It used to be the place to vibe-check a model ~1 year ago, but lately it's filled with toxic my team vs. your team, memes about CEOs (wtf) and general poor takes on a lot of things.

For a while it was china vs. world, but lately it's even more divided, with heavy camping on specific models. You can still get some signal, but you have to either ban a lot of accounts, or read new during different tzs so you can get some of that "i'm just here for the tech stack" vibe from posters.

littlestymaar · 2025-08-06T20:26:05 1754511965

Yeah, some people just can't stop acting as if tech companies were sport teams, and it gets annoying fast.

parineum · 2025-08-07T00:46:14 1754527574

I don't really go there much anymore but, when I was, there seemed to be an innordinate amount of Chinese nationalism from young accounts speaking odd English.

nurettin · 2025-08-06T17:59:20 1754503160

This has been around for a while https://lmarena.ai/leaderboard/text/coding

klohto · 2025-08-06T17:59:59 1754503199

openrouter usage stats

esafak · 2025-08-06T18:04:26 1754503466

https://openrouter.ai/rankings

The new qwen3 model is not out yet.

setsewerd · 2025-08-06T18:45:55 1754505955

Since the ranking is based on token usage, wouldn't this ranking be skewed by the fact that small models' APIs are often used for consumer products, especially free ones? Meanwhile reasoning models skew it in the opposite direction, but to what extent I don't know.

It's an interesting proxy, but idk how reliable it'd be.

matznerd · 2025-08-06T19:32:06 1754508726

Also, these small models are meant to be run local so not going to appear on openrouter...