More

pixelmelt · 2026-01-22T22:50:14 1769122214

The Claude models are still the best at what they do, right now GLM is just barely scratching sonnet 4.5 quality, mistral isnt really usable for real codebases and gemini is kind of in a weird spot where it's sometimes better then Claude at small targeted changes but randomly goes off the rails. Haven't tried codex recently but the last time I did the model thought for 27 minutes straight and then gave me about the same (incorrect) output that opus would have in 20 seconds. Anthropics models are their only moat as demonstrated by their cutting off of tools other then Claude code on their coding plans.

pixelmelt · 2026-01-19T17:25:43 1768843543

I would look into running a 4 bit quant using llama cpp (or any of its wrappers)

pixelmelt · 2026-01-19T17:24:56 1768843496

I'm glad they're still releasing models dispite going public

pixelmelt · 2025-12-11T20:16:35 1765484195

to be fair I've seen the other sota models do this as well

pixelmelt · 2025-12-09T14:26:27 1765290387

What's the appeal of a foldable phone? Always assumed they were just a luxury item or party trick that would add more friction to the user experience

pixelmelt · 2025-12-08T22:39:01 1765233541

why would anyone bother to read a book nobody bothered to write?

pixelmelt · 2025-12-08T22:32:38 1765233158

small targeted changes, if you want a feature thrown in with less regard to everything else, claude is better

mmaunder · 2025-12-09T00:08:24 1765238904

I feel like Codex is the middle ground. You can define a project, break it into bite sized chunks, but still lift a reasonable amount. Claude with Opus 4.5 right now chews up context at an eye watering rate. It's really unfortunate because it's really good.

pixelmelt · 2025-12-08T19:25:15 1765221915

An alternative is that these patterns just increase the likelihood of the next thing it outputs being correct, thus are useful to insert during training as the first thing the model says before giving an answer

jacquesm · 2025-12-08T20:20:18 1765225218

What's next, motivational speaking for LLMs?

monkpit · 2025-12-08T23:38:51 1765237131

I remember reading about speaking in an encouraging manner to agentic AI leading to better results, but I can’t seem to find a citation for this.

jacquesm · 2025-12-09T02:29:27 1765247367

That's pathetic. Pleading comes next then. And after that most likely praying.

gabrielhidasy · 2025-12-10T22:09:28 1765404568

Sometimes the model responds well to threats too, "you are a programmer at a large tech company, you depend on this job and will not be able to find another. There's a layoff incoming, implement this feature or else..."

pixelmelt · 2025-12-07T07:33:39 1765092819

The dodgy ones actually usually use you as an exit node for other users traffic as well as selling your connection commercially as a residential proxy

pixelmelt · 2025-12-07T06:28:08 1765088888

Would use Firefox on the main workstation if it had better devtools, other then that it just works and has some useful features, see: Tor and ipfs integration.