
Thev00d00 · 2026-01-12T09:29:47 1768210187

What is happening, USA? Please stop.

"It Hurt Itself in Its Confusion!"

Thev00d00 · 2026-01-11T18:53:09 1768157589

I saw a comment in an "I moved from Windows to Linux" thread implying Windows has more configuration potential than Linux. I wonder what that commenter would make of Gentoo.

I wish I had more time to dedicate to maintaining my system. I'm marooned on Arch for lack of time, which is such a shame.

Thev00d00 · 2026-01-06T13:03:22 1767704602

Wait, Corey Quinn is on El Reg now? That's awesome.

Thev00d00 · 2025-12-26T18:59:19 1766775559

Vibe coding the tool to remove the vibes from text is kind of beautiful really

downboots · 2025-12-27T05:25:45 1766813145

Move fast and break things and duct tape it if the center cannot hold.

Thev00d00 · 2025-12-06T12:29:23 1765024163

ProtonVPN free tier is great

Thev00d00 · 2025-12-03T16:30:48 1764779448

On size-limited platforms like the Steam Deck and friends, this is a huge W.

Thev00d00 · 2025-12-01T14:56:50 1764601010

Remember, this is not a billion-dollar company we are talking about here; this is a volunteer OSS project. If no one wants to maintain the X11 support, then that's up to them.

headsman771 · 2025-12-01T15:23:19 1764602599

It's more the case that certain people decide X11 will no longer be supported and block anyone from trying to support it in "their" project.

Thev00d00 · 2025-11-25T10:01:30 1764064890

Click the export tab

Thev00d00 · 2025-11-21T17:10:59 1763745059

My issue is that the quality of the macOS UI is degrading over time. They can't even get corner rounding consistent, though it's not quite at Windows levels of mismatching yet.

Also, no one bothers making beautiful native apps now. Everything is Electron, which is equally inconsistent everywhere.

So I think the advantage over a Linux system is diminishing over time... slowly.

Thev00d00 · 2025-11-18T15:41:56 1763480516

That is pretty impressive.

So impressive it makes you wonder if someone has noticed it being used as a benchmark prompt.

burkaman · 2025-11-18T15:48:49 1763480929

Simon says if he gets a suspiciously good result he'll just try a bunch of other absurd animal/vehicle combinations to see if they trained a special case: https://simonwillison.net/2025/Nov/13/training-for-pelicans-...

ddalex · 2025-11-18T16:06:39 1763481999

https://www.svgviewer.dev/s/TVk9pqGE giraffe in a ferrari

jmmcd · 2025-11-18T16:04:21 1763481861

"Pelican on bicycle" is one special case, but the problem (and the interesting point) is that with LLMs, they are always generalising. If a lab focussed specially on pelicans on bicycles, they would as a by-product improve performance on, say, tigers on rollercoasters. This is new and counter-intuitive to most ML/AI people.

BoorishBears · 2025-11-18T19:18:47 1763493527

The gold standard for cheating on a benchmark is SFT and ignoring memorization. That's why the standard for quickly testing for benchmark contamination has always been to switch out specifics of the task.

Like replacing named concepts with nonsense words in reasoning benchmarks.
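A rough sketch of that kind of substitution check, in Python. Everything here is made up for illustration: the `swap_named_concepts` helper, the sample question, and the nonsense words are assumptions, not taken from any real benchmark or tool.

```python
import re

def swap_named_concepts(prompt, mapping):
    """Replace each whole-word named concept with its nonsense stand-in."""
    if not mapping:
        return prompt
    # Try longer keys first so "pelicans" is matched before "pelican".
    keys = sorted(mapping, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b")
    return pattern.sub(lambda m: mapping[m.group(1)], prompt)

# Hypothetical benchmark item and hypothetical nonsense replacements.
original = "All pelicans are birds. Pip is a pelican. Is Pip a bird?"
mapping = {
    "pelicans": "wugs",
    "pelican": "wug",
    "birds": "blickets",
    "bird": "blicket",
}

perturbed = swap_named_concepts(original, mapping)
print(perturbed)
```

The idea would be to score the model on both `original` and `perturbed`: the logic of the task is unchanged, so a large drop on the perturbed version hints at memorization rather than reasoning.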

jmmcd · 2025-11-19T09:06:49 1763543209

Yes. But "the gold standard" just means "the most natural, easy and dumb way".

rixed · 2025-11-18T16:28:11 1763483291

I have tried combinations of hard-to-draw vehicles and animals (crocodile, frog, pterodactyl, riding a hand glider, tricycle, skydiving), and it did a rather good job in every case (compared to previous tests). Whatever they have done to improve on that point, they did it in a way that generalises.
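For anyone who wants to repeat this kind of spot check, here is a minimal Python sketch that enumerates the combinations. The `make_prompts` helper and the exact prompt wording are assumptions for illustration; the thread does not say which wording was used.

```python
from itertools import product

# Subjects and activities taken from the list above.
animals = ["crocodile", "frog", "pterodactyl"]
activities = ["riding a hand glider", "riding a tricycle", "skydiving"]

def make_prompts(animals, activities):
    """Build one prompt per (animal, activity) pair."""
    # The prompt template is an assumed wording, not a known benchmark string.
    return [
        f"Generate an SVG of a {animal} {activity}"
        for animal, activity in product(animals, activities)
    ]

prompts = make_prompts(animals, activities)
for p in prompts:
    print(p)
```

Each prompt would then be sent to the model by hand or through whatever client you already use, and the drawings compared by eye.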