> ARC-AGI is a benchmark that’s designed to be simple for humans but excruciatingly difficult for AI. In other words, when AI crushes this benchmark, it’s able to do what humans do.
> I don't think people really appreciate how simple ARC-AGI-1 was, and what solving it really means.
> It was designed as the simplest, most basic assessment of fluid intelligence possible. Failure to pass signifies a near-total inability to adapt or problem-solve in unfamiliar situations.
> Passing it means your system exhibits non-zero fluid intelligence -- you're finally looking at something that isn't pure memorized skill. But it says rather little about how intelligent your system is, or how close to human intelligence it is.
Misunderstanding benchmarks seems to be the first step to claiming human-level intelligence.
Additionally:
> > ARC-AGI is a benchmark that’s designed to be simple for humans but excruciatingly difficult for AI. In other words, when AI crushes this benchmark, it’s able to do what humans do.
This feels like a generalized version of the classic flawed response to 'A computer can now play chess.'
Common non-technical chain of thought after learning this: 'Previously, only humans could play chess. Now, computers can play chess. Therefore, computers can now do other things that previously only humans could do.'
The error is assuming that such problems can only be solved by human-style general intelligence.
This is obviously false, as shown by the way computers calculate arithmetic, optimize via gradient descent, and countless other examples, but it does seem to be a common lay misunderstanding.
That's probably why IBM exploited it in its Watson marketing.
In reality, for reliable reasoning about capabilities, the *how* matters very much.
> Misunderstanding benchmarks seems to be the first step to claiming human level intelligence.
It's known as "hallucination" a.k.a. "guessing or making stuff up", and is a major challenge for human intelligence. Attempts to eradicate it have met with limited success. Some say that human intelligence will never reach AGI because of it.
Thankfully, nobody is trying to sell humans-as-a-service in an attempt to replace the existing AIs in the workplace (yet).
I’m sure such a product would be met with ridicule considering how often humans hallucinate. Especially since, as we all know, the only use for humans is getting responses given some prompt.
Doesn't that turn the entire premise on its head? If passing the benchmark means crossing a lower threshold rather than an upper one, that invalidates most of the claims derived from it.
> ARC-AGI is a benchmark that’s designed to be simple for humans but excruciatingly difficult for AI. In other words, when AI crushes this benchmark, it’s able to do what humans do.
That's a misunderstanding of what ARC-AGI means. Here's what ARC-AGI creator François Chollet has to say: https://bsky.app/profile/fchollet.bsky.social/post/3les3izgd...
> I don't think people really appreciate how simple ARC-AGI-1 was, and what solving it really means.
> It was designed as the simplest, most basic assessment of fluid intelligence possible. Failure to pass signifies a near-total inability to adapt or problem-solve in unfamiliar situations.
> Passing it means your system exhibits non-zero fluid intelligence -- you're finally looking at something that isn't pure memorized skill. But it says rather little about how intelligent your system is, or how close to human intelligence it is.