
> how it normally takes him 4 to 8 hours to put together complicated, data-heavy reports. Now he fires off an agent request, goes to walk his dog, and comes back to a downloadable spreadsheet of dense data, which he pulls up and says "I think it got 98% of the information correct...

This is where the AI hype bites people.

A great use of AI in this situation would be to automate the collection and checking of data. Search all of the data sources and aggregate links to them in an easy place. Use AI to search the data sources again and compare against the spreadsheet, flagging any numbers that appear to disagree.
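
A minimal sketch of just the comparison step in Python with pandas - the file names, column names, and join key here are hypothetical, and the AI would only be doing the re-collection, not this part:

    import pandas as pd

    # What the agent produced vs. values re-pulled from the source of truth.
    # Both files are assumed to share a "record_id" key and a "value" column.
    report = pd.read_csv("agent_report.csv")
    source = pd.read_csv("source_extract.csv")

    merged = report.merge(source, on="record_id", suffixes=("_report", "_source"))

    # Flag numeric cells that disagree beyond a small tolerance, for human review.
    mismatch = merged["value_report"].sub(merged["value_source"]).abs().gt(0.01)
    flagged = merged.loc[mismatch, ["record_id", "value_report", "value_source"]]

    print(f"{mismatch.sum()} of {len(merged)} rows disagree with the source:")
    print(flagged.to_string(index=False))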

Yet the AI hype train takes this all the way to the extreme conclusion of having AI do all the work for them. The quip about 98% correct should be a red flag for anyone familiar with spreadsheets, because it’s rarely simple to identify which 2% is incorrect without reviewing everything.

This same problem extends to code. People who use AI as a force multiplier to do the thing for them and review each step as they go, while also disengaging and working manually when it’s more appropriate have much better results. The people who YOLO it with prompting cycles until the code passes tests and then submit a PR are causing problems almost as fast as they’re developing new features in non-trivial codebases.



From John Dewey's Human Nature and Conduct:

“The fallacy in these versions of the same idea is perhaps the most pervasive of all fallacies in philosophy. So common is it that one questions whether it might not be called the philosophical fallacy. It consists in the supposition that whatever is found true under certain conditions may forthwith be asserted universally or without limits and conditions. Because a thirsty man gets satisfaction in drinking water, bliss consists in being drowned. Because the success of any particular struggle is measured by reaching a point of frictionless action, therefore there is such a thing as an all-inclusive end of effortless smooth activity endlessly maintained.

It is forgotten that success is success of a specific effort, and satisfaction the fulfillment of a specific demand, so that success and satisfaction become meaningless when severed from the wants and struggles whose consummations they are, or when taken universally.”


The proper use of these systems is to treat them like an intern or new grad hire. You can give them the work that none of the mid-tier or senior people want to do, thereby speeding up the team. But you will have to review their work thoroughly because there is a good chance they have no idea what they are actually doing. If you give them mission-critical work that demands accuracy or just let them have free rein without keeping an eye on them, there is a good chance you are going to regret it.


What an awful way to think about internships.

The goal is to help people grow, so they can achieve things they would not have been able to handle before gaining that additional experience. This might include boring dirty work, yes. But then they prove they can overcome such a struggle, and more experienced people should also be expected to be able to go through it - if there is no obviously more pleasant way to go.

What you say of interns regarding checks is just as true for any human out there, and the more power they are given, the more relevant it is to be vigilant, no matter their level of experience. Not only will humans make errors, but power games are generally very permeable to corruptible souls.


I agree that it sounds harsh. But I worked for a company that hired interns, and this was the way managers talked about them - as cheap, unreliable labor. I once spoke with an intern hoping that they could help with a real task: using TensorFlow (it was a long time ago) to help analyze our work process history, but the company ended up putting them on menial IT tasks and they checked out mentally.


>The goal is to help people grow, so they can achieve things they would not have been able to deal with before gaining that additional experience.

You and others seem to be disagreeing with something I never said. This is 100% compatible with what I said. You don't just review and then silently correct an intern's work behind their back; the review process is part of the teaching. That doesn't really work with AI, so it wasn't explicitly part of my analogy.


What an awful way to think about other people, always assuming the very worst version of what they said.


Certainly such a message demonstrates that a significant amount of effort has been put into not falling into the kind of behavior it warns against. ;)


The goal of internships at a for-profit company is not the personal growth of the intern. This is a nice sentiment, but the function of the company is to make money, so an intern with net-negative productivity doesn't make sense when the goals are quarterly financials.


Sure, companies wouldn't do anything that negatively affects their bottom line, but consider the case that an intern is a net zero - they do some free labor equal to the drag they cause demanding attention of their mentor. Why have an intern in that case? Because long term, expanding the talent pool suppresses wages. Increasing the number of qualified candidates gives power to the employer. The "Learn to Code" campaign along with the litany of code bootcamps is a great example: it poses as personal growth / job training to increase the earning power of individuals, but on the other side of that is an industry that doesn't want to pay its workers 6 figures, so they want to make coding a blue-collar job.

But coding didn't become a low-wage job; now we're spending GPU credits to make pull requests instead and skipping the labor altogether. Anyway, I share the parent poster's chagrin at all the comparisons of AI to an intern. If all of your attention is spent correcting the work of a GPU, the next generation of workers will never have mentors giving them attention, choking off the supply of experienced entry-level employees. So what happens in 10 or 20 years? I guess anyone who actually knows how to debug computers, instead of handing the problem off to an LLM, will command extraordinary emergency-fix-it wages.


I’ve never experienced an intern who was remotely as mediocre and incapable of growth as an LLM.


I had an intern who didn’t shower. We had to have discussions about body odor in an office. AI/LLMs are an improvement in that regard. They also do better work than that kid did. At least he had rich parents.


I had a coworker who only showered once every few days after exercise, and never used soap or shampoo. He had no body odor, which could not be said about all employees, including management.

It’s that John Dewey quote from a parent post all over again.


Was he Asian? Seems like Asians somehow win the genetic lottery in the stink generation department.


Wait, you had Asmongold work for you? Tell us more! xD


I have always been told to expect an intern to be a net loss in productivity to you, and anything else is a bonus, since the point is to help them learn.


What about a coach's own ability improving through instruction?


The point of coaching a Junior is so they improve their skills for next time

What would be the point of coaching an LLM? You will just have to coach it again and again


Coaching a junior doesn’t just improve the junior. It also tends to improve the senior.


Coaching an LLM seems unlikely to improve you meaningfully


What about it?


Isn't the point of an intern or new grad that you are training them to be useful in the future, acknowledging that for now they are a net drain on resources?


An overly eager intern with short term memory loss, sure.


And working with interns requires more work for the final output compared to doing it yourself.


For this example, let’s replace the word “intern” with “initial-stage experts” or something.

There’s a reason people invest their time with interns.


Yeah, most of us are mortal, that’s the reason.


But LLMs will not move to another company after you train them. OTOH, interns can replace mid-level engineers as they learn the ropes, in case their boss departs.


Yeah, people complaining about accuracy of AI-generated code should be examining their code review procedures. It shouldn’t matter if the code was generated by a senior employee, an intern, or an LLM wielded by either of them. If your review process isn’t catching mistakes, then the review process needs to be fixed.

This is especially true in open source where contributions aren’t limited to employees who passed a hiring screen.


This is taking what I said further than intended. I'm not saying the standard review process should catch the AI-generated mistakes. I'm saying this work is at the level of someone who can and will make plenty of stupid mistakes. It therefore needs to be thoroughly reviewed by the person using it before it is even up to the standard of a typical employee's work that the normal review process generally assumes.


Yep, in the case of open source contributions as an example, the bottleneck isn't contributors producing and proposing patches, it's a maintainer deciding if the proposal has merit, whipping (or asking contributors to whip) patches into shape, making sure it integrates, etc. If contributors use generative AI to increase the load on the bottleneck it is likely to cause a negative net effect.


This very much. Most of the time, it's not a code issue, it's a communication issue. Patches are generally small; it's the whole communication around them until both parties have a common understanding that takes so much time. If the contributor comes with no understanding of their patch, that breaks the whole premise of the conversation.


I can still complain about the added workload of inaccurate code.


If 10 times more code is being created, you need 10 times as many code reviewers.


Plus the overhead of coordinating the reviewers as well!


"Corporate says the review process needs to be relaxed because its preventing our AI agents from checking in their code"


”The people who YOLO it with prompting cycles until the code passes tests and then submit a PR are causing problems almost as fast as they’re developing new features in non-trivial codebases.”

This might as well be the new definition of “script kiddie”, and it’s the kids that are literally going to be the ones birthed into this lifestyle. The “craft” of programming may not be carried by these coming generations and possibly will need to be rediscovered at some point in the future. The Lost Art of Programming is a book that’s going to need to be written soon.


Too bad no one will be able to read it. Better make it a video essay.


with subtitles flashing in the center of the screen


So it is here to stay. If you’re unable to write good code with it, that doesn’t mean everyone is writing bad code with it.


Oh come on, people have been writing code with bad, incomplete, flaky, or absent tests since automated testing was invented (possibly before).

It's having a good, useful and reliable test suite that separates the sheep from the goats.*

Would you rather play whack-a-mole with regressions and Heisenbugs, or ship features?

* (Or you use some absurdly good programming language that is hard to get into knots with. I've been liking Elixir. Gleam looks even better...)


It sounds like you’re saying that good tests are enough to ensure good code even when programmers are unskilled and just rewrite until they pass the tests. I’m very skeptical.


It may not be a provable take, but it’s also not absurd. This is the concept behind modern TDD (as seen in frameworks like cucumber):

Someone with product knowledge writes the tests in a DSL

Someone skilled writes the verbs to make the DSL function correctly

And from there, any amount of skill is irrelevant: either the tests pass, or they fail. One could hook up a markov chain to a javascript sourcebook and eventually get working code out.
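
A minimal sketch of that split in Python - not the real cucumber API; the phrases, verbs, and cart example are invented for illustration:

    # The "skilled person" side: a registry of verbs behind the DSL phrases.
    VERBS = {}

    def verb(phrase):
        def register(fn):
            VERBS[phrase] = fn
            return fn
        return register

    cart = []

    @verb("an empty cart")
    def empty_cart():
        cart.clear()

    @verb("I add a widget")
    def add_widget():
        cart.append("widget")

    @verb("the cart has 1 item")
    def cart_has_one_item():
        assert len(cart) == 1, f"expected 1 item, got {len(cart)}"

    # The "product person" side: a scenario is just a list of phrases.
    scenario = ["an empty cart", "I add a widget", "the cart has 1 item"]

    for step in scenario:
        VERBS[step]()  # the implementation is irrelevant here: it passes or it fails
    print("scenario passed")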


> One could hook up a markov chain to a javascript sourcebook and eventually get working code out.

Can they? Either the DSL is so detailed and specific as to be just code with extra steps, or there is a lot of ground not covered by the test cases, with landmines that a million monkeys with typewriters could unwittingly step on.

The bugs that exist while the tests pass are often the most brutal - first to find and understand, and second when they occasionally reveal that a fundamental assumption was wrong.


Tests are just for the bugs you already know about


They're also there to prevent future bugs.


“The quip about 98% correct should be a red flag for anyone familiar with spreadsheets”

I disagree. Receiving a spreadsheet from a junior means I need to check it. If this gives me infinite additional juniors I’m good.

It’s this popular pattern in HN comments - expecting AI to be deterministically correct - while the whole world operates on stochastically correct all the time…


In my experience the value of junior contributors is that they will one day become senior contributors. Their work as juniors tends to require so much oversight and coaching from seniors that they are a net negative on forward progress in the short term, but the payoff is huge in the long term.


I don't see how this can be true when no one stays at a single job long enough for this to play out. You would simply be training junior employees to become senior employees for someone else.


So this has been a problem in the tech market for a while now. Nobody wants to hire juniors for tech because even at FAANGs the average tenure is what, 2-3 years? There's no incentive for companies to spend the time, money, and productivity hit to train juniors properly. When the current cohort ages out, a serious problem is going to occur, and it won't be pretty.


It seems there's a distinct lack of enthusiasm for hiring people who've exceeded that 2-3 year tenure at any given place, too. Maintaining a codebase through its lifecycle seems often to be seen as a sign of complacency.


Exactly this

And it should go without saying that LLMs do not have the same investment/value tradeoff. Whether they contribute like a senior or a junior seems entirely up to luck.

Prompt skill is too flaky and unreliable to ensure good output from LLMs.


When my life was spreadsheets, we were expected to get to the point of being 99.99% right.

You went from “do it again” to “go check the newbie’s work”.

To get to that stage your degree of proficiency would be “can make out which font is wrong at a glance.”

You wouldn’t be looking at the sheet, you would be running the model in your head.

That stopped being a stochastic function, with the error rate dropping significantly - to the point that making a mistake had consequences tacked on to it.


98% sure each commit doesn’t corrupt the database, regress a customer feature, or open a security vulnerability. 50 commits later… (which is like one day for an agentic workflow)


It’s only a 64% chance of corruption after 50 such commits at a 98% success rate.
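
For anyone checking the arithmetic, assuming the 50 commits fail independently: 0.98^50 ≈ 0.364, so the chance of at least one bad commit is 1 − 0.364 ≈ 64%.

    # Quick check, assuming independent commits at a 98% per-commit success rate.
    p_bad = 1 - 0.98 ** 50
    print(f"{p_bad:.0%}")  # -> 64%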


I would be embarrassed to be at OpenAI, releasing this and pretending the last 9 months haven't happened... waxing poetic about the "age of agents" - absolutely cringe and pathetic.


Or, as I like to put it: LLM output is essentially the Library of Babel. Yes, it contains all of the correct answers, but it might as well be entirely useless.


> A great use of AI in this situation would be to automate the collection and checking of data. Search all of the data sources and aggregate links to them in an easy place. Use AI to search the data sources again and compare against the spreadsheet, flagging any numbers that appear to disagree.

Why would you need AI for that, though? Pull your sources. Run a diff. Straight to the known truth without the ChatGPT subscription. In fact, by that point you don’t even need the diff if you pulled from the sources. Just drop it into the spreadsheet at that point.
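
The deterministic version is a few lines of Python - the URL and sheet name here are hypothetical, and writing .xlsx needs openpyxl alongside pandas:

    import pandas as pd

    # Pull straight from the source of truth; no model in the loop.
    source = pd.read_csv("https://example.gov/quarterly_figures.csv")
    source.to_excel("report.xlsx", sheet_name="figures", index=False)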


In reality most people will just scan for something that is obviously wrong, check that, and call the rest "good enough". Government data is probably going to get updated later anyhow; it's just a target for a company to aim for. For many companies the cost savings are worth much more than a slightly larger margin of error on some projections. Other companies will just have to accept several hours of saved time rather than the full day.



