The value of the space is the ability to glean the commit author's thoughts at the time they made the commit.
What this tool emits is extremely dissimilar to those thoughts.
The value of what this emits is already handled by evaluating the diffs in the per-file history.
It's not good to throw this sort of thing over the fence, and justifying it as saving your precious time doesn't change that.
It's better to leave the message blank. A tool like this looks perfect for helping someone avoid scrutiny while simultaneously avoiding providing even a tiny depiction of what they were thinking when they committed the change, at the expense of injecting vast amounts of noise.
I would be more interested in a hybrid approach: an AI that, when it has low confidence in a generated commit message, asks the user for input ("Was this change meant to fix a bug? y/n"), and that suggests splitting commits ("You changed 6 HTML files and 1 SQL file, and they seem unrelated. Should I split this into two separate commits? y/n").
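A rough sketch of what that confirmation loop could look like, in Python; `generate_message` is a hypothetical stand-in for whatever LLM call such a tool would make, and the confidence threshold is invented for illustration:

```python
# Hybrid flow sketch: only bother the user when the model is unsure.

def generate_message(diff: str, hint: str | None = None) -> tuple[str, float]:
    """Hypothetical LLM call; returns (draft message, confidence in [0, 1])."""
    draft = "update query handling"  # placeholder model output
    if hint:
        draft = f"{hint}: {draft}"
    return draft, 0.4  # low confidence on purpose, to exercise the prompt below

def confirm(question: str) -> bool:
    return input(f"{question} y/n ").strip().lower() == "y"

def message_for(diff: str) -> str:
    draft, confidence = generate_message(diff)
    if confidence < 0.5:  # arbitrary threshold for this sketch
        if confirm("Was this change meant to fix a bug?"):
            draft, _ = generate_message(diff, hint="fix")
    return draft

if __name__ == "__main__":
    print(message_for("--- a/report.sql\n+++ b/report.sql"))
```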
The value of the space is whatever the user/team finds valuable.
You almost had the right idea there: the value of what this emits is really in the summary of diffs. I'm certainly not going to go through each commit and read the diff each time I look at the log, but I still want to understand what happened and be able to find individual commits. If extra information about the author's thoughts is just not available, I'd much rather have summaries than a blank log of "WIP" comments.
It's absurd to gatekeep commit messages to only "the thoughts of the author", even if that's what usually goes in there. A good diff summary might even be more useful than a ramble that doesn't mention important changes.
But you see, now my LLM can use fewer tokens to parse these commit messages instead of the entire code base, to then boil it all down to "85-- minor-- inconsequential-- commits emdash emdash emojji"
> The value of what this emits is already handled by evaluating the diffs in the per-file history.
I mean, for the initial development/contribution/PR workflow, I agree with you: any code reviewer should be reading the diffs anyway, and if you're reading the diffs, these messages (being purely summaries derived from the code itself without the LLM having any info about developer intent) don't add anything.
But that's not the only time commit messages matter. A tool like this, "fixing up" bad commit messages before they're pushed to a PR branch, might still help with later code maintenance after the code is merged:
• When you or someone else is looking at the commit lines after the fact, in e.g. `git log` to find commits to cherry-pick, such summaries would be a substitute for having to go commit-by-commit reading the diffs to find the one you're looking for. Or when doing e.g. a `git bisect`, they'd allow the likely-offender commit to "jump out" at you from the list of remaining commits, after just the first few bisect steps, without having to do 10 more iterations to narrow it down with actual rebuilds+test suite runs.
• and when someone else is looking at `git blame` while bug-hunting, or seeing the latest commit that touched each file when browsing a github repo tree, having these summaries would be the difference between having an opaque timeline of "fix" -> "fix 2" -> "fix again" -> "update" -> "fix" commits to try to keep distinct in one's head (may as well just try to recognize commits by the abbreviated git ref at that point), vs. having commits with descriptive mnemonic "names".
Note that this tool is supposed to be retroactive, not incremental. It rewrites messages for existing commits, that already had some other message when they were initially committed; it doesn't have any function that you could use to do `EDITOR=this-program git commit` and have it generate a commit's original commit message just-in-time.
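(Tangentially: git does support that just-in-time shape natively via its standard `prepare-commit-msg` hook, so an incremental variant is easy to imagine. A minimal sketch in Python, where `summarize_diff` is a hypothetical stand-in for an LLM call, not anything this tool provides:)

```python
#!/usr/bin/env python3
# Sketch: save as .git/hooks/prepare-commit-msg (and chmod +x).
# Git runs this hook before opening the editor, passing the path to the
# draft commit-message file as the first argument.
import subprocess
import sys

def summarize_diff(diff: str) -> str:
    """Hypothetical stand-in for an LLM call that summarizes a diff."""
    first = diff.splitlines()[0] if diff else "empty change"
    return f"draft summary: {first}"

msg_path = sys.argv[1]
staged = subprocess.run(
    ["git", "diff", "--cached"],  # what this commit will actually contain
    capture_output=True, text=True, check=True,
).stdout
with open(msg_path, "w") as f:
    f.write(summarize_diff(staged) + "\n")
```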
As the author says in the README, this tool was created with the goal of fixing one's private git commit history before making it public. At the point when this tool would be run (i.e. at the point a developer is trying to "clean up" their private git history for publication), it's often been long enough since you created these commits that you likely don't actually remember what you were thinking at the time you created them. Any information about "what [you] were thinking when [you] committed" has already been lost. "The rice has been cooked", so to speak.
At that point, there's no value you could add by going back over the commits manually, beyond that which this program could add. In both cases, whether you or the LLM is doing it, the result would just be a reconstruction (i.e. a guess) at what the original developer was thinking/trying to accomplish at the time.
---
Which is not to say that this tool is a substitute for making good commits (instead of random-junk-drawer snapshot-your-work "WIP commits") in the first place. And, in fact, I have a feeling that this tool is not nearly as widely applicable (at least in its current form) as its author thinks it is... because "WIP commits" don't generally have any good way of being summarized: they're often not coherent single-purpose edits of the code.
A better version of this tool, I think, would be one that rewrites a private work branch / PR branch by first squashing it into a patch, and then breaks that patch back apart into a series of commits that each "do one thing", essentially introducing the change in a literate programming style where you're meant to read the commit-series top to bottom.
In other words, exactly what an experienced software engineer who knows they'll need to maintain their own code will already do themselves at commit time with `git add -p` (and, before commit time, by having the discipline to avoid getting distracted solving a second problem while in the middle of solving the first!). And it's also exactly the format that patch-based workflows like LKML already expect contributors to construct [from whatever they were doing internally] when submitting a patch series, to allow the patch to be read and considered on the mailing list as essentially a linear literate-programming explanation of the changes.
The tool still wouldn't be able to recover the intent behind each patch in the series if that intent was never recorded; so the commits would still just be descriptions of the kind a later "software archaeologist" or reverse-engineer would give to the code. But it would at least be generating descriptions for coherent chunks of code (see the sketch below), rather than attaching descriptions to commits that were essentially "whatever probably-partial progress on whatever incoherent set of things the programmer was trying to do at the time they needed to sync their work to switch from their laptop to their workstation."
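A back-of-the-napkin sketch of that squash-then-resplit shape, driving plain git commands from Python; the per-file grouping is a deliberately naive placeholder for the genuinely hard part (deciding which hunks belong together), and it ignores untracked new files:

```python
# Squash a work branch into its combined diff, then rebuild it as a
# series of commits. One commit per file stands in for smarter grouping.
import subprocess

def run(*args: str) -> str:
    return subprocess.run(args, capture_output=True, text=True, check=True).stdout

def resplit(base: str, branch: str) -> None:
    run("git", "checkout", branch)
    # 1. "Squash": reset to base, leaving the combined changes in the worktree.
    run("git", "reset", "--mixed", base)
    # 2. Re-split: one commit per touched file (misses untracked new files).
    for path in run("git", "diff", "--name-only").splitlines():
        run("git", "add", "--", path)
        # A real tool would generate a meaningful message per chunk here.
        run("git", "commit", "-m", f"introduce changes to {path}")
```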
I agree with everything you're saying, and I think other archaeologists feel the same.
My opinionated take is: I wouldn't want to use this space for the information that this tool could provide, and would rather leave it as the truth.
The truth is that it was committed without a meaningful message, and now I might recognize a chain of message-less commits, representing a moment in time where the authors were trying to figure out where they wanted to end up.
If the tool is producing this info simply by reading the diffs in the code, why not just use it when you need it, to help explain what you're digging through, instead of changing the commit history?
Either way, the critical detail is: People should get that detail out before the rice has been cooked, and that's what I do for myself, in my own private repos, and when others do that for future archaeologists, we all benefit.
AWS can be used in a different, cost-effective way.
It can be used as a middle ground capable of serving the existing business while building towards a cloud-agnostic future.
The good AWS services (s3, ec2, acm, ssm, r53, RDS, metadata, IAM, and E/A/NLBs) are actually good, even if keeping track of their billing changes is a concern.
If you architect with these primitives, you are not beholden to any cloud provider, and can cut over traffic to a non-AWS provider as soon as you’re done with your work.
Let me explain why we’re not talking about an 80/20 split.
There’s no reason to treat something like a route53 record, or security group rule, in the same way that you treat the creation of IAM Policies/Roles and their associated attachments.
If you create a common interface for your engineers/auditors, using real primitives like the idea of a firewall rule, you've made it easy for everyone to avoid learning the idiosyncrasies of each deployment target, to feel empowered to write their own merge requests, and to review the intended state of a given deployment target.
If you need to do something provider-specific, make a provider-specific module.
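As a toy illustration of that common-interface idea (the names here are invented for the sketch, not any particular IaC tool's API): describe the primitive once, and let provider-specific modules do the translation.

```python
# One generic "firewall rule" primitive, rendered per deployment target.
from dataclasses import dataclass

@dataclass
class FirewallRule:
    description: str
    protocol: str      # e.g. "tcp"
    port: int
    source_cidr: str

def to_aws_security_group_rule(rule: FirewallRule) -> dict:
    """Render as an AWS security-group ingress permission."""
    return {
        "IpProtocol": rule.protocol,
        "FromPort": rule.port,
        "ToPort": rule.port,
        "IpRanges": [{"CidrIp": rule.source_cidr, "Description": rule.description}],
    }

def to_gcp_firewall(rule: FirewallRule) -> dict:
    """Render the same rule for a GCP compute firewall."""
    return {
        "name": rule.description.lower().replace(" ", "-"),
        "allowed": [{"IPProtocol": rule.protocol, "ports": [str(rule.port)]}],
        "sourceRanges": [rule.source_cidr],
    }

ssh = FirewallRule("allow ssh from office", "tcp", 22, "203.0.113.0/24")
print(to_aws_security_group_rule(ssh))
print(to_gcp_firewall(ssh))
```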
> I understand your situation is a bit unique, where you are unable to log in to your AWS account without an MFA device, but you also can't order an MFA device without being able to log in. This is a scenario that is not directly covered in our standard operating procedures.
> The best course of action would be for you to contact AWS Support directly. They will be able to review your specific case and provide guidance on how to obtain an MFA device to regain access to your account. The support team may have alternative options or processes they can walk you through to resolve this issue.
> Please submit a support request, and one of our agents will be happy to assist you further. You can access the support request form here: https://console.aws.amazon.com/support/home
Thanks for this article. The nuance described here feels like a huge problem that needs discussing.
There will be people who will carelessly create unmaintainable software, and they might do it to bamboozle someone who is paying for the work that experienced people can deliver.
Others might fire the sorts of people who would do that sort of work, and try to survive without paying the going rate.
These things are objectively bad.
It is not bad to use these tools, or any others, to do the best work you can provide.
This is entirely predicated on the issues this person experienced. Irrespective of whether or not devops teams end up with solutions that look like this, none of them are meant to look like this.
My first experiences had to do with the ability to add new services, monolith or not, and have their infrastructure created/modified/removed in an environment- and region-agnostic way, and to be able to safely allow developers to self-service deploy as often as they want, with the expectation that there would be metrics available to observe the roll-out, and a way to safely revert without manual intervention.
If you can't do this stuff, then you can't have a serious posture on financial cost while also providing redundancy and security, or operating independently of one cloud provider or one specific region/datacenter. Not without a lot of old-school, manual systems-administrator work. DevOps hasn't gone away; it has become the standard.
A bunch of pet servers is not going to pass the appropriate audits.
> adopt much more simple and easy-to-troubleshoot workflows like "a bash script that pulls a new container".
This style of thinking lets developers learn every nitty-gritty pain of why we have frameworks. I see the same challenge with static site generators: they're all kind of annoying, so people write their own, but then theirs is even worse.
There were more peripherals to worry about, and fewer generic drivers.
It's not like people are trying to individually purchase PCI sound/Ethernet/modem/graphics cards; they just get these components from the southbridge and don't think about it at all. This makes things much easier, and gives a clear target for community notes/contributions.
I suspect that the reason he describes this systems-admin work as exhausting is because we've moved on. Personally, I only care about the thing I'm doing with the computer I happen to be sitting at, as opposed to the computer itself.
Separately, power management is not a new detail: https://www.lpthe.jussieu.fr/~talon/pentiumII.pdf
> The Pentium II processor supplies a STPCLK# pin to enable the processor to enter a low power state. When STPCLK# is asserted, the processor puts itself into the Stop-Grant state. The processor continues to snoop bus transactions while in Stop-Grant state. When STPCLK# is deasserted, the processor restarts its internal clock to all units and resumes execution. The assertion of STPCLK# has no effect on the bus clock.