Gptcommit: Never write a commit message again (with the help of GPT-3)

Gptcommit: Never write a commit message again (with the help of GPT-3)(zura.wiki)

125 points by zurawiki 3 years ago | 100 comments

ketralnis 3 years ago |

I don't really ever want to read answers from GPT to questions that I didn't knowingly myself ask GPT. If GPT can write a commit message from you, don't write it at all and let me ask it that if that's what I want. It may be a positive to you to spend a few seconds less on commit messages but it's a net negative for the world for it to become polluted with vast amounts of flatly incorrect text with no knowledge as to its provenance. I'd rather have no commit message than one that I can't trust whether it's written by the same human that wrote the code or not.

Put another way, you asking GPT for stuff that it learned from Stack Overflow: good. Using it to post to Stack Overflow: bad.

tomtomtom777 3 years ago | |

For me the point of this demo is that even a good commit message is often redundant information.

As programmers we learn that adding a comment like:

   // The limit is 16

    const SOME_LIMIT = 16

is bad because is redundant information that serves no purpose to the reader and can easily misalign in the future.

So what's a good commit message for changing this limit? Ideally we want to describe why we've changed it but this information isn't always available so even when we're avoiding redundant comments we often use redundant commit messages like "increased SOME_LIMIT" to make browsing through history easier for others.

As we do not need to provide this information (it is already in the code), it seems like a reasonable idea for an AI to help us provide it.

xg15 3 years ago | | |

I don't think the situation is comparable. The comment is redundant because typically you see the commented code right next to it, so reading the code is about as much effort as reading the comments.

In contrast, commit messages often stand alone: If you browse the history, you only see the messages, but now a large number of them; if a commit changes more than one file, the commit message has to sum up the changes from all files.

In all those contexts, a simple, high-level description of what has changed can be enormously helpful.

TeMPOraL 3 years ago | | |

> Ideally we want to describe why we've changed it but this information isn't always available

I struggle to imagine situation in which this is the case. Surely, even in the worst case of you being told to make a particular change with no explanation given, you can at least drop a "increased from 5 at a request of ${name of your boss}", or "increased from 5, see ticket #${ticket number}" in a comment, and/or a commit message.

throwanem 3 years ago | | |

If you don't know why you're making the change, you are not ready to commit the change.

issa 3 years ago | | |

usually the more important information in a comment is WHY the code does what it does.

bee_rider 3 years ago | |

In general I’m pretty skeptical of the ability to get anything deep out of these chat bots, but I think it is wrong to say that the generated commit message is worse than none. The programmer still read the generated message and OK’d it. So, it tells us something about their intent, in the sense that they thought the message summarized it sufficiently (or, they could just OK without reading it, but that’s just lying with extra steps, they weren’t trustworthy in the first place).

nottorp 3 years ago | | |

> The programmer still read the generated message and OK’d it.

You think? For some programmers writing commit messages is like ... i don't know because i'm not one of them... some kind of torture?. I bet the kind of person who likes this service would otherwise put in blank commit messages or at best ticket IDs.

hgsgm 3 years ago | |

GPT-assisted commit messages are fine if the user takes responsibility and gets consequences if they publish bad data, in proportion to the volume of bad data they publish.

dheera 3 years ago | |

Except for startups when commit messages are more like "asdf", "aoeu", "quick fix", or "demo" because some investor barged in and demanded a demo before they would wire funds.

If ChatGPT could change that to something like "disable current limits" or "disable safety checks" or whatever that might be marginally better.

TeMPOraL 3 years ago | | |

A ChatGPT-generated message, pasted without editing, is purely functional transformation of the code, adding zero information. This means I could just as well run it on your diff myself, if I thought it would be useful. More than that, when I do it a year or two after you made your commit, the then-current ChatGPT will likely do a much better job at summarizing the change. So perhaps it's best to leave auto-summarization to (interested) readers, and write a commit message with some actual information instead.

eternityforest 3 years ago | | |

asdf is still better than having lies sprinkled in randomly.

Maybe prefixing them all with gpt: would help

llukas 3 years ago | | |

This indicates commit quality. Why lose this info? If you have only time to put "aoeu" into commit message would you have time to correct ChatGPT output? ;)

javier2 3 years ago | | |

this is just normal every day commit messages in most startups I've seen

teddyh 3 years ago | | |

Haaaaaaaaands

— https://xkcd.com/1296/

pachico 3 years ago |

I find commit messages have more value when they don't just repeat what you can see by looking at a diff but when they explain the reasons behind.

jart 3 years ago | |

The point of a summarization model is if you have a thousand line change, it helps to have a one sentence explanation of what it is. The demo videos the author used here really don't do a good job communicating that, because the summary GPT-3 wrote for his one line commit was longer than the commit itself.

TeMPOraL 3 years ago | | |

Right, and even if GPT-3 could summarize the thousand-line diff in a sensible way, without introducing any falsehoods, it would still be strictly worse than the developer writing a sentence explaining what they think they've accomplished with the commit.

It's just the same thing as with comments and "self-documenting code". The code tells you what (and if written carefully, it may be even somewhat effective at it). It can't tell you why. Neither can a GPT-3 summary of it.

ape4 3 years ago | |

I was hoping GPT-3 was going to give the reasons

waynesonfire 3 years ago | | |

yeah and find the bugs.

smashedtoatoms 3 years ago |

Because what we need is more of the what was done, with no regard to the why. Why provide any context as to why the change was made when you can fill it with an AI description of what one could accurately tell by looking at the code? I kinda can't believe this isn't a joke. Just squash it to the emoji that best captures the sentiment! Why use the tool to enhance you and your peers lives, when you can use AI to make it pointless!

NBJack 3 years ago |

Neat concept, but this opens up a can of worms for corporate security. Pretty sure I won't get approval to submit proprietary code to a third party service just because I was too lazy to write a few lines of text. Might be helpful to open source projects?

xrd 3 years ago | |

Just add fully homomorphic encryption.

I agree with you, but I'm assuming this could just send a diff and that context would be small enough to not leak.

Then again, if GPT can keep track of all the diffs...

TeMPOraL 3 years ago | | |

I don't think the kind of diff you'd want to use GPT-3 to summarize would also be small enough to not leak company IP.

xg15 3 years ago | | |

...and ask OpenAI to reimplement the entirety of ChatGPT to work with homomorphic encryption.

UncleMeat 3 years ago | | |

FHE is slow as shit. Good luck running models at any reasonable pace. Somewhat Homomorphic Encryption is not useful since you've got way too many multiplies on floating point numbers.

polemic 3 years ago |

The very last thing you should do is commit a GPT-3 generated commit message for a fairly simple reason: if GPT-3 can interpret and and explain the change as written, there is no reason to commit that message. You will always be able re-run the generator at any later date, over any range of changes, to get the same or (presumably, in future) improved results.

As pointed out by other comments, the commit message should be telling you facts about the change that are not evident from the change itself. GPT-3 can't tell readers why the change happened.

0x000xca0xfe 3 years ago |

Writing commit messages (or comments in general) is like practicing vocabulary, but for your mental understanding of the current problem.

Taking a step back and thinking about what I have actually done often helps me to find misconceptions, the worst bugs of them all.

Automating this away would be like learning a foreign language by pasting book exercises into a translation app... you may get good grades, but does it help your understanding if you didn't put in the effort yourself?

chrismorgan 3 years ago | |

Yep. More than a few times I’ve finished a piece of work, and in writing the commit message explaining the whys and wherefores, realised my solution was actually flawed, or that a better solution was possible, and so thrown the entire thing away and started again. I love writing commit messages.

jim-jim-jim 3 years ago |

In the early days of Covid, the web was awash with all sorts of stupid fucking designs that reimagined public space under the new normal or whatever. It was chaff that creators and readers alike knew would never be put to practical use, or even be produced in the first place. There's a good writeup about it here.

https://mcmansionhell.com/post/618938984050147328/coronagrif...

I think the same phenomenon is at play here. Everybody sharing their own silly parrot tricks: it's the least interesting topic in the world right now.

Rogach 3 years ago |

I don't want to debate the presence or absence of merits in this tool (these are extensively covered in other comments), but I want to point out that even in the demo examples 2 out of 3 commit messages are plainly incorrect:

- in Demo 1 tool wrote "Switch to colored output..." while in the diff we can see that colored output was already present;

- in Demo 3 tool wrote "Add installation options and demo link to README", while in the actuall diff we only see a link being added, no changes to installation options.

Props to the author for being honest and not cherry-picking the examples.

haney 3 years ago |

This is interesting but I’d hate to work on a project where this was used. Commits should tell me why a change happened not just what code changed.

gkfasdfasdf 3 years ago |

To everyone hating on this...I think a GPT-3 summary of a diff is a great thing to have, because it's a summary of the change and thus can be quicker to grok than picking through a diff. Also this doesn't seem to preclude a developer adding their own text to the commit (the why, etc). Finally, if the summary looks weird/incoherent it could serve as a signal to the developer that the commit needs more work.

funcDropShadow 3 years ago | |

It is not about hate. If there is tool, perhaps GPT-3, that is really good a summarizing code diffs. It should be integrated in your IDE or other tooling to summarize diffs on the fly, when I need that summarization. Not when I commit a diff. Thereby, we could all profit from improvements of that tool over time, and everybody could use it in his or her own language. That is strictly better than running that tool once and integrating it hard with the source code.

SketchySeaBeast 3 years ago |

I can kind of understand getting help writing the description of a large PR. But a commit message? Whose commits are so long so often that they need the help of an AI assistant to come up with the contents?

deathanatos 3 years ago | |

Heh… there are really two types of coders. Those who things commits should have a single, obvious, minimal purpose, and who will split off unrelated changes into separate commits…

… and those who tag you as a reviewer on +8,298, -1,943 commits/PRs with the commit message "JIRA-PROJ-84138".

satvikpendem 3 years ago | | |

> … and those who tag you as a reviewer on +8,298, -1,943 commits/PRs with the commit message "JIRA-PROJ-84138".

At my workplaces, we've told people who do this to break up their larger commit into smaller ones before reviewing. If they haven't done that initially, well, their life is going to get harder for a few days.

TeMPOraL 3 years ago | | |

There is a third type of coder: one doing commits with single, obvious, minimal purpose, that still sometimes end up being +8,298, -1,943 - but with a sensible, detailed message explaining what's being done and why.

This happens in environments where it takes hours for CI to let your change pass, making small commits prohibitively expensive in terms of time and infrastructure.

(And yes, I know the answer is: make it so CI that's part of review takes minutes, not hours.)

wprl 3 years ago | | |

Hey, at least they referenced the (hopefully appropriate) JIRA ticket!

darekkay 3 years ago |

With WhatTheCommit [1], I never have to come up with commit messages again. /s

I even wrote an IntelliJ IDEA plugin 9 years ago [2]. Half as a joke, half to learn about IDEA plugin development. I'm puzzled by seing so many people actually using it. Last month the HTTP link became invalid, and soon after someone opened a PR with a fix. I really hope noone actually uses those commit messages on shared repositories.

[1] https://whatthecommit.com/

[2] https://darekkay.com/blog/what-the-commit-plugin-for-intelli...

yowlingcat 3 years ago |

The worst part about GPT-3 is people using it to automate things where the entire value comes from what the human annotates rather than automates. This is an idea, which like many others involving GPT-3, which I believe will destroy more value than it creates.

zactato 3 years ago |

Did the OP use the tool to write his own commit messages?

A lot of the commit messages were typical and sort of redundant but this one stood out to me https://github.com/zurawiki/gptcommit/commit/82294555e7269e6...

"Add github token to address GH Workflow rate limits"

This is a good commit message, it describes a problem and a solution. I'd be very impressed if the GPTCommit tool wrote this and knew why the github token was being added.

avgcorrection 3 years ago |

There are tools that I wish didn’t exist and this is one of them.

FastEatSlow 3 years ago |

Perhaps this could be more useful if it could be fed information from a bug tracker, so it could use the context to create a meaningful (if inaccurate) commit message.

tjpnz 3 years ago |

If you're unable to write your own commit messages that's a strong signal to me that either your commits are too large or that you're unable to explain in simple words what you just did. While the first can be remedied I would find it hard working with someone who consistently displayed the second.

AnimalMuppet 3 years ago |

1. If automating writing commit messages significantly improves your experience as a developer, you're doing something wrong.

2. If GPT-3 can write commit messages even close to as clear as you can, you're doing something wrong.

warkanlock 3 years ago |

The peak of human society right here

micimize 3 years ago |

Comments here are acting like you can't add/edit the commit. It offers a starting point. Yes it's just-above-diff level, but it is at-least-above-diff level.

But my main though is that IDK about using this for anything closed source. Feed openai's API your codebase, one commit at a time. Even if they promise not to train on your prompt history today, ToS could change. Seems fine if you run it locally though.

xg15 3 years ago |

Fun fact: you can probably turn this around too: Write a fictional commit history and have ChatGPT generate the actual commits for you.

joshe 3 years ago |

This is fun.

Would also be cool to generate commit messages while viewing history, it could really do a good job of orienting you. I'm imagining "human commit msg | gpt commit msg" so you can look at both. It's a little simplistic right now, kinda just describes the diff, but GPT-3.2 could rock.

rawfan 3 years ago |

At least the first line of commit messages shouldn’t describe WHAT changed but WHY the change was made.

nora-puchreiner 3 years ago |

I was wondering if there is a possibility of obtaining an offline version of the service, in order to mitigate the inherent risks associated with transmitting proprietary code to external servers, thus ensuring optimal security and confidentiality of said code?

failuser 3 years ago |

Cool, but I hope his is never used as is, just submit with some keyword and call the latest version of GPT on the diff when looking through the history later. A bad commit message is worse than no message and it can’t be easily fixed.

hooande 3 years ago |

I like writing commit messages. I find it helps me to think through and explain the change that I'm committing. personal quirk: for major commits I'll add fun ascii art, just as a treat

abi 3 years ago |

If you're looking for a Python variant of this tool: https://github.com/abi/autocommit

coding123 3 years ago |

The repo for this isn't eating it's own dogfood.

ilikehurdles 3 years ago |

Might as well commit “I don’t remember writing that commit” because that’s going to be your every answer when someone has a question about what you did.

jupp0r 3 years ago |

This is horrible. Commit messages should contain the reason why this change has been made and not imprecise prose summaries of what the diff looks like.

imiric 3 years ago | |

The comments here are acting as if the messages can't be changed. As someone else mentioned, this should be used as a starting point to summarize the change, but the reason for the change obviously can, and should, be added by a human.

This is far from horrible.

jupp0r 3 years ago | | |

It adds 100% of what I would point out in code reviews to be removed from the commit message. It incentivizes an anti-pattern, so yes it's horrible.

dragonwriter 3 years ago |

Be more impressed if I write the commit message and GPT writes the code than vice versa.

If I wrote the code, writing a commit message is trivial.

boardwaalk 3 years ago | |

You can do approximately that with GitHub Copilot already: Write a comment and have Copilot write the function or what have you to match.

BuckyBeaver 3 years ago |

I'll write a shitload of commit messages before I'll give OpenAI my phone number.

LeicaLatte 3 years ago |

I don’t get the hate. Don’t use it all the time, but this could be useful as part of a danger report.

A readable summary for the ones who may not understand code - your developer will never write that.

tobyhinloopen 3 years ago |

Why are the demos videos?

sigmonsays 3 years ago |

this is awful.

xrd 3 years ago |

Now do this for branching strategies.

This is amazing. Humans should only need to read commit messages, never write them.