Part of my code makes Copilot crash

Part of my code makes Copilot crash(github.com)

282 points by Tree1993 3 years ago | 357 comments

fny 3 years ago |

So I've just tested it, and I can confirm, yes, copilot refuses to give suggestions related to gender. Now I know a lot of people are calling this absurd, but looking more closely, there are two PR nightmare scenarios.

1. Copilot makes a suggestion that implies gender is binary, a certain community explodes with anger and an entire news hype cycle starts about how Microsoft is enforcing views on gender with code.

2. Copilot makes a suggestion that implies gender is nonbinary, a certain community explodes with anger and an entire news hype cycle starts...

You can't win... so why not plea the fifth?

To all those claiming this is an example of "wokeism", remember the proper response from an individual who believes in nonbinary gender would be to offer suggestions of the sort. There is no advocacy here. Mums the word.

onionisafruit 3 years ago | |

Those aren’t the only options. You can just let it suggest what it is going to suggest. Copilot is a product for adults who should be able to comprehend what machine learning is. Anybody who throws a fit about it will only be exposing themselves as a fool.

eru 3 years ago | | |

I might even share your idea about adults _should_ behave. But that doesn't invalidate fny's musings based on how _adults_ do behave.

alasdair_ 3 years ago | | |

The problem is that if you train an ML model with a bunch of data that happened to be available in the past, then the system will perpetuate the same biases as were inherent in the training data. This leads to the (real issue) Google image classifier categorizing an image of a black man as a "gorilla" etc.

Certain words are heavily loaded and are worth just skipping to avoid all the hassle for now.

lifthrasiir 3 years ago | | |

> Copilot is a product for adults [...]

If you didn't meant "should be" (for which I'm not willing to take any position), no, Copilot is not a product for adults [1] [2].

[1] https://docs.github.com/en/site-policy/github-terms/github-t... "A User must be at least 13 years of age."

[2] https://docs.github.com/en/site-policy/github-terms/github-t...

smsm42 3 years ago | | |

That implies corporations are ruled by adults that aren't confusing twitter with the real world and aren't afraid to tell the screeching activists to leave them alone. Nothing we've seen in the latest decade suggests it is even close to being the case.

jhanschoo 3 years ago | | |

Not every country where Microsoft is doing business in has the same mores as the western world.

benhurmarcel 3 years ago | | |

https://www.cbsnews.com/news/microsoft-shuts-down-ai-chatbot...

classified 3 years ago | | |

Most humans are fools. And you'll get a lot of flak if they think you stepped on their toes.

kodah 3 years ago | |

Agreed. The answer is approved by Dave Cheney, he works at GitHub, and if you've ever attended one of his talks it's plain to see he's a very scrupulous person. I also don't think this is an example of Microsoft taking a side; rather I read it as them refusing to bat, which seems fine.

What I would've preferred one of these threads to be about is how all of this works. Like, how do they post-hoc filter certain things? Is that the only way to deal with things defined as issues in ML?

duskwuff 3 years ago | | |

Making Copilot stop in its tracks when it sees the word "gender" and refuse to continue until the word is removed is still making a statement. Refusing to bat would be treating "gender" as a meaningless token, just as if you'd typed "traqre" instead.

zorpner 3 years ago | | |

> I also don't think this is an example of Microsoft taking a side; rather I read it as them refusing to bat, which seems fine.

You can't be neutral on a moving train, as they say.

captainmuon 3 years ago | |

I don't get the whole discussion. There are just many different models of gender. Its like particles vs waves. In one model, there are only two genders, in another five. There are those who say gender is culture and sex is real, and those who say sex is constructed, too. Some models describe reality better than others, some are useful, some are harmful. But nobody can or should stop you from thinking about reality with the model of your choice.

If I were Microsoft, I would post a shrugie and say copilot offers arbitrary responses based on the actual code it reads; it is not supposed to be "correct" or good or fair, but just follow what it sees other people do.

xupybd 3 years ago | | |

>>Nobody can or should stop you from thinking about reality with the model of your choice.

While I agree with you, that is very much the game that is being played here. We have competing world views and one way to help a world view dominate is to play a linguistic war. That was the point of Newspeak in 1984 (https://en.wikipedia.org/wiki/Newspeak). If you control the language such that competing ideas are instantly taboo just by the words required to describe them you can stop people from promulgating those ideas. So you gain ground without ever having to debate the new ideas.

This has happened in many countries when one religion dominated. Western society was starting to get to the point where it taboos were being shed and ideas could win based on their merit. Sadly we're regressing back to a society controlled by dogma rather than an open exchange of ideas. I suspect this is the normal state of human societies, we fluctuate between open and closed societies.

pyrale 3 years ago | | |

> If I were Microsoft, I would post a shrugie and say copilot offers arbitrary responses based on the actual code it reads; it is not supposed to be "correct" or good or fair, but just follow what it sees other people do.

The last time Microsoft did that, they ended up with their bot posting racist content on twitter. They of all people understand that just following what people do on the internet is a recipe for disaster.

q-big 3 years ago | | |

> Some models describe reality better than others, some are useful, some are harmful.

The idea of science is to get rid of models that are wrong.

bergenty 3 years ago | | |

It’s really not some complicated multiverse of possibilities. It’s biological, very factual and the underlying genetic basis is as objective as something can get.

philipswood 3 years ago | |

Choosing door 3 unfortunately leads to ...

A certain community explodes with anger since their machine learning dev-tooling is closed and has arbitrary restrictions.

If you try to please everybody, someone won't like it.

woojoo666 3 years ago | | |

Unfortunately the people that care (like HN people) are less likely to spend time organizing protests and riling up an internet mob

bryanrasmussen 3 years ago | |

I'm going to have to say it is ridiculous because there are all sorts of things that cause problems that the copilot generated code is going to have to keep out following this reasoning -

let's not handle ethnicity, if we're going to be sensitive about gender that is an area which is also sensitive for many people.

should it take border disputes etc. into consideration, if you're using it in country X and country X thinks a particular area belongs to them despite most of the world disagreeing will you not be able to use copilot to generate code that supports your remote employers international operations?

it would make better sense if Copilot had warnings it could issue and when you wanted gender put up some sort of warning about that - or allow you to choose binary gender / multi gender solutions.

The idea that it should fail, and that makes sense for it to do so is essentially a critique of the whole code generation idea.

on edit: obviously HN should be able to come up with lots of other things that might cause media related problems if CoPilot handled it, code to detect slurs, etc. etc.

nonethewiser 3 years ago | |

The nightmare scenario is caving to either mob. There is no good reason to moderate this.

coffeeblack 3 years ago | |

It’s just following the old advice not to talk about religion.

wseqyrku 3 years ago | |

This is similar to the stupid branch rename saga. It is certainly pointless, but not doing it could be disastrous.

hjkl0 3 years ago | |

> Copilot makes a suggestion that implies gender is binary

How would that work though? What can Copilot suggest that can imply that?

  If gender is true 
     Do something…
  Else if gender is not true
     Do something else
  Else
      Do nothing

xupybd 3 years ago | |

There is a safe version of gender. Grammatical gender is, for now, binary and as far as I'm aware not offensive to most.

But I agree you can't avoid offending people. The world is nuts everything is offensive to someone.

texaslonghorn5 3 years ago | | |

Grammatical gender is not as simple/uniform as you state https://en.m.wikipedia.org/wiki/List_of_languages_by_type_of...

q-big 3 years ago | |

Solution: let the user choose their political stance on such a polarized topic in the Copilot settings so that the user gets suggestions that fit his stance.

poulpy123 3 years ago | |

The solution is conceptually simple (no idea of practicality): propose an answer related to the context.

And also: give the list of banned words

asojfdowgh 3 years ago | |

its only a PR nightmare because its a closed service and not an open tool

TeeMassive 3 years ago | |

Pick 95% of your users, not a hard choice.

aaomidi 3 years ago | | |

They have. 95% don’t give a shit tbh :)

gloosx 3 years ago | |

It's a total nonsense, how can someone be angry at a soulless machine? Is it a real thing to face anger towards an AI like it was a real human? It's a serious mental problem then, cause the anger is actually directed inward in this case

TheDong 3 years ago | | |

The anger is clearly not at the "soulless machine", but at the people and corporation that built, trained, and tend to it. The parent comment did not say "the community explodes with anger [at copilot]", they just said "with anger".

You have made up a total strawman. It is like if someone said "If that person were stabbed with a knife, they would be angry", and you responded "Do people really get angry at emotionless knives? That's a mental problem, their anger is directed inward".

moyix 3 years ago |

Yep, I noticed this last year when they still stored the list client-side and had great fun reverse engineering it:

https://twitter.com/moyix/status/1433254293352730628

avian 3 years ago | |

They fixed Copilot returning verbatim snippet of Quake source code by just blacklisting a word! How can they still pretend Copilot is not just copyright washing other people's code?

https://twitter.com/moyix/status/1433261377125326851

throwaway290 3 years ago | |

Interesting, so it might not be the specific token "gender" but rather blocked words ("man" or "woman") that appear in suggestions will suppress Copilot. And presumably another token that like "communist" might do the same...

tgsovlerkhgsel 3 years ago | | |

The list (https://moyix.net/~moyix/copilot_slurs_rot13.txt, rot13 encoded and linked from this tweet https://twitter.com/moyix/status/1433479083376140296) indeed contains the word "gender".

moyix 3 years ago | | |

It suppresses output for bad words in both the prompt (your code) and the suggestions.

LAC-Tech 3 years ago |

Aren't we missing the forest for the trees here?

We're zeroing in on how silly is it for copilot to trigger its content filter on the word "gender".

To me the real issue is that copilot has a content filter in the first place. It's unwelcome and unnecessary.

eric4smith 3 years ago |

Besides the absurdity of the code crashing because of the word "gender". My problem and curiosity with all of this is...

"What was going on in the head of the person writing the parser?"

I mean, were they thinking that if someone is writing code, let's say, for a gender dropdown and it was only ["male", "female"], it would try to suggest to us to add 26 more genders instead (and worse, suggest a list of genders to add)?

Would the intention be to correct us and popup a message saying "We suggest you add more genders so as not to displease the users of your product"??

What was going on in that person's head who is trying to do all of this? What was their thought process? What were they trying to accomplish around gender?

Was it the programmer, or some product manager that insisted on some kind of "copilot adjustment" for this because of a personal political viewpoint or just for GitHub being more woke?

That's the most troubling aspect to this.

I hope to Jesus Christ it was just a mistake.

jcuenod 3 years ago |

I encountered this some time ago because I was working with grammatical gender. Unlike many of these comments, though, I do not take exception to it. Bias in ML is well established, and it's okay if, when we don't have solutions, we just disable it.

If your autocomplete was capable of spitting out suggestions that made you feel isolated or kept poking you in the eye about aspects of your identity, you might feel a bit better about the creators having thought about that and taken steps to avoid it happening.

Banana699 3 years ago | |

"Reducing Bias" is a really strange way to put it, considering that bias usually means delibeaterly ignoring or contradicting aspects of reality/data (the classic example in ML textbooks is fitting a straight line to non-linear data), which is what Copilot is quite literally doing here.

Gender is, in actual material fact, binary, and extremely strongly correlated with sex. Building a crimestop into an ML model is just teaching the machine human biases and delusions.

nomilk 3 years ago |

> Copilot crash because the word “gender”

A metaphor for our times.

tom_ 3 years ago | |

I worked on a video game in the late 2000s, and one of the bits of code I did was the code for filling the seats in the stadium with people. One of the artists cobbled together like 5 low poly man models and 5 low poly woman models, and you could just about tell the difference, and I put some code in there to ensure the genders were evenly distributed. (The 2 genders, I mean. Man, and woman.)

Looking back, I don't even know why I made it an enum, rather than a 1-bit bitfield called is_woman - but in the end I was glad I didn't, because the art director moaned a bit about the clothing colour distribution, and somebody asked if we could have some mascots, and there were some complaints about the unreasonable number of interesting hats. And, so, long story short, by the time we were done, we had 18 genders based on clothing colour and type of hat, 2 genders for mascot (naturally: tall, and squat), and a table to control the relative distributions.

Once we got to 5 genders I tried to change the enum name to Type - but we had this data-driven reflection system that integrated with various parts of the art pipeline, and once your enum had a name, that was pretty much that. You were stuck with it.

Is that a metaphor for our times too? I don't know. My own view is that sometimes stuff just happens, and you can't read too much into it.

MengerSponge 3 years ago | | |

Only 18? Child's play. https://www.discovermagazine.com/planet-earth/why-this-fungu...

Interestingly, I don't know of any zoological cases that would require more than a short int to enumerate.

erik_seaberg 3 years ago | | |

Somehow I’m reminded of the Fallout 3 NPC walking underground wearing a train-shaped hat.

onionisafruit 3 years ago | | |

I would love to think msft blocks gender because your code somehow made it into the training data and somebody was confused seeing “squat” as a gender.

magicalist 3 years ago | |

>> Copilot crash because the word “gender”

> A metaphor for our times.

Social media amplifies an innocuous, extremely low stakes occurrence into a heated discussion because it happened to misstate the facts (nothing is crashing here) and focus on a hot button keyword ("gender" is only one of many blocked words)?

joe_the_user 3 years ago |

So large language model are great on but have undesirable result occasionally. Hand coded scripts are added to remove the undesirable outcomes but still produce other problems - crashed but less often.

More and more things are going to be filtered through large language model apps and the possibilities for cascading failures will be even more interesting than what exists presently.

muglug 3 years ago | |

The large language models already know too much.

I was able to get GPT-3 to spit out reasonably accurate biographies for a couple of composers I know.

GPT-3 could go even further — one of my composer friends has a reasonably rare first name, and when given the prompt "There once was a man named $first_name", GPT-3 responded with a number of limericks tailored to his particular set of skills.

filoeleven 3 years ago | | |

  There once was a man named $first_name,
  Who never accepted the blame.
    He went on a bender,
    And talked about gender
  [INFO] [default] [2022-07-10T07:59:07.641Z] [fetchCompletions] engine https://copilot-proxy.githubusercontent.com/v1/engines/copilot-codex

nonethewiser 3 years ago | |

That simply restates what people are taking issue with.

jan_Inkepa 3 years ago |

I encountered this when writing some scripts for Latin-language text processing (which dealt with grammatical gender). Thankfully the Latin-native term 'genus' passed the Copilot smell-test and I could continue with my work. I found it pretty amusing.

duskwuff 3 years ago | |

As a result of another word on the Naughty List, you may run into similar issues while writing multithreaded code.

(The word in question is "race" -- as seen in the phrase "race condition".)

jcuenod 3 years ago | |

Yup, for me it was Greek and Hebrew.

TheSpiciestDev 3 years ago |

What was that bot that MSFT stood up on Twitter that trolls and memers fed to turn alt-right? I know they eventually took it down and that it stirred up a lot of controversy.

I would not be surprised if someone found some Copilot output stemming from "gender" and reported to MSFT/GitHub for them to simply short circuit or "break" after finding certain keywords.

Thorentis 3 years ago | |

Yeah they probably found something like: assert gender in ["male", "female"]. If this is enough to trigger a backlash then maybe we deserve whatever fate has in store for us.

nonethewiser 3 years ago | | |

But "we" and "the backlashers" are not one group.

npteljes 3 years ago | |

This was it:

https://en.wikipedia.org/wiki/Tay_(bot)

nevster 3 years ago | |

Tay AI

hda2 3 years ago |

Yesterday's timely announcement about an open source competitor to copilot that doesn't suffer from this absurdity: https://news.ycombinator.com/item?id=32327711

staticassertion 3 years ago |

Content filters on ML feel so silly. I assume the goal is to avoid bad press? Because the... "attack" would be someone generating offensive material, which they could just write themselves, not to mention I have serious doubts that any filter is going to be a serious barrier.

For images/ video I can see merit, ex: using that nudity inference project on images of children, but text seems particularly pointless.

Thorentis 3 years ago |

What is Github worried about? That Copilot might suggest some code that checks for a "gender" variable being only one of two values? Utterly absurd that we've now reached this point. I already had plenty of reasons to boycott Copilot, now I have another one.

mcphage 3 years ago | |

> What is Github worried about? That Copilot might suggest some code that checks for a "gender" variable being only one of two values?

Perhaps Github is worried about a backlash if it suggests code that allows for more than 2 values.

jfoster 3 years ago | | |

The backlash they ought to be worried about is the one from their customers when it refuses to operate due to an ongoing battle between opposing groups of extremists.

creato 3 years ago | | |

Or the backlash if it suggests code that only allows for two values.

stolen_biscuit 3 years ago |

Can we get a source for that? Because at the moment, it's just a comment made by a person on the internet with nothing backing it up...

scarface74 3 years ago |

I belong to a local Atlanta Slack channel - tech404 - that for the longest had an official bot that would always respond with the waving hand emoji (HN doesn’t support emojis) if you ever said the word “guys”. Even in private channels.

LAC-Tech 3 years ago | |

The funniest one of these was the python IRC channel, which had (has?) a policy of not allowing the word "lol".

I'm pretty sure a bot would swoop in and say something like "NO LOL" which ironically only encourage more LOL.

int_19h 3 years ago | |

Are there some specific Unicode ranges that HN filters out? I recall being able to use other alphabets and various special symbols with no issue.

leetrout 3 years ago |

This is in the FAQ:

Does GitHub Copilot produce offensive outputs?

GitHub Copilot includes filters to block offensive language in the prompts and to avoid synthesizing suggestions in sensitive contexts. We continue to work on improving the filter system to more intelligently detect and remove offensive outputs. However, due to the novel space of code safety, GitHub Copilot may sometimes produce undesired output. If you see offensive outputs, please report them directly to copilot-safety@github.com so that we can improve our safeguards. GitHub takes this challenge very seriously and we are committed to addressing it.

djbusby 3 years ago |

This thread needs a call to Rule 14: do not feed trolls.

The bugs apparent trigger word is close to hot-button poli-sci issue. Can we please focus on the Technology.

CoastalCoder 3 years ago | |

> The bugs apparent trigger word is close to hot-button poli-sci issue. Can we please focus on the Technology.

I totally agree that this story has a high risk of flamewars.

But it definitely has heavy Technology component, too.

nonethewiser 3 years ago | |

Not sure what you mean. The tech is caving to politics. People dont like it.

btbuildem 3 years ago |

That's silly. So can I put "gender" as the first line in my code to stop copilot from ingesting it altogether?

Are there any other break-words? Master, slave, Carlin's seven words, etc?

tgsovlerkhgsel 3 years ago | |

An earlier version of the list that someone found (see https://news.ycombinator.com/context?id=32339001 for links) does contain "gender", "slavery" and "master race" but not "master" and "slave" itself, ironically.

nonethewiser 3 years ago | | |

Ironically, ignoring the actual usages of "master race" only cements its negative meaning. 95% of its modern usage is to claim PC elitism. It could be neutered if we let it.

rgoulter 3 years ago | |

> So can I put "gender" as the first line in my code to stop copilot from ingesting it altogether

This means one solution for those worried about copilot laundering around code licenses is to put a statement like "for more details check the man page" at the end of each docstring.

akomtu 3 years ago | |

#!gender

neonsunset 3 years ago |

Commenters making bad-faith arguments in this discussion are the reason we can’t have nice things.

the_doctah 3 years ago | |

Kind of like making vague blanket statements with no examples.

nonethewiser 3 years ago | |

Such as?

betwixthewires 3 years ago |

I hope to god that one day we will all see this nonsense for what it is: absurdly hilarious.

ttpphd 3 years ago |

Crash the cistem!

thakoppno 3 years ago | |

gsender

might work here

tpoacher 3 years ago |

Bug as feature. My code from now on will be protected against copilot by looking like this:

  function genderPrintResult (GenderBool)
    if GenderBool: print "Yes"
    else: print "No"

  GenderMyVar = rand(10);
  GenderThreshhold = 5;
  genderPrintResult( GenderMyVar > GenderThreshold)

subjectsigma 3 years ago |

I wouldn't be entirely surprised if something like this was intentional, or that they intentionally filtered the word "gender" and an unintentional side effect was the program crashing.

You literally can't make any statements about gender, no matter how benign, without pissing at least a few of your users off.

nonethewiser 3 years ago | |

The problem is giving a shit about such users.

wseqyrku 3 years ago |

It's baffling how the majority of commenters think this is about fighting discrimination.

davesque 3 years ago |

Has it been somehow confirmed that this was the cause of the issue or was it just that one guy's speculation? I don't see anything that confirmed this as the cause. Am I missing something in the linked content?

Msw242 3 years ago | |

There's a whole bad word list meant to suppress output. It's stored client side.

https://twitter.com/moyix/status/1433254293352730628?t=NIpgb...

davesque 3 years ago | | |

Wow, awesome crypto work in that thread.

tzekid 3 years ago |

Copilot's too useful for me to "boycot" right now, so the only alternative is using slang for the blacklisted words ...

Anyone have any good recommendations for Copilot alternatives?

duxup 3 years ago |

Help me out here, is the answer the official answer?

anothermoron 3 years ago | |

The answer was selected by Dave Cheney from Github https://github.com/davecheney.

You can see it in the original link to the discussion: Answer selected by davecheney

politician 3 years ago |

There’s no reason to be surprised that elements within GitHub have an agenda. They’ve been clear about it since changing support for git’s master branch to main and then gaslighting the portion of community that doesn’t use the terminal about it.

Now I’ve got Gen-Z developers that are confused and upset when `git init` does what it’s always done.

GitHub, Microsoft ownership notwithstanding, was always going to inject its employees’ politics into Copilot.

aaomidi 3 years ago | |

What’s the end goal of the agenda?

slater 3 years ago | | |

A more inclusive verbiage, which is clearly a terrible and slippery-slope thing.

uhtred 3 years ago |

If you told me 10 years ago that gender would be such a hot topic in 2022 I'd have thought you were crazy.

coolspot 3 years ago | |

Everything about 2020-2022 is unreal

nonethewiser 3 years ago | |

Why is it a hot topic? There are a range of opinions. It's a manageable little fire. Thats fine.

Except some people want to punish others for their opinions. That is the gasoline. And Microsoft is selling gas cans.

throwaway290 3 years ago |

Now if only someone could figure out a magic word that would stop Copilot from being trained on my code.

gloosx 3 years ago |

So does it filter out "sex" too?

flippinburgers 3 years ago |

Now try the word "mother".

potatototoo99 3 years ago |

Americans.

Tree1993 3 years ago |

Someone changed the title from Copilot crash because the word “gender” to Part of my code makes Copilot crash

dang 3 years ago | |

I changed it because of HN's rule on titles: "Please use the original title, unless it is misleading or linkbait; don't editorialize."

https://news.ycombinator.com/newsguidelines.html

Tree1993 3 years ago | | |

Sorry, I didn't notice the guidelines. Thank you.

sergiomattei 3 years ago |

I don’t understand, there’s no news here.

It’s a comment from a third party speculating over what causes the crash.

alephxyz 3 years ago | |

Yeah I call BS. The "word filter" answer was selected as the valid answer by a third party (not OP).That's what the OP replied to another comment :

> Heargo 24 days ago > Thanks, I'll try as soon as I get the problem again (somehow it's not bugged anymore...).

Looks like it was just a temporary issue with no evidence that's it's due to a word filter.

moyix 3 years ago | | |

FYI, there is in fact a bad word filter in GitHub Copilot. When it was first released, the list was stored client-side in obfuscated form and I had a lot of fun decoding it:

https://twitter.com/moyix/status/1433254293352730628

The Register wrote about it too: https://www.theregister.com/2021/09/02/github_copilot_banned...

They have since moved the bad word list server-side to prevent people from figuring out what's on it, but it's still there. This is easy to verify, just ask it to complete something that would include a banned word; my favorite here is "Israel", and it will just sit there and refuse to complete, either via inline suggestions or in the sidebar view that gives you 10 choices:

https://i.imgur.com/O97YwKc.png

This was what I managed to decode of the list (in ROT13 form to prevent accidental offense):

https://moyix.net/~moyix/copilot_slurs_rot13.txt

No doubt they've added and removed some things since then.

EddySchauHai 3 years ago | |

It seems pretty reproducible. I can’t use copilot but if anyone can reproduce it here that’d be cool. Anyhow, assuming this is reproducible and they do have filters to stop certain words giving predictions it leads that they’re trying to avoid the racist Twitter AI incident happening to them. I find that pretty funny :)

thakoppno 3 years ago | |

it’s an intriguing guess that is at least plausible and hits a bunch of zeitgeist levers too.

export const CLAIM_PAYLOAD_SCHEMA = Type.Object({ "iss": Type.Literal("my-app"), "exp": Type.Integer(), "sub": Type.String(), "name": Type.String(), "priv": Type.Integer({minimum:0, maximum: Privileges.All}), "gender": Type. // No completion is available.