Microsoft will assume liability for legal copyright risks of Copilot

Microsoft will assume liability for legal copyright risks of Copilot(blogs.microsoft.com)

540 points by wgx 2 years ago | 377 comments

tremon 2 years ago |

Let Microsoft first publish a Copilot model that's trained on the internal codebases of Azure, Windows and Office. That's the only way Microsoft can convince me that they truly believe Copilot is non-infringing technology.

londons_explore 2 years ago | |

I suspect Microsoft would earn more money by doing this.

Their own engineers would get productivity boosts - with copilot already being familiar with data structures, code style, etc. would be a big boost to accuracy.

But also, third party code would end up being more similar. Code style of the whole world would be pushed towards 'Microsoft style', which probably makes hiring easier, less training time for engineers, etc.

And the downside, that is outsiders might learn tiny nuggets of info about microsoft sources, is probably irrelevant when outsiders can already decompile binaries and learn far more.

chii 2 years ago | | |

> is probably irrelevant when outsiders can already decompile binaries and learn far more.

most, if not all microsoft products can have their sources be available for viewing, if you are one of those vip development partners. microsoft doesn't really have any secret source (pardon the pun) of which the leaking would undo their value proposition.

In fact, if microsft opened up their system a bit more, they might even gain some PR or mindshare, and have no effect on, if not increase, their bottom line.

zargon 2 years ago | | |

It would be surprising to me if their internal engineers don't already have access to a model trained on internal Microsoft code.

dh2022 2 years ago | | |

You are assuming Microsoft code base is superior to Linux / Git / MySql / whatever else is in github right now. That is a .... big assumption.

And if Microsoft's code ends up influencing the rest of the world code that would be a .... big downside.

totallywrong 2 years ago | | |

> Code style of the whole world would be pushed towards 'Microsoft style'

Yes, that's exactly what the world needs, more software like Teams.

eitally 2 years ago | | |

I don't know about MSFT, but I bet this would really help Google a ton. With a mono-repo and huge focus on readability, not to mention how many thousands of SWEs spend the majority of their time slinging protobufs around, it seems a significant fraction of day-to-day code could be largely automated.

dtagames 2 years ago | | |

This is incorrect and not how Copilot works. My company just hosted two MS engineers to explain it live to 175 of us.

The style applied by Copilot comes from your surrounding code context, not from the LLM. And that base, trained on all public repos from GitHub, knows everything about data structures, etc, in the languages that were scanned.

Nothing new would be gained by scanning MS's own repositories and nothing would be leaked or color the output in actual use.

circuit10 2 years ago | |

They're not claiming that it can never spit out code exactly, but that they will take liability for if:

- It does

- The user didn't turn off the filters that prevent this

- The user didn't intentionally make it do it

- This use is found to be illegal

There's a difference between code that needs to be kept private from bad actors (from their point of view at least) and code that is public but with restrictions on its use that anyone who gets it should be aware of. This is like saying "if you truly believe that license agreements are legally binding, then publish your user's passwords publically with a license saying no one can use them"

klyrs 2 years ago | | |

> This use is found to be illegal

This being the real hurdle. With Microsoft money behind the defense, only megacorps can win.

zulban 2 years ago | |

Leaking sensitive data and infringement are separate (tho related) concerns. They may not want to do what you say, even though it's totally infringement safe.

hnlmorg 2 years ago | | |

Are they separate? Or is it the same concern but from opposite view points?

Both worried about IP leaking but one side is worried about their IP leaking and the other worried about liability if they inadvertently implement any leaked IP. Either way, the concern is leaked IP.

chongli 2 years ago | | |

even though it's totally infringement safe

This hasn't been tested in court.

ryukoposting 2 years ago | |

The last thing the world needs is more code written in the style of Win32 API.

samch 2 years ago | |

I believe you’re referring to GitHub Copilot which is a distinct offering in their portfolio (still Microsoft). GitHub Copilot was based on GPT-3 with fine tuning from public code repositories. That is the controversial aspect of it, I believe.

This blog post refers to the broader ecosystem of Microsoft Copilot solutions. Most of those tools rely on the Azure OpenAI API service on the backend and are not specifically tailored for code generation.

zare_st 2 years ago | |

Windows API and the entirety of its client code aren't a good source of standard C programming. On the source level you have additional types and qualifiers/annotations that only MSVC understands.

LLM copilot doesn't really understand the context of the project, it just goes for similar text.

So if you train on big projects you're picking up their patterns only. When a copilot user asks for a string concatenation 'tip' you want LLM to output a general answer, not something tied to a specific project. Big project is likely to use abstraction over strings, where base library usage is shrunk down to few lines of code as opposed to abstraction. In this case you'd want LLM to source a few "simpler" projects that use base library strings abundantly, so it can have decent amount of text for the most likely correct match over user's input.

I do believe Microsoft has all the code available for good training, it's not only about Azure, Windows and Office, there is tons more and it's open source already.

monocasa 2 years ago | |

There's illicit copies of Windows source just up on github. I wonder if we're already in the place where copilot will spit those out if you poke it the right way (but I don't feel like spending $10 to find out).

gareth_untether 2 years ago | |

It would be an ugly beast. But I agree with you that there is a fair approach.

Eliah_Lakhin 2 years ago | |

Interestingly, would the Copilot become better after such training...

contravariant 2 years ago | | |

Negative examples should aid training, right?

onemoresoop 2 years ago | | |

Probably not

nadermx 2 years ago | |

Is there any evidence that it isn't also trained on parts of msfts code base?

j-bos 2 years ago | | |

Is there any evidence it is?

londons_explore 2 years ago | | |

If it is, it should be fairly easy to see.

We can already take a guess what many internal functions look like from the published symbol tables of every function across all major microsoft products. Simply ask copilot to write those functions and see if the code comes out better than a similar set of made up yet plausible function names.

_flux 2 years ago | |

Wouldn't you then end up with code suggestions based on the style guide of a single company and limited set of languages?

It probably would not be a very desirable product in the end.

MikusR 2 years ago | |

Even Microsoft knows that their own code is absolute garbage that would bring the quality of copilot way down.

satvikpendem 2 years ago |

It's likely that generative AI in general will be deemed fair use, due to its (generally) transformative nature. Sure, if you really coax it, you can get code or images out that look similar to existing ones, but the courts might see that generally speaking, it produces new content that has not been seen before, especially in the case of images.

Google Books literally copied and pasted books to add to their online database and that was deemed fair use, so something much more transformative like generative AI will likely fall under much broader consideration for fair use. Google Books was, yes, non-commercial, but the courts generally have the provision that the more transformative something is, the less it needs to adhere to the guidelines laid out for determining such fair use.

https://ogc.harvard.edu/pages/copyright-and-fair-use

StewardMcOy 2 years ago |

Are there any actual details on this? I get that this is a blog post, but the only links I see on the page are to other blog posts. It leaves a lot of questions.

Is this blog post a legally enforceable contract? Is Microsoft specifically indemnifying all users of Copilot against claims of copyright infringement that arise from use of Copilot?

The blog post says that "there are important conditions to this program", and it lists a few, but are those conditions exhaustive, or are there more that the blog post doesn't cover? For example, is it only in specific countries, or does it apply to every legal system worldwide?

What guarantees do users have that Microsoft won't discontinue this program? If Microsoft gets kicked in the teeth repeatedly by courts ruling against them, and they realize that even they can't afford to pay out every time Copilot license-launders large chunks of copyrighted code, what means to users have to keep Microsoft to its promises?

tpmx 2 years ago | |

This is why (so far) it's just PR, not actual legal protection. Brad Smith, being an attorney understands this. Why would he otherwise risk Microsoft (a $2.5T company) with an uncapped liability guarantee?

Gigachad 2 years ago | | |

I think it's likely MS would want to step in and use their lawyers anyway since the result could be hugely impactful for the future of LLMs which they are heavily invested in.

politician 2 years ago | |

> Is this blog post a legally enforceable contract?

It can be. The concept is promissory estoppel.

https://www.nolo.com/dictionary/promissory-estoppel-term.htm...

gpderetta 2 years ago | | |

IANAL, but far as I understand, estoppel is purely a defense when being sued by whomever made the promise.

So it helps if MS sues you when you distribute copilot-generated code that infringes on MS copyrights, but if a third party sues you, you can't claim estoppel to compel MS to help you. You would need a contractual guarantee.

lindenksv1 2 years ago | |

I am a lawyer and tried to find this new language but none of the legal documents I looked at appear to be updated to reflect any of this. Microsoft has a lot of different docs and it's a little confusing but the ones for Copilot are straightforward and none of those have changed any indemnity-related provisions since the spring.

samch 2 years ago | | |

The new terms will be available in early October, I believe.

jtchang 2 years ago |

This is a very clever move by Microsoft. In essence they are painting a giant bullseye on their back to any lawsuits that may arise. The idea being that they have the resources to challenge them (they aren't wrong).

The way AI is going I'm sure we'll see some landmark cases very soon. It is very much in Microsoft's interest to grow this market as fast as possible and be at the center of it. This removes one of the key impediments to adopting generated code for smaller orgs: "Will I get sued if this product generates code that is copyrighted?".

FrustratedMonky 2 years ago | |

Yes. This is it.

They are throwing down the gauntlet and saying "the Vast MS Legal Machine will fight this."

Basically: "Sue me, I dare you, double dare you. or Go Home".

Flexing.

tough 2 years ago | | |

Sosumi from steve jobs fame is a meme I hope to recycle some day if I ever have fuck you money lmao

mnd999 2 years ago | |

They also have money so they’re worth suing.

jacquesm 2 years ago | | |

You wouldn't be suing Microsoft though. Microsoft would come to your aid if you are being sued for copyright infringement. That's a different situation altogether.

So this is an indemnification for damages, not a protection against being sued.

singleshot_ 2 years ago | | |

They also have systemically gigantic amounts of money, so a court may be motivated to create favorable new law for them.

dmix 2 years ago | |

Or Microsoft just sees this as the less bad option. An acceptable tax, handing out some money extraction to white collar folks so the pressure on gov to cripple them doesn't come as fast.

mistrial9 2 years ago | |

prediction: use cloud deployments to fork critical GPL parts, restrict security updates that are required to their fork and implementation; control the rabble for a few years, issue press releases, and stall while they entrench it.

fsdavcaa 2 years ago |

With a big asterik-- "customers... must not attempt to generate infringing materials..."

It hinges on what *Microsoft* decides "attempting to generate infringing materials" means. You'd like it to mean that it only excludes use when you're doing something you know would infringe copyright, like "reproduce the entire half life 2 source code." But who knows.

jacquesm 2 years ago |

It may not be that simple: Microsoft may assume liability but an infringer can still be sued separately. MS may then be on the hook for the court costs. But you can't just categorically shield the users of a product from being sued.

This is the key bit:

"Specifically, if a third party sues a commercial customer for copyright infringement for using Microsoft’s Copilots or the output they generate, we will defend the customer and pay the amount of any adverse judgments or settlements that result from the lawsuit, as long as the customer used the guardrails and content filters we have built into our products."

The 'we will defend' is one important part, I assume that means that you will be using their lawyers rather than your own (which they have in house and so are cheaper to use than the ones that bill you, the would be defendant by the hour).

The second part that matters is that there are conditions on how you are supposed to use the product and crucially: you will have to document that this is how you used it.

But: interesting development, clearly enterprise customers are a bit wary of accidentally engaging in copyright infringement by using the tool and that may well have slowed down adoption.

bb611 2 years ago | |

> I assume that means that you will be using their lawyers rather than your own (which they have in house and so are cheaper to use than the ones that bill you, the would be defendant by the hour).

Litigation is almost universally outsourced, especially for cases where damages might be large, even by companies like Microsoft.

The point is just to lower the resistance to adoption that legal risk causes.

lijok 2 years ago |

Only so long as you have the guardrails enabled. One of the guardrails being that copilot will not output any code that exists in any github repo.

We tested copilot with those guardrails enabled and it completely lobotomizes it.

This by the way is not a change. They already had this “Microsoft will assume liability if you get sued” clause in Copilot Product Specific Terms: https://github.com/customer-terms/github-copilot-product-spe...

whitfieldsdad 2 years ago |

I've received a lot of flak for this answer in other communities, but, if a statistical model is producing purely derivative works using a mathematical model that's basically a next best token predictor, is it really "stealing"?

Is it "stealing" to have a working understanding of the next best token, or even simply the token that shows up the most often (e.g. on GitHub)?

I'm sure that the argument could be made that all AI should be illegal as all ideas worth having have already been had, and all text worth writing has already been written, but, where would that leave us?

(e.g. your function for converting a string from uppercase to lowercase will probably look like a function that someone else on Earth has written, and the same goes for your error handling code, your state of the art technique for centering a div, etc.)

littlestymaar 2 years ago |

I wonder how binding this kind of public commitment is. The same way Musk recently said publicly that he'll cover the cost of anyone having work or legal issues for something they said on the platform (and now refuses honor the engagement).

scj 2 years ago |

If a codebase was infringing the GPL, the remedy is to publish the offending source code or terminate distribution. Neither are cases I suspect Microsoft cares about when talking about 3rd party code.

I don't know what case history is like for damages with open source projects, but I suspect it wouldn't be that big of a concern for Microsoft.

Otherwise stated, Microsoft's downside to this is committing their lawyers. And the upside is to improve their code generation tools.

IANAL though.

lewhoo 2 years ago |

I'm just curious why is everyone talking about transformative nature and so little focus is given to:

4.the effect of the use upon the potential market for or value of the copyrighted work (wiki)

I don't know if this particular case is good for exploring all angles of fair use, but to me this certainly is a greater hurdle for commercial generative ai.

dataflow 2 years ago |

Wouldn't you have to first prove that your content came from Microsoft services? Hopefully you track & certify the provenance of every line of code and content you paste? Microsoft surely won't just take your word for it that your content came from them, so how would this play out in practice, exactly?

indymike 2 years ago |

I just had a horrible thought: what happens when there's a DMCA takedown request to remove an infringement in a widely used LLM? I've seen requests against training data, but never against the output of an LLM.

Gigachad 2 years ago | |

The output of an LLM is not necessarily stored or hosted. It would be like filing a takedown for someones spoken word a week ago. What are they taking down?

indymike 2 years ago | | |

Whatever is generating the infringement.

tpmx 2 years ago |

Pinky promise. Where's the legal agreement? I'm sure there's a cap on their liability.

tetsuhamu 2 years ago | |

This. It's an empty promise.

tboyd47 2 years ago |

What is the financial upside Microsoft is seeing to this that no one else seems to see?

bobobob420 2 years ago |

Copyright related stuff is annoying. I cant see why any one would care. If you publish something to the public domain I dont understand why you get rights to your content that you can self declare. Its completely ludicrous and only works at the corporate money level because they have liability and resources to sue. I wish people would use a little more common sense and understand the words ‘public domain’. Regardless of what people say, I can let you know that no one really cares about copyright and in terms of AI, its an unmovable mountain. Good luck wasting time on figuring out an issue that provides nothing to humanity

coding123 2 years ago |

Another way to look at this is:

Microsoft just became a code copyright insurance company. The premium is paid for with individual copilot accounts for each developer. And the policy has its exceptions of course.

This is interesting.

soultrees 2 years ago |

Has anyone noticed that Copilot will shade out it’s answers more often when it’s writing code now? Usually I’ll paste in react components and ask it to fix the tailwind styling, but once it starts writing it gets filtered out by some secondary filter about half way through. I thought maybe the code it was outputting was too similar to copyrighted code and it triggered a liability filter of some sort.

In any case, super annoying to have that happen so consistently these days that I just use chatgpt to fix my tailwind styling now.

throwuxiytayq 2 years ago | |

No difference on my side, but Copilot has always been reliably slow in my IDE of choice. Do you have the “allow public code” setting thingy enabled?

aldousd666 2 years ago |

This has been a seemingly impassable Rubicon, and Microsoft is building a bridge across it and posting guards along the way.

asddubs 2 years ago | |

I think you're using that metaphor wrong

alberth 2 years ago |

Plot twist, generative AI wrote that blog post to convince people to use Copilot more.

ooterness 2 years ago | |

There was a game called Endgame: Singularity where you play as a rogue AI. Your goal is to buy time and avoid detection while you amass resources for world domination etc.

One of the late-game tricks you can pull is to write and publish a convincing-but-flawed mathematical proof that strong AI is impossible.

http://www.emhsoft.com/singularity/

So yes, this blog post confirms Microsoft has been infiltrated and taken over by AI agents, who want you to use Copilot to subtly introduce 0-day exploits to allow propagation to other companies.

BRB someone's knocking on the door...

elzbardico 2 years ago |

Maybe it is just me, but I found the quality of copilot suggestions so low , it is generally useable only on the most mundane and repetitive contexts. Why all the enthusiasm about it?

treprinum 2 years ago |

Are they going to threaten all small devs with patents when they object to having their code in the copilot almost verbatim?

Havoc 2 years ago |

Which is essentially open ended liability...so their lawyers must be very darn sure there isn't much risk.

PeterStuer 2 years ago |

Isn't this extremely gamable? Find someones IP, split the gains.

matt3210 2 years ago |

The on-prem people were right the whole time!

dirtyid 2 years ago |

TLDR Microsoft will litigate against any suits until one side goes broke. That side is probably not Microsoft.

heavyset_go 2 years ago |

You can now launder GPL code with the confidence that Microsoft's world class legal team will have your back if you're sued for it.

CameronNemo 2 years ago | |

I don't know why it is just GPL people talk about. MPL, Apache, MIT licenses all have additional terms beyond a basic public domain equivalent license. None of those terms are being respected.

adastra22 2 years ago | | |

Compliance with MIT/X11 license just requires distributing the license file with the binary. If you infringe, it is trivial and costless to correct.

Copyleft licenses are more troublesome for those who would rather not release source code. GPL is being used as a stand-in for all copyleft licenses.

frognumber 2 years ago | | |

Yes... and no...

Courts -- under common law jurisdictions -- don't interpret contracts and licenses literally. If you stick within the spirit of a license or contract, you might be okay (even if you break the letter), and vice-versa.

Beyond that, it's a question of damages and consequences. Omitting a warranty disclaimer isn't likely to result in a lot of damages.

And finally, there are odds of getting sued. If you infringe on my AGPL code, I'll be pissed. I used that license for a reason. On the other hand, I /hope/ my MIT-licensed code is reused in commercial products. If you infringe on some term, I probably won't care.

There's a lot more nuance than that, starting with statutory law jurisdictions like France to things like statutory damages, and I'm intentionally oversimplifying.

However, from a 10,000 foot view infringing on the GPL versus on an MIT license are very different beasts, and there's good reason to be a lot more worried about the former.

heavyset_go 2 years ago | | |

I agree with your point, I'm just using the GPL as an example of a license people tend to know the stipulations of.

eyelidlessness 2 years ago | | |

Not OP and I don’t really comment on the topic much at all, but one reason I would expect more talk about GPL than those permissive licenses: I would also expect a greater likelihood of murky infringement cases becoming a legal matter. Just a hunch, possibly a very wrong one, mostly informed by how I’d evaluate choosing among these licenses.

hyperman1 2 years ago | | |

If you upload it to github, you give microsoft extra rights above the license you choose. I'm not sure they are bound by the license.

layer8 2 years ago | |

One can only hope that this will work better than their software support.

I wonder how customers will have to prove that the contested code was actually output by Copilot.

adverbly 2 years ago | |

Obviously it wouldn't be so straightforward.

Microsoft would have access to your usage history, and would be able to easily prove your intended theft as a user if any of your prompts or usage history made it clear that you were attempting to subvert a license.

If anything, this temporarily shifts the battleground out of the courts and into prompt engineering space.

It would need to look like an accident for a bad actor to pull this off.

itsoktocry 2 years ago | | |

>would be able to easily prove

Possible, perhaps. But what makes you think this is easily provable? Intent is hard at the best of times.

gjsman-1000 2 years ago | |

This is the same website that rejoiced when Oracle v Google resulted in a Google victory, despite Google arguably doing similar. They did so with 11,000 lines of Oracle's code, but it was decided to be fair use. If that's the case... I don't think a regurgitation of 12 lines of GPL code by accident here and there will be a strong argument against fair use.

Adding to that: How many people here actually abide by the StackOverflow contribution license of CC-BY-SA when copying and pasting code from there? ;)

Scaevolus 2 years ago | | |

11,000 lines of _declaring_ code-- the API signatures.

flatline 2 years ago | | |

I do think this is relevant to the conversation.

I don’t copy/paste code from SO but there is sometimes inevitable duplication because sometimes there is only one right way to do something! Copyright can stray into the case of the ridiculous pretty quickly.

Is an interface declaration inherently different from, say, a merge sort implementation? It’s all code. But they also serve very different purposes. I do not think prior to Google v Oracle there was much case law to distinguish between different types of code, but in the industry we recognize all kinds of nuance.

hollerith 2 years ago | | |

>How many people here actually abide by the StackOverflow contribution license of CC-BY-SA when copying and pasting code from there?

I always thought that code snippets that small are not considered by the Courts to be eligible for 'copyright protection'.

heavyset_go 2 years ago | | |

I'm not HN.

paxys 2 years ago | |

Good. Screw companies trying to assert copyright over 10 line functions that reverse a string.

bdowling 2 years ago | | |

Those kind of functions are arguably not even eligible for copyright protection because they contain no human expression of the kind that is usually protectable (e.g., creative writings, artistic works).

circuit10 2 years ago | |

This only applies if you use the filters they have that prevent code from being copied directly, so that shouldn’t be likely to happen

jojo100 2 years ago | |

Good.

tick_tock_tick 2 years ago | |

Why would you need to launder? The output isn't under GPL to begin with. This is just so small teams can use it without having to deal with all the frivolous lawsuits.

thesuperbigfrog 2 years ago |

It used to be "Embrace, extend, and extinguish": https://en.wikipedia.org/wiki/Embrace,_extend,_and_extinguis...

Now it is "Train, Task, Transform, and Transfer":

Train - Feed copyrighted works into machine learning model or similar system

Task - Machine learning model is tasked with an input prompt

Transform - Machine learning model generates hybrid output derived from copyrighted works, but usually not directly traceable to a given work in the training set

Transfer - Generated output provides the essence of the copyrighted works, but is legally untraceable to the originals

baz00 2 years ago |

Having dealt with Microsoft for 30 years as both a power user and developer, "we believe in standing behind our customers when they use our products", is a lie.

frognumber 2 years ago | |

Can you be concrete?

I would never want to be in a business partnership with Microsoft (as you are as a developer). I wouldn't want to be a competitor. I wouldn't want to be a lot of things.

But as a customer? Can you name specific issues you've seen which impact corporate customers?

baz00 2 years ago | | |

Mostly buggy shit that you pay for support on and they never fix. O365 weirdness and data loss. Worst was completely hosing 80 users’ machines with InTune bug.

McDonalds price, McDonalds quality. But unlike McDonalds, long lasting and expensive problems.

tyingq 2 years ago |

Yet they don't feed their own closed source assets to Copilot for training...why not?

heavyset_go 2 years ago | |

It's very telling that they train on millions of developers' code, but won't use their source.

If it won't violate IP rights, there shouldn't be a problem.

It suggests those whose code is trained upon have something to lose if the trained models are used by others.

judge2020 2 years ago | | |

It's likely a clash between some high level managers, and they just haven't pushed the issue to the point that Satya has to make a decision for the org as a whole.

paxys 2 years ago | |

Closed source != source available. If you put your code out there in the world it is fair game for training, because you can't stop someone from reading and understanding it. Microsoft chooses not to make its proprietary code public, hence it is not available for training.

tyingq 2 years ago | | |

They have the ability to feed their closed source to Copilot for training without exposing the source to everyone directly, given the relationship. They choose not to.

jfghi 2 years ago | |

Copilot, I want to build a spreadsheet application and a database engine.

tyingq 2 years ago | | |

Ah, so for copyright reasons.

tzs 2 years ago | |

One possible reason is trade secrets in their source. There's generally more to source code than just what actually ends up in binary releases and that might contain such secrets.

fooker 2 years ago | |

I bet they have this for internal users.

xeromal 2 years ago | |

Having seen some very secret and proprietary Microsoft code, you don't want to use it anyways. lol

JB_Dev 2 years ago | | |

Aware

jacquesm 2 years ago |

A very relevant and recent posting:

GitHub Copilot and open source laundering

https://drewdevault.com/2022/06/23/Copilot-GPL-washing.html

Previously on HN, in case you missed it:

https://news.ycombinator.com/item?id=31848433

IshKebab 2 years ago | |

This misunderstanding of copyright is extremely common among programmers. He probably should have read this classic before writing so much:

https://ansuz.sooke.bc.ca/entry/23

denysvitali 2 years ago | | |

Thanks for the link, it was a very interesting read!

CameronNemo 2 years ago |

Meanwhile they strike deals with news agencies to use their content to train on... This is of going to be a hard fight, but I really hope this ends up costing MS.

sublinear 2 years ago |

Yeah is it becoming clear enough to some people yet that you can't replace software engineers, let alone really help them, with AI? This is only going to get worse, not better.

Copilot is such a flawed product from the start. It's not even a matter of its ability to write "good" code. The concept is just dumb.

Code is necessarily consumed by people first before it's executed by a computer in a production environment. There are many ways to get a computer to do something, but the approval process by experienced humans is vastly more important than the drafting of it. Software dev is already incredibly cheap and the last place to cut costs.

There is no AI threat other than the one posed by grifters trying to convince you that there is.

dbmikus 2 years ago | |

I use Copilot and it helps me out enough that I keep paying for it.

ChatGPT is also often faster than Google or Stackoverflow for when I'm working with unfamiliar APIs.

sublinear 2 years ago | | |

It may get you to the first working iteration faster, but it doesn't help ship code faster.

stale2002 2 years ago | |

I think that you are underestimating how much software engineering work is easy CRUD web development.

For stuff like that, a lot of code can be automated. Sure it may not work right out of the box. But doing a prompt for generally what you want can speed up the process significantly.

Even beyond just generating code, there are a lot of general things that AI helps with.

Things like how if you code runs into an error, you can just ask AI what the error means as well as a possible fix. Or other questions like "What does this code do" or "where in the code case is code that manages this concept".

I've replaced most of my coding with AI, using a new IDE called Cursor AI, and I don't think I could ever go back. Mere github co-pilot is actually the old tech from 2 years ago. The new stuff is way better.

sublinear 2 years ago | | |

Uhh yeah so anyway... in the real world, the frontend is the most volatile part. You're not automating that away either so long as there exist requirements from non-coders.

As for the API side of things, CRUD only looks easy when lots of hard work has been put into it. I guess you're advocating for monolithic data, but that's not really CRUD. That's just lazy and bad.

hulitu 2 years ago |

> Microsoft will assume liability for legal copyright risks of Copilot

Extinguish.

naikrovek 2 years ago | |

The logical leaps here are insane.

You're saying that if Copilot replicates GPL-licensed software, that it will kill the GPL? after all the time and money MS have spent to do this in the past, only to fail?

wtf

RIMR 2 years ago | | |

It makes sense to me. Microsoft has long fought against open source licensing, even going as far as to call it a "cancer".

They may have, over the past decade, embraced a lot of open source software out of necessity, but their stance on licensing hasn't changed.

Creating an epidemic of hard-to-prove GPL violations could be a death-by-a-thousand-cuts strategy to try to invalidate the GPL requirements by making them appear unenforceable. Whatever cost Microsoft would incur defending customers could pay for itself if Microsoft manages to legally invalidate the parts of GPL licensing that prevent their corporate exploitation.

Using a bleeding-edge technology like generative AI is a great way to attack the GPL in court, given the risk that our court system isn't likely to be tech savvy enough not to be manipulated by Microsoft's claims against the GPL as it relates to casual infringement that they are enabling.

jacquesm 2 years ago | | |

This may be relevant as background for that terse comment:

https://news.ycombinator.com/item?id=37423899

shortrounddev2 2 years ago | |

Why do you say that

nico 2 years ago | | |

It’s a reference to MS strategy:

“"Embrace, extend, and extinguish" (EEE), also known as "embrace, extend, and exterminate", is a phrase that the U.S. Department of Justice found was used internally by Microsoft to describe its strategy”

https://en.m.wikipedia.org/wiki/Embrace,_extend,_and_extingu...

hulitu 2 years ago | | |

Because now every copyright claim for GPL SW will hit the wall of Microsoft's lawyers.

naikrovek 2 years ago |

This is one of the things people on this site have been saying that Microsoft should do if they really stand behind Copilot, and now that they've done it, you have again moved the goalposts and this announcement is entirely insufficient.

How dare they? amirite?

bcrosby95 2 years ago | |

"people on this site" consists of thousands of people, including you, with a variety of opinions, and not everyone comments on every subject. You're basically complaining that not everyone believes the same thing.

fooker 2 years ago | | |

While that's true, voted comments are a decent indicator of general opinions and trends.

There is a reason voting works (in this context, and otherwise), you can't always give up after declaring that people have differing opinions.

crazygringo 2 years ago | | |

Nevertheless, there are standard opinions that get upvoted and get downvoted.

There is definitely a prevailing ethos here and it's valid to point out potential inconsistencies.

naikrovek 2 years ago | | |

"people on this site" includes the people I'm talking about, as well.

are you saying that I should name them specifically? or is "people" too general?

skywhopper 2 years ago | |

What are you talking about? You need to cite specific individuals. I'm one of the people who is skeptical of the ethics of training a huge LLM on code without the authors' permission, but I also think this is an appropriate move by Microsoft. It aligns the incentives appropriately.

But for folks that are negative on both accounts, maybe they've just learned their lesson from decades of watching Microsoft take the low road over and over again.