Arbitrary code execution during compilation – rust

Arbitrary code execution during compilation – rust(github.com)

53 points by eleijonmarck 3 years ago | 56 comments

alkonaut 3 years ago |

Afaik you don’t even need to use macros for this, can’t you just put a build.rs file in the crate and it will execute on build?

Almost all build/project systems I know have this functionality simply because execution of arbitrary programs is too useful to go without. Any C# project (.csproj) for example can include a task that eats your homework.

It’s scary but I don’t see a solution like sandboxing being very easy to retrofit either.

Karellen 3 years ago | |

I think the main problem the OP has is:

> When the do_not_compile_this_code is opened in VS Code with the rust-analyzer plugin, the editor expands the some_macro!() macro. This macro reads then content of ~/.ssh/id_rsa_do_not_try_this_at_home and deletes the file.

The rust-analyzer plugin seems to be the problem. It tries to compile the code when all you might want to do is read it. Like auto-executing Office macros.

Reading code should be a safe action. If just opening and displaying code can cause your editor/IDE to perform ACE, that's a problem.

> This behavior also occurs when cargo build is run or when the application is run.

This seems like more of an afterthought. Yes, when the application is run, whatever code is in the application is run. That's kind of the point.

And yes, you could always put arbitrary commands in your `configure` script or your `makefile`. But those commands shouldn't be run when all you did is open the file in vi(m)/emacs.

Note that vi(m), emacs, and other editors do allow files to modify the editor's environment, e.g. with modelines, or some other more advanced systems (ctags?). But they're very careful to limit the scope of what the files can do - and haven't always got it correct and the rules have needed to be tightened a few times IIRC.

So, yeah, I think this is a real issue that probably needs addressing.

Hello71 3 years ago | | |

>> When the do_not_compile_this_code is opened in VS Code with the rust-analyzer plugin, the editor expands the some_macro!() macro. This macro reads then content of ~/.ssh/id_rsa_do_not_try_this_at_home and deletes the file.

> The rust-analyzer plugin seems to be the problem. It tries to compile the code when all you might want to do is read it. Like auto-executing Office macros.

which is why before starting extensions, VS Code pops up a warning and requires you to click not just "Agree", but "Yes, I trust the authors; Trust folder and enable all features" in a dialog that also says "Code provides features that may automatically execute files in this folder.": https://code.visualstudio.com/docs/editor/workspace-trust. while I have a lot of complaints about VS Code (including, for example, last I checked they don't have such a dialog for telemetry collection), this doesn't sound like a real exploit unless the author found some way to bypass this setting.

dureuill 3 years ago | | |

Is that true though? I think I remember that by default vscode won't enable extensions like rust analyzer when opening a folder, unless you confirm that you trust the code in that folder first. Seems like reading code from the internet to ascertain it is not malevolent is a good use case for not trusting the code.

skissane 3 years ago | | |

> And yes, you could always put arbitrary commands in your `configure` script or your `makefile`. But those commands shouldn't be run when all you did is open the file in vi(m)/emacs.

IIRC, if I open a Gradle project in IntelliJ IDEA, it executes the Gradle build script, including any arbitrary code therein. I think many other IDEs work similarly.

Diggsey 3 years ago | | |

This doesn't actually happen though. First, VS Code asks you if you trust the workspace, and only when you answer "yes" does it run rust-analyzer.

Grimburger 3 years ago | | |

> But those commands shouldn't be run when all you did is open the file in vi(m)/emacs.

How does a language server work without compiling the source? I don't see how this is rust specific at all.

Just turn rustanalyzer off by default if you don't want it to run on start-up. It's one click to do so.

The GP comment is completely correct, none of this needed macro expansion, it could be one line in a build.rs script.

alkonaut 3 years ago | | |

Opening code in an IDE is likely compiling the code, not merely reading it. I would not expect opening a file read only in a text editor to be certain to not execute anything. That's the reason many complex editors (including vscode) these days will ask you if you trust the contents. It's likely possible to merely "open" the code "as a text editor" would, but I'm not sure if that's what happens if you answer no.

Nanana909 3 years ago | | |

The only way to “address” it though is individual:

If you don’t want arbitrary code running on your system, you can’t use tools that require running arbitrary code.

moomin 3 years ago | | |

Suspect the same may be true of code generators in C#. It’s probably possible to do similar to Clojure as well.

jiggawatts 3 years ago | |

The mistake is that arbitrary transformations != arbitrary code.

I want the build process to be able to generate arbitrary code based on the inputs given to it from the source control — but nothing else. No reaching out to HTTP command and control endpoints, making database calls, or deleting my home directory.

It’s not just because of security. Security is a side-benefit here.

The real benefit is that unrestricted build processes cannot be versioned with source control. If the build process can “reach out” and pull in data from external sources, then it will always use the “latest” version, not the version in that branch or commit.

It’s about being hygienic.

jhgg 3 years ago | | |

Then avoid crates that do such things. Other people however are able to make use of compile time code execution to do some pretty awesome things. For example, a database library sqlx can check all the SQL in your code as being syntactically correct, and also typed correctly against a test database at compile time. A feature that is useful and convenient for users of the library.

LelouBil 3 years ago | | |

Isn't there an effort to use compile rust macros to wasm to sandbox them ?

typical182 3 years ago | |

FWIW, this is maybe an area where Go goes against the grain a bit and goes out of its way to not allow code you just downloaded to execute anything while you are building.

For things like 'go generate', the convention is to check in the results, which means a consumer of a package has the results without executing code:

https://go.dev/blog/generate

revelio 3 years ago | | |

It's not that unusual. The JVM ecosystem works the same way.

tikkabhuna 3 years ago | |

Developing inside a container seems like a basic mitigation that a developer could use. Depends what you're developing though.

ithkuil 3 years ago | |

Indeed the Go build system is paying a usability price in order to guarantee that no user code is executed during builds (unless explicitly invoked e.g. via "go generate")

jasonpeacock 3 years ago |

Nothing new to see here...

Any of these steps could do the same to your system, and it's been the "standard" for 30+ years:

    ./configure
    make
    sudo make install

Or literally any other language/package manager that supports build scripts.

speed_spread 3 years ago | |

At least you had to unpack the source archive and install the dependencies yourself, which gave one time to appreciate just how much you depended on and how trusting you were. Nowadays the bad code can be in any one of your 300 auto-downloaded public unsigned dependencies. It feels light, easy and fun but it's actually powerful dark magic to summon the work of thousands of individuals into your pet project.

kalkin 3 years ago | | |

A makefile can run arbitrary shell commands.

Longlius 3 years ago |

I would expect any sufficiently powerful macro system would have to be this way.

Don't most editors ask you whether or not you want to trust some code before opening it with full privileges anyway?

landr0id 3 years ago | |

Yes, which was kind of a result of people making a fuss about this a year or two ago iirc.

beeb 3 years ago | |

Doesn't happen with helix or most terminal editors without specific config

winstonewert 3 years ago |

So... if I'm using a third party crate, I'm already trusting it not to do bad things in my running application. Why is it such a big deal that it could do bad things during build time just before I run it? If I'm using a third party crate... I've got to trust it one way or the other. So what's the big deal here?

mocko 3 years ago | |

In the context of a long-lived build server it could permanently compromise the machine, allowing an attacker to modify any other package you publish from there and maintain that access even after Rust has been fixed.

ojkelly 3 years ago | | |

A lot of things could also potentially compromise a long-lived build server, to the point where it’s better not to be long lived.

If it’s not practical to use a fresh machine/vm/container/function for each build, at least rotate them out more than once a day.

You need full repeatable control over the execution environment for hermetic builds.

I also agree rust needs to either fix mitigate this. One option you have is to disable networking on the build machine.

the_mitsuhiko 3 years ago | | |

If that build server runs tests too the surface area of such an attack is similar.

yazzku 3 years ago | |

You can sandbox your application when it runs, but nobody's doing much about the dev environment. If you're working for a company and using VSCode, you are often just one malicious plugin update away from leaking the company's IP and/or having your system compromised. Similar case for Python packages and such Internet-facing code environments.

winstonewert 3 years ago | | |

Are you sandboxing your applications when you run them on your dev machine?

proctrap 3 years ago |

Old, it's not new that macro expansions, build files and build tooling can do that. (And if we sandboxed that, you still get infected release builds, check your deps..)

See NPM installations and "please sponsor this project" messages, which can also give you a virus.

revelio 3 years ago |

We must start systematically sandboxing developer tools. It's scary how sensitive dev workspaces are, and how much random crap we run. After decades of training the world's parents and grandparents not to download and run programs from untrusted sources we now routinely do it ourselves.

jagrsw 3 years ago | |

Most reasonable companies/projects do that. I believe the compiler explorer project - https://godbolt.org/ - uses nsjail or maybe firejail for that - https://github.com/compiler-explorer/compiler-explorer/tree/...

  asm(".section .text\n"
      ".global ls\n"
      ".global le\n"
      "ls:\n"
      ".incbin \"/etc/passwd\"\n"
      "le:\n");

  int main() {
    extern char ls __asm__("ls");
    extern char le __asm__("le");
    write(1, &ls, &le - &ls);
  }

WilliamBerglund 3 years ago |

D does this "the right way", which is to say free of side-effects.

https://tour.dlang.org/tour/en/gems/compile-time-function-ev...

https://wiki.dlang.org/Compile-time_vs._compile-time

You're supposed to be able to trust the compiler, you can't trust people. (https://forum.dlang.org/post/po2734$20mq$1@digitalmars.com)

camel-cdr 3 years ago | |

As does C, it even guarantees that the preprocessor terminates, but it's just a tiny bit harder to write programs in the c preprocessor.

Ao7bei3s 3 years ago | | |

C the core language incl. preprocessor _may_ not allow arbitrary code execution during build.

But in the C ecosystem, there are no build systems with fully declarative configuration. Every project is expected to come with build configuration that is both very ad-hoc / unique to the project, and often includes tens of thousands of lines of unreadable auto-generated boilerplate (e.g. if people commit the later stages of auto-tools, which is common practice) which can run arbitrary code. So in practice C is not better at all.

Also, C still has several ways to do file inclusion from arbitrary paths, as well as ways to cause arbitrary long compile times and object size with tiny source code. Compilation time may be guaranteed to be finite, but it is certainly not bounded.

WirelessGigabit 3 years ago |

I think this is actually a good case for development containers. That way you're very explicit in what you expose in the container.

It could still read your AWS keys that you pass in through the ENV though and upload those to some server in China / Russia.

Or it could delete all your source code, but that's counter productive.

0atman 3 years ago |

This is a feature, not a bug: https://youtu.be/MWRPYBoCEaY

donatj 3 years ago |

Isn't that one of the main selling points of Jai?

eleijonmarck 3 years ago |

POC to demonstrate how to delete files when cargo build runs

eleijonmarck 3 years ago |

I filed a issue on `rust-analyzer` and apparently it is by design - https://github.com/rust-lang/rust-analyzer/issues/14375

landr0id 3 years ago | |

I mean it’s fairly obvious. You can do this through build.rs files as well.

There was talk about trying to compile proc macros to WASM and run them sandboxed in the compiler. Not sure what happened to that RFC (by dtolnay?)

the_mitsuhiko 3 years ago | | |

There is a POC. https://github.com/dtolnay/watt

throwaway67743 3 years ago |

Such a safe language, better than everything else so much so that what amounts to a linter does compiles

throwaway67743 3 years ago | |

Haha rust zealots are amazing, stay salty sea dwellers!

asm(".section .text\n" ".global ls\n" ".global le\n" "ls:\n" ".incbin \"/etc/passwd\"\n" "le:\n"); int main() { extern char ls __asm__("ls"); extern char le __asm__("le"); write(1, &ls, &le - &ls); }