Debunking that C++ is faster and safer than Rust

Debunking that C++ is faster and safer than Rust(viva64.com)

143 points by payasr 6 years ago | 81 comments

saagarjha 6 years ago |

> As you can see, the documented behavior and the absence of undefined behavior due to signed overflows do make life easier.

Not having undefined behavior does make life easier, but having it be defined and then giving the example which benefits from the way that Rust chooses to define it is not really fair.

> With less effort, Rust generates less assembly code. And you don't need to give any clues to the compiler by using noexcept, rvalue references and std::move. When you compare languages, you should use adequate benchmarks.

Actually, the issue is that C++ just can't match Rust's semantics here. By default it will allow for exceptions, by default it will copy; if you watch the talk rvalue references cause the double indirection and fixing this would require some changes in the language to accommodate the use case.

mehrdadn 6 years ago | |

> the example which benefits from the way that Rust chooses to define it

I'm actually struggling to see what the practical benefit is in having it wrap around. The program is still producing garbage at that point, which you're not handling, so why not let the compiler just forget about that case just like you already did?

steveklabnik 6 years ago | | |

The practical benefit is not in the wrapping specifically. The difference is in it being UB vs not UB; that is, overflow is a "program error" not UB, and so the language defines how implementations must handle this error.

You aren't supposed to rely on this semantic, as it's an error. If the checks get cheap enough, rustc will also check in release.

saagarjha 6 years ago | | |

I mean, if you're doing two's complement arithmetic as the example relies on then it's certainly useful. If you want any other behavior, then it is not. (Rust's differing behavior between different optimization levels IMO makes it basically impossible to use usefully, FWIW.)

oconnor663 6 years ago | | |

> what the practical benefit is in having it wrap around

There's a pretty big practical difference between "getting an unexpected numerical result" and "letting an attacker steal my TLS keys and mine bitcoins on my machine."

zozbot234 6 years ago | | |

There is a real advantage to wraparound in that one can understand add, sub and mul as ring operations (as in common arithmetic), which makes it easier to understand the semantics of more complex expressions.

Reelin 6 years ago | |

> Not having undefined behavior does make life easier

I have mixed feelings about this. It seems like many examples of undefined behavior are things that don't make sense to do (ie ought to result in an error) which would often be undecidable at compile time. Runtime error checks would incur performance penalties so it's up to to programmer to include them.

The borrow checker is an obvious advantage here. Beyond that, I guess you could do many of the same things in unsafe blocks and they could cause problems just as easily. If they end up breaking things no one will blame the language because the tin was clearly labeled.

I do prefer language designs that make it possible to write things that are safe by default. It just seems like many problems are misattributed to undefined behavior but are actually due to systemic issues in the design of the language.

thayne 6 years ago | | |

> Runtime error checks would incur performance penalties

Which is why in rust, there are many runtime checks done in debug builds but not in release builds. (Including checking for wraparound and bounds checks).

The idea being that automated tests should catch the errors, assuming you wrote good tests.

dathinab 6 years ago |

> The bug has been present in LLVM since 2006. It's an important issue as you want to be able to mark infinite loops or recursions in such a way as to prevent LLVM from optimizing it down to nothing. Fortunately, things are improving. LLVM 6 was released with the intrinsic llvm.sideeffect added, and in 2019, rustc got the -Z insert-sideeffect flag, which adds llvm.sideeffect to infinite loops and recursions. Now infinite recursion is recognized as such (link:godbolt). Hopefully, this flag will soon be added as default to stable rustc too.

Be aware that this isn't a LLVM bug but a direct consequence of the insanity of C++ specification (wrt. forward progress induced undefined behaviour).

The C++ rules around forward progress allow C++ compilers to faster eliminate code which doesn't produce any observable side effects (without the code triggering undefined behaviour) but it also removes code which intentionally or not hangs the process in a busy loop or is intended to cause a stack overflow... (e.g. for testing protections).

The flag currently isn't added to rust as the penalty effect on compiler time (needs to run more analysis) and runtime (doesn't eliminate all code it should) is currently pretty high.

So this might take a while until _fully_ fixed (you always can pass in the flag yourself if you want).

Through some fixes which make it harder to hit the bug until a proper solution is found _might_ not be so far of (I hope).

Arnavion 6 years ago | |

It's indeed not a bug for C++, but to be clear it is a bug for C and Rust that use LLVM but don't have that same guarantee as C++. LLVM assumes that guarantee holds for all frontends, and added the sideeffect opcode so that frontends for languages that don't have that guarantee have a way out.

steveklabnik 6 years ago | |

We're currently talking about this in Rust-land: https://blog.rust-lang.org/inside-rust/2020/03/19/terminatin... (comment thread: https://internals.rust-lang.org/t/resolving-rusts-forward-pr...)

nv-vn 6 years ago |

Important to note that PVS-Studio is a static analysis tool for C++. As such it would definitely be in their interest to have more people use C++, but since they're arguing in favor of Rust here it definitely speaks to the fact that their analysis is unbiased.

steveklabnik 6 years ago | |

They have also previously posted some negative ones too, so I was surprised to read this for that reason. I think it also contributed to my confusion around the title vs text, that others have expressed in this thread.

ridiculous_fish 6 years ago |

Here's a case I stumble on: Rust seems to generate unnecessary branches. Compare copying an optional range:

C++: https://godbolt.org/z/VCf638

Rust: https://rust.godbolt.org/z/jRFiw_

I think the key difference here is that C++ allows specializing optional on trivial types - can anyone shed more light?

sigwinch28 6 years ago | |

It looks like the C++ version always does the copy, regardless of whether or not the optional is empty, whereas Rust only bothers copying if there's anything there.

Is this a remnant of #[inline] on Option's Clone impl methods?

With "larger" types (e.g. an optional several-GiB array) it seems like this could save some time depending on where things are in memory.

aninteger 6 years ago |

This article is a bit click-baity. It's more about busting myths by a particular C++ programmer against rust.

Anyway for a truth (well maybe it is a myth?) that we can't bust yet... Rust is simply not available on all platforms that C++ is. Two platforms that I think are missing:

* 16-bit MS-DOS

* 32-bit PowerPC (Linux)

dvfjsdhgfv 6 years ago | |

> * 16-bit MS-DOS

That's not entirely true - you actually can create .COM executables: https://github.com/ellbrid/rust_dos

Narishma 6 years ago | | |

That's 32-bit DOS, not the same.

jefft255 6 years ago | |

How much active development of new software is being done for MS-DOS? Really curious to know.

mrits 6 years ago | | |

I'd hope that he was just joking with that list

dathinab 6 years ago | |

Is anyone besides embedded doing _any_ 32-bit dev outside of maintaining legacy code?

I mean 64-bit PowerPc is by now around 17 Years old and even 15 years ago some of the most widespread users of PowerPc (Xbox) switched to using 64 bit PowerPc architecture...

monocasa 6 years ago | | |

Xbox 360 still ran in 32bit mode with the exception of the hypervisor. No use of 64bit pointers on a system with a max of 1GB of RAM other than just wasting cache space, and there's no real other benefit tacked on like you see in other 64-bit archs.

ridiculous_fish 6 years ago | | |

Depends what you mean by "embedded," but almost every set-top box and TV has a 32 bit ARM SOC. These run software under active development: web browsers, Netflix, etc.

saagarjha 6 years ago | | |

Embedded sounds like a great domain for Rust that it sadly often cannot really be used in.

jacobush 6 years ago | | |

Rust is promoted all the time as the sensible choice for embedded work instead of C or C++.

sneeuwpopsneeuw 6 years ago |

Very interesting article. Most of the time I do not like myth busting articles because they are to much focused on opinions and taking things out of context but this one is very well written and fully based on facts on both sides. Thanks for sharing.

kpp 6 years ago | |

Thank you!

acqq 6 years ago |

Just looking from afar, I don't have time to analyze every other claim, but this one:

> Both C++ and Rust have generated identical assembly listings; both have added push rbx for the sake of stack alignment. Q.E.D.

seems to be completely wrong: a decent compiler is able to align the stack without "touching" it. For the variables inside of the function to be pushed to the aligned stack position, only different offsets have to be calculated. For the stack itself to get to be aligned, only the register has to be updated, surely nothing has to be pushed.

So something else must have been happening there, and I don't have time to analyze what, but I'm sure push is surely not necessary for alignment alone.

Someone 6 years ago | |

Counterintuitively, on some x86 architectures, push ¿is/can be? faster than decreasing the stack pointer’s value because the CPU uses dedicated hardware to speed up subroutine calls (https://stackoverflow.com/a/36633556)

pitaj 6 years ago | |

It may just be for alignment. A push may be just (having a specific case in the CPU) as fast as updating the SP.

ridiculous_fish 6 years ago | |

You are correct, it has nothing to do with alignment. rbx is a callee-saved register, so the callee saves it.

thayne 6 years ago | | |

Do you mean thar rax is caller-saved?

kibwen 6 years ago |

This is an endlessly perplexing headline, as its core assertion, "C++ is faster and safer than Rust", is what the body of the article spends its whole time attempting to refute. A more accurate title for the content of this article would be "Debunking the myths that Rust is not safer or as fast as C++".

gautamcgoel 6 years ago | |

Yeah I was really confused too. You expect that author to bash Rust in favor of C++, but he does the exact opposite.

dochtman 6 years ago | |

Some scare quotes might do the title some good, too.

dang 6 years ago | |

Ok, we've debunked the title above.

junke 6 years ago | | |

Are comments about the title of articles offtopic? Because sometimes comments that discuss titles, without being inflammatory, are flagged, and sometimes they aren't.

scottLobster 6 years ago | |

Clickbait for software engineers.

brenden2 6 years ago |

Strictly speaking it's the compiler that generates faster code.

mschuetz 6 years ago | |

You'll never get faster code out of JS, though. Language design matters a lot.

dathinab 6 years ago | | |

Well, there are some close to native code speeds for some (very constrained) use-cases.

They way it's done is that if you use a certain stile of C the compiler will speculative do assumptions about the code allowing it to basically add all the C optimizations. Except that it always has to check if the assumptions are uphold and then fallback and that because it's a JIT it has much less time to optimize the code and do cross-code-section optimizations.

moonchild 6 years ago | |

Sounds like the 'sufficiently smart compiler[1] myth.

1. https://wiki.c2.com/?SufficientlySmartCompiler

dathinab 6 years ago | | |

It's both. With bad language design you won't get fast code (at least without going through insane loops).

But even with good language design the compiler need to use them, which needs time to be implemented etc.

So in practice it's often more a mixture between how easy/hard the language makes optimizations and how much work (with given expertise) was put into the compiler optimizations.

Through there are insane optimization which need to high amount of knowledge about the code and as such which you will have a really had time to ever realize with Asm,C or similar. But most time they aren't worth it as getting them right is hard and the time is often better spend with adding more straight forward optimizations, maintaining the compiler code etc.

saagarjha 6 years ago | |

They both depend on the same compiler backend, so it's really the language here.