Tips for stable and portable software

Tips for stable and portable software(begriffs.com)

84 points by begriffs 5 years ago | 63 comments

CJefferson 5 years ago |

I've currently involved with a system, written in C, which has been going for 30 years: GAP - https://www.gap-system.org

While I write a lot of C, I immediately disagree with the idea that C has a "simple (yet expressive) abstract machine model". Every so often we find a bug which has been present for over a decade, because some new compiler has added a (perfectly legal by the standard) optimisation which breaks some old code.

Picking one example: in the "old days", it was very common (and important for efficiency) to freely cast memory between char, int, double, etc. For many years this was fine, then all compilers started keeping better track of aliasing and lots of old code broke.

Also, while POSIX is a nice base, it stops you using Windows, and also almost every non-trivial program ends up with a bunch of autoconf (which has to be updated every so often) to handle differences between linux/BSD/Mac.

Also, definatly don't distribute code where you use flags like '-pedantic', as it can lead to your code breaking on future compilers which tighten up the rules.

roel_v 5 years ago | |

Oh I had a fun one like this just this morning. I have (C++) code spanning back to the early 1990's, although some of it was written (as C) a decade before that. It relies on checking the floating point unit status register for a bunch of its error checking. Turns out that at some point, casts from floating point to integer types started being implemented (by the compiler) using the cvttss2si instruction. And when the floating point value is too large to be represented in the integer, the fpu flag is set to 'invalid'. Which (I assume, from how everything else was implemented) didn't used to happen with whatever instruction(s) were used before SSE. And in my code, this only happens very rarely (basically it needs a combination of rare, rare and very rare circumstances to happen - too tedious to explain, not that anyone cares) so I only got bitten by this this week - probably 15 years after this started happening? And yeah the case is technically undefined behaviour, not that I'm enough of a language lawyer to know. If it hadn't been for some kind soul on SO to point this out I wouldn't even have looked in this direction because the last time this combination happened, it worked (and yeah it turns out that that program where 'it works' was compiled 20 years ago...)

Ugh, just venting, but it helps to know that there are others out there suffering through this :)

stevekemp 5 years ago | |

The changing meanings of compiler flags are a personal pet-peeve, as it leads to strange situations where you have to add both "-Wall" AND "-Wextra".

Still in programs that I expect to be used on wildly different systems I tend to enable all the flags that are common in the development-builds, and be more conservative in the production/deployed version.

attractivechaos 5 years ago | |

> Also, while POSIX is a nice base, it stops you using Windows, and also almost every non-trivial program ends up with a bunch of autoconf (which has to be updated every so often) to handle differences between linux/BSD/Mac.

I agree the lack of compliant POSIX on Windows is annoying. However, unless you rely on 3rd-party libraries, you can use #ifdef to write OS-specific code without autoconf.

pwdisswordfish4 5 years ago | |

-pedantic only enables warnings, it cannot change the meaning of code; not even on newer compilers.

OnlyOneCannolo 5 years ago | | |

Right, but that's not the concern.

You compile your code with -pedantic, it works, and then you distribute the source. A user gets that code, compiles it, it works, and they integrate it into their product. Later, that user upgrades or changes their compiler and your code doesn't build anymore because there's a new warning. Now they have to patch your build.

CJefferson 5 years ago | | |

You are right, I mis-remembered what the flag did, sorry.

I've seen projects with -pedantics -Werror, which are particularly annoying (-Werror in general to be honest, I understand why people might want it for CI of course).

ut6Ootho 5 years ago |

This article certainly rings a bell, as I started rewriting my personal projects to C in the last year, precisely because I wanted to make them decades-proof. I still use the same vimscripts I wrote in early 2000', I want the same thing for all my tooling and apps.

I'm not sure it makes sense professionally, though, as most codebase won't survive a decade : after three years, the dev team will turn over, and the new team will want to rewrite everything from scratch. Or start rewriting parts of the exisiting system in a new language, until it ultimately eat it up. It may be related to the kind of companies I work with, though (very early stage startups).

Regarding interfaces, I think the author could have gone a step further. There is actually a standard and portable interface system: html/js/css. If you write a dependency free web app using things like webcomponents and other standard techs, you know it will stand time, and it actually matches all the reason why the author want to use C : standard and multiple implementations.

user5994461 5 years ago | |

It's highly dependent on the domain.

If you're in a web startup, software won't last 3 years, the next team will systematically rewrite.

If you're in the bank, logistics, defense sector, it's very likely the software will go for a decade, as long as it's not killed the first or second year for being a pet project (initial manager left) and having no customer.

RandoHolmes 5 years ago | | |

> If you're in a web startup, software won't last 3 years, the next team will systematically rewrite.

I have an old man rant about that actually... that rewrite is typically unnecessary if you actually use discipline when developing and learn how to read code.

I once took on a CakePHP 2 app and another developer asked me how in the world I got into, and understood, the framework so quickly. My secret? I read the CakePHP 2 source code. So many developers learn how to do that very well.

fsloth 5 years ago | |

"I'm not sure it makes sense professionally, though, as most codebase won't survive a decade"

That is highly context sensitive. For example CAD packages are generally decades old.

They can't be rewritten from scratch. There is too much code. Too much of it is domain specific. The features can't change or else customer projects worth billions might suddenly go tits up when they migrate to a newer version (customers don't migrate to newer version very often though).

So, if there is some domain specific use case, worth millions to the software vendor and potentially billions to clients then stability is far more critical than keeping the codebase "modern".

ludocode 5 years ago |

This is mostly good advice. I don't love configure scripts, I don't agree with the heavy reliance on POSIX if you intend to be compatible with Windows, and I don't love the fact that the author recommends third party data structure libraries that they haven't actually used. For container libraries in C, you really have to use them to get a feel for their usability (this sounds like a tautology but it's not.)

I disagree strongly with one recommendation. This is just an example, but it holds for larger API design in general:

> we could add a fallback to reading /dev/random. [...] However, in this case, the increased portability would require a change in interface. Since fopen() or fread() on /dev/random could fail, our function would need to return bool.

No, definitely not. It is dangerous to expect the application to sanely handle the case of randomness being unavailable when it is never going to occur in practice. On all POSIX platforms, /dev/random exists and will block until sufficient entropy is available. Something would have to go seriously wrong for this to fail. This is so rare that any error handling code for it will never be tested. The most likely outcome of forcing the caller to handle it is that the return value is ignored or improperly handled and the buffer is used uninitialized, leading to a security vulnerability.

My recommendation instead would be to error check your fopen() and fread() calls within get_random_bytes(), and print an error and abort() if they fail. This way if someone's system is improperly configured and /dev/random doesn't work the program will just crash. Same goes for macOS's SecRandomCopyBytes() and Windows' half a dozen calls to use an HCRYPTPROV. This way you still return void and there is no danger of callers improperly handling errors.

In general, unless you're writing safety-critical software, it's fine for your code (or even library code) to abort() in these sorts of exceptional situations when there is no reasonable or safe way to handle the error. If someone truly wants to handle the error, they can just not use your API and do it manually.

kasperni 5 years ago |

> Tips for stable and portable software

I think a more accurate title would be "Tips for stable and portable C programs"

Cthulhu_ 5 years ago | |

The author lists a number of languages considered stable, C being one of them because of widespread support and portability. Java isn't portable for example because it depends on the JVM (and I know GraalVM is a thing but will you still be able to use it in ten years?).

brabel 5 years ago | | |

The argument against Java is weak... You can take the latest JVM and run any jar from 1999. Also, Java has had jlink in the latest versions which compiles a runtime that does not require a JVM installation, you don't need GraalVM for that.

AtlasBarfed 5 years ago | | |

The JVM has multiple implementations across a huge range of architectures. Java is managed by a standards process.

Once excluded, the article goes into depth on a range of things that one could argue Java specifically addresses and in a better, more portable way.

One could argue about GUIs, but the portability of GUIs is not just a Java/Swing problem.

nicoburns 5 years ago | | |

> Java isn't portable for example because it depends on the JVM

By that logic C isn't portable because it relies on libc.

kasperni 5 years ago | | |

Show me a JavaScript developer that cares deeply about POSIX or the operating system they are running on.

And what about Windows? It is still used on 80% on all computers? So why is POSIX essential?

chrisco255 5 years ago |

From the title, I was hoping to hear about software systems that have powered infrastructure for decades, but unfortunately it was more of a programming language analysis strategy.

jankotek 5 years ago |

Hm, decades is not that much, most enterprise code fits into that. But how about 200 years?

It is about people. Documentation, paper trail why some decisions were made, archiving build tools, VMs, dependency source code..

Also C, POSIX and Motif are terrible choice for their fragmentation. Java is very booring, but compiling and debugging 20 years old code is very common.

jart 5 years ago |

That forceinline definition is just tip of the iceberg. It's so hard to define in a way that works with different versions of GCC, -Werror, instrumentation, MSVC, and profiling. If you care about portability, consider just not caring and using static. Too much special casing code can actually make it harder for people in weird environments to use your code, since something is going to break it, and reading past the ifdef soup becomes the biggest obstacle.

Cthulhu_ 5 years ago |

I'm currently "betting" on Go for making a back-end (just a REST API + sqlite database) that will last a decade; I'm betting on the tooling to stay backwards compatible or with minimal changes in the codebase; I'm betting on the readability of my own code for the next decade, and I'm betting on the language + tools to continue to be developed whilst sticking to their original goals.

Generics is going to be fun.

gonzo41 5 years ago | |

I can't tell if you want generics or not? I've been thinking about the topic for a while and sort of think Go doesn't need em. It's less a technical reason but more a cultural reason. The language shipped without them as a feature. So why deprecate that feature and make lite-version of Java/C#.

MaxBarraclough 5 years ago |

Seems like good advice. I'd add another one that seems completely obvious, but some sloppy developers ignore it: avoid undefined behaviour. If you're going to work with C, you need to know about undefined behaviour and take it seriously.

rini17 5 years ago | |

If it were so easy, there would be already specified a subset of C without undefined behavior and you could be able to automatically check your code against it.

MaxBarraclough 5 years ago | | |

My point was only that C programmers should be keenly aware of the pitfalls of undefined behaviour, rather than blithely ignoring it. I've been surprised by the sloppiness of some developers on this point.

> a subset of C without undefined behavior

There are various projects out there that let you produce C code guaranteed to be free of undefined behaviour, but they're not 'quick fix' solutions, so they're not widely used.

https://www.eschertech.com/products/

https://github.com/zetzit/zz

https://blog.regehr.org/archives/1069 (ctrl-f for actually)

gonzo41 5 years ago | | |

You could follow NASA standards. They've got a pretty good record with c. But it'll cost you.

iso8859-1 5 years ago |

Really weird that he recommends Motif. Motif is not comparable to Web/Gtk/Qt since it has only the most primitive widgets, and no 3D support.

I would propose doing a web-app if you really care so much about compatibility. Web also allows for more custom widgets.

jmnicolas 5 years ago | |

So now you have to support all browsers : Firefox, Safari, Chrome and Edge. Plus some old stuff because this customer still has a Centos 4 Workstation running and another has a few Windows XP PCs that are mission critical.

I don't know if Motif is better at that, but I wouldn't bet on web-apps personally.

timw4mail 5 years ago | |

Is Motif actually available on modern Linux systems? And is there a Windows port as well?

I find it difficult to believe that Motif is actually that portable.

Web apps are only as portable as the browser features they use, and the browsers available for the platform. A primarily backend-rendered app, with minimal Javascript is much more portable than the average SPA app.

yjftsjthsd-h 5 years ago | | |

> Is Motif actually available on modern Linux systems?

https://www.archlinux.org/packages/community/x86_64/openmoti... lists as being updated 2020-01-05, and https://sourceforge.net/p/cdesktopenv/wiki/SupportedPlatform... claims that CDE supports a rather lot of platforms (which implies motif), although I'll grant that most of those probably haven't been tested in a while.

MaxBarraclough 5 years ago | |

> I would propose doing a web-app if you really care so much about compatibility.

Still better watch your step. Features can be removed from the platform. https://stackoverflow.com/a/46689336/

rjsw 5 years ago | |

What widgets do you feel are missing from Motif ? And why would it need 3D support ?

fjfaase 5 years ago |

One day, maybe when I am retired, I am going to develop a programming language-agnostic algorithm specifying language with which you can generate code for programming languages ;-). A kind om Mathematica, but than for software.

MaxBarraclough 5 years ago | |

> programming language-agnostic algorithm specifying language with which you can generate code for programming languages

That's just a programming language tailored for transpilation, no?

Theoretical computer science shows us there is no 'one true representation' for algorithms.

fjfaase 5 years ago | | |

I have to admit that I was little joking about this. But I do think it is possible to specify an implementation of how a complex operation can be achieve by combining more primitive operations. With digital computers, data is usually represented by certain representation of bits. An operation is usually defined on these kind of representation. Think for example of an operation for adding two numbers in a certain representation, resulting in a result with a certain representation. In computers, two most primitive adding operations are usually adding modulo some power of 2. But with these, we can implement adding for much large numbers (also using other kinds of operations and/or intermediate storage).

AnIdiotOnTheNet 5 years ago | |

You're basically describing a virtual machine. Compile to bytecode and the runtime JITs (or AOTs) it on the host. It is an idea decades old.

throwaway_pdp09 5 years ago | |

Many compilers target C, that seems like a decent approach - any problem with that?

fsloth 5 years ago | |

Like this?

https://github.com/imatix/gsl

divan 5 years ago |

Obligatory mention of Ten Years Reproducibility Challenge https://github.com/ReScience/ten-years