Weekend projects: getting silly with C

Weekend projects: getting silly with C (lcamtuf.substack.com)

238 points by nothacking_ 1 year ago | 113 comments

> The above example will print the value of a, but it won’t be initialized to 123!

It certainly could do though. In C, using an uninitialised variable does not mean "whatever that memory happened to have in it before" (although that is a potential result). Instead, it's undefined behaviour, so the compiler can do what it likes.

For example, it could well unconditionally initialise that memory to 123. Alternatively, it could notice that the whole snippet has undefined behaviour so simply replace it with no instructions, so it doesn't print anything at all. It could even optimise away the return that presumably follows that code in a function, so it ends up crashing or doing something random. It could even optimise away the instructions before that snippet, if it can prove that they would only be executed if followed by undefined behaviour – essentially the undefined behaviour can travel back in time!

uecker 1 year ago | |

UB cannot travel back in time in C. It is true that it can affect previous instructions, but code being reordered or transformed in complicated ways happens even without UB.

emmericp 1 year ago | | |

The time-travelling UB interpretation was popularized by this blog post about 10 years ago [1].

I'm not enough of a specification lawyer to say that this is definitely true, but the reasoning and example given there seem sound to me.

[1] https://devblogs.microsoft.com/oldnewthing/20140627-00/?p=63...

ant6n 1 year ago | | |

> but that code is reordered or transformed in complicated ways is true even without UB.

Without undefined behavior, the compiler emits code that has the behavior defined by the code: the ordering may be altered, but not the behavior.

kazinator 1 year ago | | |

If a compiler can determine that some statement is UB, it can treat that as an assertion that the code is unreachable. All other statements which reach only that code and no other are also unreachable.

A compiler's analysis can go backward in time. That is to say, the compiler can build a model of what happens in some section of code over time, and analyze it whichever way it wants.

You cannot go back in time from execution time to translation time, but the translator can follow the code as if it were executing it at translation time.
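
A minimal sketch of the "UB as an unreachability assertion" reasoning described above. The helper name `first_or_default` is hypothetical and not from the article. Because the pointer is dereferenced before it is tested, a compiler is allowed to assume the pointer was not null and may drop the later check; whether a given compiler actually does so depends on version and optimisation level.

```c
#include <stddef.h>

/* Hypothetical helper: dereferences p BEFORE checking it. */
static int first_or_default(const int *p)
{
    int v = *p;        /* if p were NULL, this line would already be UB */
    if (p == NULL)     /* so the compiler may treat this test as always false */
        return -1;     /* and is allowed to delete this branch */
    return v;
}
```

Only the defined path is safe to exercise: with `int a = 123;`, `first_or_default(&a)` returns 123. Passing a null pointer has no defined result, so the sketch deliberately avoids it.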

kazinator 1 year ago | |

In C, it is only undefined behavior to access an automatic object that has not been initialized.

Static objects are always initialized, so the situation cannot arise.

That leaves dynamic ones, like uninitialized struct members in a malloced structure.

Accessing uninitialized dynamic memory isn't undefined behavior in C. It results in whatever value is implied by the uninitialized bits. If the type in question has no trap representations, then it cannot fail.

JonChesterfield 1 year ago |

This features the construct

  switch(k) {
    if (0) case 0: x = 1;
    if (0) case 1: x = 2;
    if (0) default: x = 3;
  }

which is a switch where you don't have to write break at the end of every clause.

  #define brkcase if (0) case

That might be worth using. Compilers won't love the control flow but they'll probably delete it effectively.
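
To see why no `break` is needed, here is the quoted construct wrapped in a small function (the name `pick` is made up for this sketch). The `switch` jumps straight to the matching label, which sits inside the body of an `if (0)`; once that one statement has run, every following `if (0)` is evaluated normally and skips its body.

```c
/* Hypothetical wrapper around the construct quoted above. */
static int pick(int k)
{
    int x = 0;
    switch (k) {
        if (0) case 0: x = 1;    /* reached only by jumping to the label */
        if (0) case 1: x = 2;    /* skipped when control arrives from above */
        if (0) default: x = 3;
    }
    return x;
}
```

With this definition, `pick(0)` is 1, `pick(1)` is 2, and any other argument gives 3.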

leni536 1 year ago | |

Surely the following would work just as well?

  #define brkcase break;case

kinda defeats the purpose of the macro even.

MaxBarraclough 1 year ago | | |

That strikes me as better. The original macro presumably misbehaves if there's more than one statement in a sequence, as the if will only affect the first statement.

wrsh07 1 year ago | | |

I think the behavior is slightly different since this one breaks the above case, and the other one only omits its case from fallthrough

Incidentally, what happens if you use your brkcase as the first case?

I don't find either particularly exciting - a macro that would append break to the current case feels better

jppittma 1 year ago | |

I think it is super unclear how this works, and I would prefer the same control flow using goto, rather than the Duff's-device-style switch abuses.

asveikau 1 year ago | |

It only works if the case label body is a single statement or is enclosed in braces.

I'll confess, I've used this construct to mean "omit the first line of the next case label but otherwise fall through".

If you think of the case label as merely a label and not a delimiter between statements all of this makes sense.
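
A short sketch of the "skip the first statement of the next case, then fall through" reading. The function name `trace` and the magic numbers are invented for illustration. Only the single statement attached to `if (0)` is guarded; the statement after it is ordinary fall-through code.

```c
/* Hypothetical demo: returns a sum that records which statements ran. */
static int trace(int k)
{
    int log = 0;
    switch (k) {
    case 0:
        log += 1;
        if (0) case 1: log += 10;   /* guarded: skipped when falling in from case 0 */
        log += 100;                 /* not guarded: runs for k == 0 and k == 1 */
    }
    return log;
}
```

Here `trace(0)` gives 101 (the `+= 10` is skipped), `trace(1)` gives 110, and any other value gives 0 because there is no `default`.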

geon 1 year ago |

This can be used to implement coroutines in C. https://stackoverflow.com/questions/24202890/switch-based-co...

emmericp 1 year ago | |

uIP (TCP/IP stack for tiny microcontrollers) is another fun real-world example for these types of coroutines: https://github.com/adamdunkels/uip/blob/master/uip/lc-switch...
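
A rough sketch of the switch-based coroutine idea, in the spirit of the links above but not copied from either of them. The `struct counter` type and `counter_next` function are hypothetical. The trick is to store `__LINE__` before returning and to place a `case __LINE__:` label on the same source line, so the next call jumps back into the middle of the loop.

```c
/* State for one hypothetical generator. */
struct counter {
    int line;   /* 0 means "not started"; otherwise the line to resume at */
    int i;      /* loop variable lives here because the stack frame is lost */
};

/* Yields 1, 2, 3 on successive calls, then -1 on every later call. */
static int counter_next(struct counter *c)
{
    switch (c->line) {
    case 0:
        for (c->i = 1; c->i <= 3; c->i++) {
            /* save the resume point, yield, and resume here next time */
            c->line = __LINE__; return c->i; case __LINE__:;
        }
    }
    c->line = -1;   /* matches no case label, so later calls skip the switch body */
    return -1;
}
```

Starting from `struct counter c = {0, 0};`, repeated calls to `counter_next(&c)` produce 1, 2, 3, -1, -1. Any local that must survive a yield has to live in the struct, which is the main practical restriction of this style.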

nj5rq 1 year ago |

Why did I not know that this:

    case 1 ... 10:

Is valid C? I have been programming in C for years, what standard is this from?

G4E 1 year ago | |

Unless it has been recently standardized it's not valid C, it's a GNU extension.

dekhn 1 year ago | |

It appears to be a GNU C extension: https://gcc.gnu.org/onlinedocs/gcc/Case-Ranges.html but I couldn't find the history of the extension. I believe it is not in standard C (not sure about clang).

nj5rq 1 year ago | | |

I just tried it, and it works with clang version 17.0.6.

astrange 1 year ago | | |

Clang supports almost all GNU C extensions. Maybe not nested functions because they need executable stacks.
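
For reference, a small sketch of the case-range syntax discussed above. The function `bucket` is made up for this example, and the code relies on the GNU extension, so it is only expected to build with compilers that support it (GCC, and Clang per the report above).

```c
/* Hypothetical classifier using GNU case ranges (a compiler extension). */
static const char *bucket(int n)
{
    switch (n) {
    case 1 ... 10:   return "low";    /* keep spaces around the ... */
    case 11 ... 100: return "high";
    default:         return "other";
    }
}
```

Both ends of a range are inclusive, so `bucket(10)` is "low" and `bucket(11)` is "high". The GCC documentation recommends writing spaces around `...` so that integer bounds are not misparsed.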

jftuga 1 year ago |

This reminds me of some silly C code I once wrote for fun, which counts down from 10 to 1:

    #include <stdio.h> // compile & run: gcc -Wall countdown.c -o countdown && ./countdown
    int n = 10; int main(int argc, char *argv[]) { printf("%d\n", n) && --n && main(n, NULL); }

Python version:

    import sys # run: python3 countdown.py 10
    def main(n:int): sys.stdout.write(f"{n}\n") and n-1 and main(n-1)
    main(int(sys.argv[1]))

Shell version:

    # run ./countdown.sh 10
    echo $1 && (($1-1)) && $0 $(($1-1))

quietbritishjim 1 year ago | |

Nitpick: you could replace sys.stdout.write(f"{n}\n") with print(n). The current code looks very much like it was written for Python 2 (apart from the f string!), where print was a statement. As of Python 3, print is just a regular function. It returns None, which is falsey, so you'd also need to change your first "and" to an "or".

jftuga 1 year ago | | |

Thanks for this suggestion - it works great.

    import sys # run: python3 countdown.py 10
    def main(n:int): print(n) or n-1 and main(n-1)
    main(int(sys.argv[1]))

This also works and is definitely more Pythonic:

    _ = [print(n) for n in range(10,0,-1)]

cbrpnk 1 year ago | |

I don't think I've ever thought of explicitly calling main(). Made me chuckle.

akdev1l 1 year ago | | |

I think it is UB

Edit: actually looks like it is UB in C++ but not C

teo_zero 1 year ago |

Another source of surprise:

  4[arr] // same as arr[4]

stefanos82 1 year ago | |

Thanks to array decay to pointer, we basically have `*(array_label+offset)` which in this case of yours we have `*(offset+array_label)`; in other words, `*(arr+4)` is the same as `*(4+arr)`...that's it, really!

trealira 1 year ago | |

By the same principle, these are exactly the same:

  arr[i][j]
  j[i[arr]]

These are the simplifications you'd do. You only need to know that a[x][y] is equivalent to (a[x])[y], and that a[x] is the same as x[a].

  arr[i][j]
  (arr[i])[j]
  (i[arr])[j]
  j[i[arr]]
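
The equivalence is easy to check mechanically. In this sketch the array `grid` and both function names are invented for the example; each function indexes the same element, once in the usual order and once in the swapped order.

```c
/* Hypothetical 2x3 table used only for this demonstration. */
static const int grid[2][3] = { {1, 2, 3}, {4, 5, 6} };

/* Conventional spelling. */
static int normal_index(int i, int j)  { return grid[i][j]; }

/* Swapped spelling: i[grid] is *(i + grid), then j[...] is *(j + ...). */
static int swapped_index(int i, int j) { return j[i[grid]]; }
```

For every valid pair of indices the two functions return the same value; for example, both give 6 for `(1, 2)`.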

codext 1 year ago |

The final obfuscated code snippet in the article brought to light another GCC extension:

https://stackoverflow.com/questions/34559705/ternary-conditi...

smusamashah 1 year ago |

Found these silly tricks by the author of this blog on twitter first. Switch statement can do loops too https://twitter.com/lcamtuf/status/1807129116980007037

viraptor 1 year ago | |

Also on the actually social network https://infosec.exchange/@lcamtuf/112701486085621844

mgaunard 1 year ago |

aren't the switch shenanigans important to the duff's device?

tialaramex 1 year ago | |

Duff is relying on the fact you're allowed to intermingle the switch block and the loop in K&R C's syntax, the (common at the time but now generally frowned on or even prohibited in new languages) choice to drop-through cases if you don't explicitly break, and the related fact that C lets your loop jump back inside the switch.

Duff is trying to optimise MMIO. You wouldn't do anything close to this today even in C, not least because your MMIO is no longer similarly fast to your CPU instruction pace, and for non-trivial amounts of data you have DMA (which Duff's hardware did not). In a modern language you also wouldn't treat "MMIO" as just pointer indirection. To keep this working in C they have kept adding hacks to the type system rather than say: OK, apparently this is an intrinsic, we should bake it into the freestanding mode of the stdlib.

Edited to add:

For my money the successor to Tom Duff's "Device" is WUFFS' "iterate loops" mechanism where you may specify how to partially unroll N steps of the loop, promising that this has equivalent results to running the main loop body N times but potentially faster. This makes it really easy for vectorisation to see what you're trying to do, while still handling those annoying corner cases where M % N != 0 correctly because that's the job of the tool, not the human.

masklinn 1 year ago | | |

> Duff is relying on the fact you're allowed to intermingle the switch block and the loop

That's just a special case of being able to intermingle switch with arbitrary syntax, which is what TFA does, before it jumps to computed gotos.

mgaunard 1 year ago | | |

To me the duff's device is just a mechanism to unroll a loop without having to duplicate the code for the trailing case.

While you can't use SIMD you can still benefit from instruction-level parallelism.

It's potentially better in some scenarios where you want to minimize instruction cache usage and there are few iterations of the loop.
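
For readers who have not seen it, here is a sketch of the unrolling trick under discussion. It is a memory-to-memory variant with an invented name (`duff_copy`) and an unroll factor of 4; Duff's original wrote repeatedly to a single output register and unrolled by 8. The `switch` jumps into the middle of the loop body to handle the `n % 4` leftover elements, and the `do`/`while` then runs full groups of four.

```c
#include <stddef.h>

/* Copies n ints from src to dst, unrolled by 4, Duff's-device style. */
static void duff_copy(int *dst, const int *src, size_t n)
{
    if (n == 0)
        return;                      /* the do/while below always runs at least once */
    size_t passes = (n + 3) / 4;     /* trips through the loop, counting the partial one */
    switch (n % 4) {
    case 0: do { *dst++ = *src++;
    case 3:      *dst++ = *src++;
    case 2:      *dst++ = *src++;
    case 1:      *dst++ = *src++;
            } while (--passes > 0);
    }
}
```

With `n == 5`, the switch enters at `case 1` and copies one element, then the loop runs once more and copies the remaining four. Whether this beats a plain loop on a current compiler is something to measure rather than assume.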

uecker 1 year ago | | |

Not sure what you mean by "hacks to the type system". All modern computing essentially converged to unified memory, which is exactly C's model.

o11c 1 year ago |

Due to the way lifetimes work in C (they begin with the block, not the declaration), the following is legal:

  #include <stdio.h>
  #include <stddef.h>

  int main()
  {
      {
          int *p = NULL;
          if (p)
          {
          what:
              printf("a = %d\n", *p);
              return 0;
          }
          int a = 123;
          p = &a;
          goto what;
      }
  }

junon 1 year ago |

> switch (i) case 1: puts("i = 1");

I've seen this in the wild, particularly with macros.

    #define assert(c) if (!c) ...
    
    if (foo) assert(...);
    else bar(); // oops!
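
A sketch of the usual workaround for the dangling-else problem shown above: wrap the macro body in `do { ... } while (0)` and parenthesise the argument. The macro names, the `failures` counter, and the two demo functions are all hypothetical; they exist only to make the difference observable.

```c
static int failures = 0;   /* hypothetical counter, just for this demo */

/* Unsafe: expands to a bare `if`, which captures a following `else`. */
#define CHECK_BAD(c)  if (!(c)) failures++

/* Safer: the do/while wrapper makes the expansion a single statement. */
#define CHECK_OK(c)   do { if (!(c)) failures++; } while (0)

static int demo_bad(int foo)
{
    int else_ran = 0;
    if (foo)
        CHECK_BAD(1 + 1 == 2);
    else                    /* binds to the `if` hidden inside the macro */
        else_ran = 1;
    return else_ran;
}

static int demo_ok(int foo)
{
    int else_ran = 0;
    if (foo)
        CHECK_OK(1 + 1 == 2);
    else                    /* binds to `if (foo)`, as the layout suggests */
        else_ran = 1;
    return else_ran;
}
```

With the unsafe macro the `else` runs when `foo` is true and the check passes (`demo_bad(1)` returns 1), which is the opposite of what the indentation implies. With the wrapped macro, `demo_ok(1)` returns 0 and `demo_ok(0)` returns 1.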

pdimitar 1 year ago |

Fun at parties alert:

Let's stop getting silly with C, too many CVEs!

---

Serious comment:

It's a rather cool article actually. Not something I'd do daily but it's kind of sort of useful to know these techniques.

drzzhan 1 year ago |

I am so lost at the final block of code. Does every C developer have to deal with this everyday?

ICameToComment 1 year ago | |

Certainly not. That's the purpose of the article where they say in the final sentence that it's entirely possible to write readable, yet totally befuddling code in C that stands a chance in the IOCCC.

BenjiWiebe 1 year ago | |

Not even close. If any C developer ever has to deal with that ever, something somewhere went horribly wrong.

fanf2 1 year ago |

Metaprogramming custom control structures in C by Simon Tatham

metadat 1 year ago | |

Discussed in July 2021 (43 comments):

https://news.ycombinator.com/item?id=27781784

nxobject 1 year ago |

If only there was a way of using setjmp/longjmp-style contexts instead of goto, un/winding the stack as required. So we could travel around in time... unfortunately you can't work with a setjmp buffer before it's actually created, unlike gotos.

gpderetta 1 year ago | |

sigaltstack tricks to the rescue! (Although POSIX only, not ISO C)
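
For context, a minimal sketch of ordinary setjmp/longjmp use, which shows the restriction mentioned above: the buffer must be filled by `setjmp` before anything can `longjmp` to it, so control can only return to a point that has already been visited. The names `checkpoint`, `deep`, and `run` are invented for this example.

```c
#include <setjmp.h>

static jmp_buf checkpoint;          /* only valid after setjmp has filled it */

/* Recurses a few frames down, then jumps back out in one step. */
static void deep(int depth)
{
    if (depth == 0)
        longjmp(checkpoint, 42);    /* unwinds straight back to the setjmp call */
    deep(depth - 1);
}

static int run(void)
{
    switch (setjmp(checkpoint)) {
    case 0:                         /* first, direct return from setjmp */
        deep(3);
        return -1;                  /* not reached: deep() never returns normally */
    case 42:                        /* second return, caused by longjmp */
        return 42;
    default:
        return -2;
    }
}
```

Calling `run()` returns 42. The `setjmp` call is used directly as the controlling expression of the `switch`, which is one of the contexts the C standard explicitly permits.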

JohnMakin 1 year ago |

My undergrad was entirely in the C language and I’m very glad for it. Sometimes more modern languages can throw me for a loop, no pun intended, but the beauty (and horror) of C is that you are pretty close to the metal, it’s not very abstracted at all, and it allows you a lot of freedom (which is why it’s so foot gunny).

I will never love anything as much as I love C, but C development jobs lie in really weird fields I’m not interested in, and I’m fairly certain I am not talented enough. I have seen C wizardry up close that I know I simply cannot do. However, one of the more useful exercises I ever did was implement basic things like a file system, command line utilities like ls/mkdir etc. Sometimes they are surprisingly complex, sometimes no.

After you program in C for a while, certain conventions meant to be extra careful kind of bubble up in languages in a way that seems weird to other people. For example, I knew a guy who'd auto-reject C PRs if they didn't use the syntax if (1==x) rather than if (x==1). The former will not compile if you accidentally use the assignment operator instead of the equality operator (which everyone has done at some point).

This tendency bites me a lot in some programming cultures; people (ime) tend to find this style of programming overly defensive.
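
A tiny sketch of the constant-on-the-left convention described above, using a made-up function `is_one`. The point is what happens on a typo: `1 = x` cannot compile because a literal is not assignable, whereas `x = 1` is valid C and silently changes the meaning.

```c
/* Hypothetical example of the "constant first" comparison style. */
static int is_one(int x)
{
    /* Mistyping this as `1 = x` is a compile error, because the
       literal 1 is not something that can be assigned to. */
    if (1 == x)
        return 1;
    return 0;
}
```

GCC and Clang also warn about `if (x = 1)` when built with `-Wall`, which catches the same slip without changing the operand order.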

  extern volatile int x;

  int ub(int d, int c)
  {
      int r;
      x += 3;
      r += x;
      int _div = d / c;
      r += _div;
      for (int i = 0; i < 100; ++i) {
          x += 3;
          r += x;
          r += _div;
      }
      return r;
  }