First C compiler ported to GCC

First C compiler ported to GCC (github.com)

137 points by vegesm 5 years ago | 87 comments

ggm 5 years ago |

The

  a[b] implemented as *(a+b)

thing is how we were taught to think about array indexing in the CS lectures of the 70s.

st_goliath 5 years ago | |

And that's how it's still taught nowadays.

Both the C89 and the C99 standard draft contain the following:

> The definition of the subscript operator [] is that E1[E2] is identical to (*((E1)+(E2)))

In fact, the expressions a[b], *(a + b), and b[a] are all equivalent.

Here is a perfectly valid snippet of C code that will print out 't':

    putchar(3["test"]);

mschuster91 5 years ago | | |

I understand why that example works, but I struggle to find a valid use case, aside from code golfing...

kevincox 5 years ago | | |

How does this work in C++ with operator overloading? Are they still the same? That would make for some interesting obfuscated code.

sgtnoodle 5 years ago | |

Are those not equivalent expressions in modern C?

I imagine there are more optimal and less optimal ways of actually doing the indexing in machine code and the former may be better semantics, but I would think a compiler would generate identical machine code for both.

tinus_hn 5 years ago | | |

I’m pretty sure you have to take the size of the objects into account.

raverbashing 5 years ago | |

Hence why it "can be written" as b[a] as well

Edit: it doesn't blow up, not even with -Wall and -std=c99

st_goliath 5 years ago | | |

> (yes it will probably blow up in modern compilers, or at least give you a warning)

Nope. For the code snippet I posted an hour ago, even with -pedantic -Wall -Wextra, gcc won't issue any warnings. And why should it? It's perfectly standards-conformant, because the standard actually defines the [] operator through the equivalent addition expression.

stephencanon 5 years ago | | |

It will not “blow up” in modern compilers, nor can it, because that’s _how the operator is defined_.

nspain 5 years ago | |

That's how we were taught a few years ago too! It really helped it "click" that array elements are stored contiguously.

dvko 5 years ago | |

It’s also in the K&R book IIRC, stating that the two are equivalent.

burstmode 5 years ago |

Did this compiler really support "auto" as a variable type, as seen in the fizzbuzz example?

1ris 5 years ago | |

"auto" in C is a storage class specifier, like "register", "extern" or "static".

https://en.cppreference.com/w/c/language/storage_duration

It was considered pretty useless by most, so C++11 recycled the keyword to mean something different.

jwilk 5 years ago | | |

mywittyname 5 years ago | | |

Auto is the implicit default, right? As in function-scoped, stack-allocated, and alive until the function returns?

K&R (Second Ed.) makes no mention of the auto keyword in Section 1.10, but it does say:

> Each local variable in a function comes into existence only when the function is called, and disappears when the function is exited. This is why such variables are usually known as automatic [sic] variables[...]

bawolff 5 years ago | |

Auto is not a type; it means local variable. I think it's still a thing, it's just the default, so nobody uses it.

layoutIfNeeded 5 years ago | |

Hehe, “auto” used to mean “automatic storage” aka the stack. Then much later C++ repurposed the keyword for type deduction.

captainmuon 5 years ago | |

"auto" probably is the storage class; it tells what kind of variable this is. Automatic as opposed to "register", which would ask for the variable to be kept in a register, or "static" or "extern".

The type is not given at all, I think by default it would be "int".

dvt 5 years ago | | |

> The type is not given at all, I think by default it would be "int".

Yep, this is called the "implicit int" rule, and it was specifically outlawed[1] from C99 onward.

[1] https://herbsutter.com/2015/04/16/reader-qa-why-was-implicit...

ufo 5 years ago | | |

One of the unusual things in this early version of C is that "int" can be used for any word-sized value, including pointers. The type system was very loose.

IncRnd 5 years ago | |

No. "auto" is not a type but a storage class that means automatically allocated, instead of being allocated to a register, extern-al to the file, or in the static data segment.

Taniwha 5 years ago |

"The compiler runs only in 32 bit mode as the original code assumes that the pointer size and word size are the same." ... which was, um, 16 bits

vesinisa 5 years ago | |

I think he means that int and pointer address must be interchangeable. As long as that holds, the size can be either 16 bits or 32 bits.

On a PDP-11, int would have been 16 bits; on 32-bit x86, it is 32 bits. On x86_64, however, int is 32 bits while pointers are 64 bits. The easiest way to retain the original assumption with minimal changes to the historical source code while targeting a modern CPU is to compile in 32-bit mode.

Taniwha 5 years ago | | |

My original comment was rather tongue in cheek, but I have actually ported this compiler (well, a later version of it, from the v6 release) to a 32-bit target. It was a different time, and C was a different, definitely more forgiving and simpler language. With other systems languages like BCPL and Bliss around, the whole 'int is the same as a pointer' idea was definitely a way of thinking about things at the time.

jart 5 years ago | | |

Why can't it be 64-bit? I don't see any reason why we can't have an ILP64 data model. If int and int* were both 64-bit then it would restore so much of the original beauty of C.

nils-m-holm 5 years ago | |

Yes, and on most 16-bit and 32-bit systems, sizeof(int) == sizeof(int*). On 64-bit systems, this is most probably not the case. This is a common roadblock when porting old C programs.

29athrowaway 5 years ago |

If the first C compiler was written in C... how could it be the first C compiler? How could you compile the first C compiler?

polska 5 years ago |

I do not understand why we should create a C compiler ported to GCC.

vkoskiv 5 years ago | |

Not a C compiler. This is the original C compiler from ~1972. It's just an experiment to bring a bit of computing history to life.

polska 5 years ago | | |

Oh! At first I did not understand, but now it looks reasonable. Thanks.