Computer Science from the Bottom Up (2016)

Computer Science from the Bottom Up (2016)(bottomupcs.com)

618 points by merlinsbrain 6 years ago | 76 comments

delsarto 6 years ago |

Author here ...

Yes as pointed out many times not "computer science" and I somewhat regret the name. However, it came out of me being a teaching assistant for people doing computer science degrees. A surprising number of people got to 3rd year operating systems courses without realising things like 2^10 is a kilobyte, 2^20 is a megabyte, etc. Let alone how a program was linked and loaded. I hope for this to be helpful, there are plenty of similar resources but sometimes the way one person says something resonates more than another.

I deliberately wanted to avoid x86-only to illustrate for similar reasons. Unfortunately Itanium proved to be a poor choice, ARM would have been better, but it gives me something to update if I get time! However, much of the basic content still remains relevant many years on.

acbart 6 years ago | |

Isn't it amazing how people pass courses, even perhaps demonstrate mastery at a point in time, but manage to get to later courses without fully understanding things? Students end up with a very strange mental framework by the time they graduate, and it seems like most students actually know shockingly little. I don't know that there's a solution, or even that it's an issue. To me, this is simply a reflection of how people learn.

Regardless, this seems like a nice little document for those 20% of students who are still willing to read supplementary materials. Then again, this is short and clear enough that if it were properly chunked within a course and had graded assignments attached, many students would probably actually skim it.

If you ever want to develop this some more, I hope you'll consider something like Runestone[0], in order to give it some auto-graded questions along the way!

[0] https://runestone.academy/runestone/default/user/login?_next...

henrikschroder 6 years ago | | |

When I did my CS programme decades ago, we all started with Scheme, which is very high-level, and forced everyone to focus on the algorithms, the math of it, instead of what the hardware is doing, or how the Scheme interpreter is implemented. It gets you going, thinking about values and execution and iteration and recursion.

Starting with bits and bytes and hardware would be pretty goddamn boring, I think. You'd lose a lot of students that way.

That said, in the third year we had a course in analog electronics, which was basically transistors and logic gates.

Following that course was one in digital electronics, where we all built our own little toy 8-bit computer, wiring the CPU ourselves, writing the microcode ourselves. I'll never forget the a-ha moment when you realize that your instruction set are just binary patterns representing which wires to put a current on, which units to toggle on and off. The instruction to move a value from a register to an address in RAM has to look like this, because you need to toggle the read input on the correct register, and the write input on the RAM unit, and everything else has to be off. Blew my mind at the time.

The clock was manual if you wanted to, so you could step through and watch your little CPU run a program, or you could set it to like 1Hz and watch the thing go. And from there, you can sort of get how a modern computer works, it's just a matter of going from 1Hz to 1GHz, wider buses, wider instruction sets, but it's no longer "magic" how the CPU works, it's all ones and zeroes, for a reason, and you now know that reason.

314 6 years ago | | |

Most students approach courses as a form of top-down search with a cache for items they see often. It is generally difficult to get them to reason about subjects from the bottom up.

A smaller proportion of students approach courses from the bottom-up (10-20%?) - they tend to start with the basic pieces and glue them together over time to understand larger concepts.

I’ve read about this phenomenon (it is extensively documented in pedagogic literature) and seen it in action (I taught CS for about ten years) but still could not explain to you why that split occurs.

I would speculate that it is a difference in type-1 vs type-2 reasoning based on the level of familiarity and comfort with the prerequisites to each course, but even that guess is heavily biased by studying constructivism in CS pedagogy.

vegetablepotpie 6 years ago | | |

>it seems like most students actually know shockingly little

I think that having students take 5-6 classes together in 16 weeks doesn’t promote mastery in any of those classes. Tying the performance in those classes to scholarship eligibility and job placement incentives grades, not necessarily understanding. Grades and mastery can be separated because 2-3 exams in a class, which determines the majority of the grade, rewards students the most, on a time investment vs. performance basis, for understanding technicalities in the grading system and for hyper focusing on the types of problems that can be on an exam. This doesn’t promote mastery, this is a game academia and students play for the satisfaction of government, finance and corporations. Definitely, a problem, solutions could include more frequent sampling of understanding, more diverse ways of measuring knowledge, decoupling performance from financing and longer periods to learn topics.

inanepenguin 6 years ago | | |

> Isn't it amazing how people pass courses, even perhaps demonstrate mastery at a point in time, but manage to get to later courses without fully understanding things?

People remember what they are required to recall (citation needed). Classes can only test for so many things, so people are going to remember what they needed for assignments or tests. It feels like that's one of the reasons apprenticeships are hailed by some as useful. They "test" what is required in real world application. I've never needed to know that 2^10 is a kilobyte as a developer building websites, but would that be surprising/amazing? There are many things I've needed outside of school that were never taught.

As long as CS is used as the path to software development, it will be a balance between theory and application.

Abishek_Muthian 6 years ago | | |

Recently I had a conversation with someone who has been featured in Kaggle leaderboard(top 1%) telling 'CS background isn't helpful for Machine Learning'.[1]

I believe that he didn't understand what Computer Science actually is, besides having a degree in it. I'd say it's because of the educational system, where one could attain a degree without actually qualifying in it.

In under developed/developing economies if only qualified people get their degrees, then only a fraction would get their jobs and it would be very bad for that economy and so bad educational systems are by design.

[1] https://twitter.com/heavyinfo/status/1209330850363404288

userbinator 6 years ago | |

I actually think "bottom up" is more of a misnomer than "computer science" as a title for this material, since when I think of "bottom up" I think of things like nand2tetris and Petzold's book. This looks more like "selected topics in operating systems" with some review of computer architecture basics mixed in.

delsarto 6 years ago | | |

This was really seeded for me when trying to teach people taking computer science courses who were trying to do practical things like integrate C libraries into their higher level projects, but didn't really understand how things were working under the hood and getting themselves into a mess.

If they understood a little more about how their program was built and run ... from the bottom up, as it were :) ... they would have had less pain.

Of course, it just shows as usual the hardest problem in computer science is indeed, naming!

Edit: BTW you're right, "Code" by Petzold should be required reading http://www.charlespetzold.com/code/index.html

diego898 6 years ago | |

Hello, thanks for releasing this! Are there any plans to release chapter 10?

delsarto 6 years ago | | |

It is written in DocBook, which has become increasingly annoying to publish with and I think is, unfortunately, unmaintained now.

I have been considering some parts on containers and virtualisation which isn't covered at all.

The Itanium and PowerPC examples probably haven't aged well. There is no question that Itanium is a very interesting architecture with many interesting features, but now it is dead it's like deciphering hieroglyphs. I think I have to update these to ARM, or maybe even RISC-V to be more relevant moving forward.

So that would be my plans, as they were :)

Exuma 6 years ago | |

Hey, so this is a great work you did. One thing perhaps you can explain... I've stared at this example now for like 30 minutes...

https://www.bottomupcs.com/chapter01.xhtml

Section on "Masking"

How do you get 0x09 or 0x90?

I get how using <memory> & <mask> = <extracted data>...

so:

10100101

11110000

10100000

But I have yet to see how this is 0x90 or 0x09, perhaps I'm misunderstanding. I'm trying to understand the 'shift' piece of it

314 6 years ago | | |

It’s a typo: he meant 0xa0 or 0x0a and the shift is the four positions to the right to extract the top nybble of the byte.

yellow_lead 6 years ago | |

Looks like someone has vandalized your guide.

> Reordering

> This bit is crap

delsarto 6 years ago | | |

Yes, it has some rough edges. One lesson you can learn is probably not to write things like this, and just let what you have stand as it is, and incrementally improve it.

dang 6 years ago | |

What year should we put up there?

delsarto 6 years ago | | |

2016 probably

zelly 6 years ago | |

*kibibyte

yters 6 years ago |

While not technically all of what's covered in CS, in my experience with a PhD in Comp. Eng., 8 years as an AF comm officer, and 3.5 years in the commercial sector as a soft. eng., this is vastly more useful than most of what's covered in a CS curriculum.

The CS curriculum probably made more sense back in the day when everyone was essentially an embedded developer. But nowadays, the most useful knowledge I have is the low level mechanics of how things like the OS and networking protocols work. S/W eng. classes are a bit useful, but mostly knowing how to write in C++, Java, and now Python has gotten me most of the way. As it is, I have almost never run into a situation where most of my CS classes have been relevant. And, where they are relevant, it can be covered by a week course in the basics.

I feel the CS curriculum would be much better service for students if it covered more of the knowledge of how to get things done. And not in a faddish, framework du jour manner, but there are constant elements throughout all the fads that a good developer should learn cold, and are not covered very well, at least in my 8 years in CS academia.

IMHO the real problem with CS is that it's driven by AI envy, and much of what is considered important only makes sense in light of the assumption the human mind is basically a computer, and CS is all about how to recreate a human mind. However, almost none of that line of thought matters in the real world, and is most likely false.

gumby 6 years ago |

Here’s a slightly different take: The Secret Life of Programs (https://nostarch.com/foundationsofcomp ). It starts with a practical set of things and then ends with some survey overviews for people who want to know more.

A complement, not really an alternative, to this pdf.

(I fumble fingered this the first time and left out the link! Unfortunately I can't delete the confusing comment but fortunately it is being downvoted away)

dillonmckay 6 years ago |

This seems more for OS fundamentals than full-on CS, still good, though.

userbinator 6 years ago |

Previous discussions:

https://news.ycombinator.com/item?id=13249675

https://news.ycombinator.com/item?id=7611093

smitty1e 6 years ago |

Seems an excellent place to mention http://linuxfromscratch.org/

barbs 6 years ago |

Related: nand2tetris: https://www.nand2tetris.org/.

"This site supports a course and a textbook that guide students and self-learners through the construction of a modern, full-scale computer system - hardware and software - from the ground up."

Thorentis 6 years ago |

> "This bit is crap" (https://www.bottomupcs.com/chapter02.xhtml under "Reordering")

Might want to proofread some of this and remove the ... proof-reading notes? Not sure what you thought was crap about this section?

delsarto 6 years ago | |

I think in my mind at the time I was comparing it to Hennessy and Patterson; something that is obviously playing a in a completely different league!

Over time I have become a bit more realistic :) I've stopped worrying if it's better or worse than anything else. As long as it's factually correct, I think a rising tide floats all boats so the more we all write and read others work, the better off we'll be.

monoideism 6 years ago |

Looks like a decent resource for Unix OS / system programming, but it's definitely not computer science.

lbatx 6 years ago | |

Agreed. I don't see how you can leave out (among other things) databases, Big O notation, networking, and call it a computer science primer. Good for what it is, but misnamed.

hootbootscoot 6 years ago |

This is great. It covers a pretty wide range of stuff one is presumed to know and allows for both a big picture view and sufficient detail for some "hole-filling" and connection between ideas. I would consider this an interesting lecture series.

arm1c 6 years ago |

Does anyone know if there's a book like this? Basically intro to OS but conceptual stuff only - for someone who didn't take it in school and wants to learn the high level details, not to the level normally covered in a college course.

FiberBundle 6 years ago | |

Computer Systems - a Programmer's Perspective

m10i 6 years ago |

I have this favorited/bookmarked for the foreseeable future. Thank you for working on this (I hope it's still an active project if readers point out discrepancies)

quocble 6 years ago |

Not very good to be honest. Nothing about schedulers, bootstrap, and most of it is very general information. it doesn't give actual clear picture how everything comes together, and it is very system specifics (unix). CS should be agnostic to any system architecture. You do cover binary , but the more fundamental question what is the foundation of binary mathematics. It goes back to Turing machine, and how transistors are built and how you compose them into bigger units, and then you talk about CPU architecture, the adder, multiplexer, the PLU, etc.

malandrew 6 years ago | |

Can you recommend one or more resources that you think do a better job achieving a bottom up approach to CS?

yellow_lead 6 years ago |

This looks like a great resource. I wish I had it for my OS class.

gumby 6 years ago |

Here’s a slightly different take: starts with a practical set of things and then ends with some survey overviews for people who want to know more.

A complement, not really an alternative, to this pdf.

rodneyzeng 6 years ago |

Call it Information industry corners from bottom up.

TwoNineFive 6 years ago |

I remember reading this a few years back.

I stopped reading at some point because there were so many errors, unfinished bits, and just flat-out garbage.

Sorry to the author but this needs to improve. It's a good start, but you need to invite fixes and implement them. Put your email address on every single page and invite fixes and then implement them.

This could be an awesome resource, but right now it's too full of errors to be useful.

Nobody should be recommending this. Nobody who had actually reviewed it all would recommend it.

delsarto 6 years ago | |

Everyone who tries to create something eventually comes up against being told their work is flat-out garbage. Unfortunately I don't think there is a vaccination for it, and one must just take their chances and hope to build a mental immunity so it doesn't ruin your day. It's sad to say that many don't develop a resistance, and I think we lose many good contributors that way.

outoftheabyss 6 years ago | | |

It's always good to take constructive criticism but when the negative commenter contributes little of worth, both in the comment and to the wider community, I think it is safe to ignore.

Thanks for sharing

JohnL4 6 years ago | | |

As has been said before: ignore the boos. They usually come from the cheap seats.