Anatomy of a Program in Memory (2009)

Anatomy of a Program in Memory (2009)(manybutfinite.com)

110 points by chaitanyav 8 years ago | 26 comments

okket 8 years ago |

Previous discussions: https://hn.algolia.com/?query=Anatomy%20of%20a%20Program%20i...

(or click "past" under the title, also helpful to check when you submit a link)

finchisko 8 years ago |

After reading many articles about virtual memory and how kernel space is mapped into every process I don't understand why it is necessary. Why can't process only have mapped it's user mode space? Also it only seems to be case of unixes and windows. Not sure how exactly it's done in OSX but "Mac OS X does not map the kernel into each user address space, and therefore each user/kernel transition (in either direction) requires an address space switch." https://flylib.com/books/en/3.126.1.91/1/

MarkSweep 8 years ago | |

On x86, I t was presumably for performance, so that the TLB does not have to be flushed when switching from user to kernel mode. x86 requires some kernel memeory to be mapped always, for example the stack for syscall and trap handlers. So by keeping everything mapped into memory, the kernel did not have to worry about which parts were needed to handle syscalls and which were not. These kernel pages were marked as “supervisor only”, so only the kernel code could actually read and write them.

I say all of this in the past tesnse, since Meltdown makes it possible to read all that kernel memory. Kernels now keep most of the kernel memory unmapped when user mode is executing.

bogomipz 8 years ago | | |

>"x86 requires some kernel memeory to be mapped always, for example the stack for syscall and trap handlers."

Can you elaborate on what you mean be x86 requires that the kernel stack always be mapped into a process address space in order for system calls?

The kernel always knows where a process's kernel stack is located as there is a pointer to it in the user process's task_struct. It is only in kernel mode that the kernel switches the CPU's stack pointer to use that that processes kernel stack.

monocasa 8 years ago | | |

This is it. Most of the RISC chips had a ASID tag with the MMU metadata that allowed you to switch address spaces without flushing the TLBs, but x86 added this super late. It ended up being added on the second round of virtualization extensions on x86 (and it's different between AMD and Intel).

bogomipz 8 years ago |

The author states:

>It is also possible to create an anonymous memory mapping that does not correspond to any files, being used instead for program data."

This isn't strictly true though is it? It was my understanding even mmap() MAP_ANONYMOUS used a file interface, and that the way the kernel creates anonymous maps is by creating an instance of /dev/zero in tmpfs. Although I believe the file descriptor might be ignored however.

monocasa 8 years ago | |

It seems to just pass around a null struct file pointer and special cases that.

Myrmornis 8 years ago |

if the process depicted in the diagram were to start a second thread, where would that second thead’s stack go in the diagram? The two threads would share the same heap.

monocasa 8 years ago | |

Another ~8MB(8MB plus guard pages) chunk of virtual memory that's free up there near the shared mappings.

You're totally right though, that threads complicated the traditional "stack grows down heap grows up" view of a Unix user address space.

Myrmornis 8 years ago | | |

Thanks. So can I check I'm understanding correctly

- If a process has many threads, their stacks are all located within a single virtual address space corresponding to the user process?

- If one thread grows down and is about to overwrite the top of another thread's stack, does the OS detect this automatically and do some sort of reallocation procedure?

dsign 8 years ago |

>> In Linux, kernel space is constantly present and maps the same physical memory in all processes.

That's right there together with the city states of Greece and other ancient memories. Meltdown and Specter happened.

monocasa 8 years ago | |

You could still make an argument for that if you squint hard enough. The virtual memory is still reserved, and a transition to kernel mode still has user space mapped, and the kernel's view of memory as well.

dsign 8 years ago | | |

Thanks for clarifying! Is whatever the kernel does now in transition to user space expensive because it's somehow proportional to the amount of actual memory that the kernel is using or has reserved?

qntty 8 years ago | |

Explain?

monocasa 8 years ago | | |

The Meltdown fixes change the model to unmap most of the kernel when switching to user mode. The issue with Meltdown was that it was possible to read memory through timing side channels that was technically mapped, but permissions shouldn't allow you to touch.

newscracker 8 years ago |

The title needs 2009 in it, since this article is from that year.

coffeeacc 8 years ago | |

64-bit update would be great.

sctb 8 years ago | |

Updated. Thanks!

chaitanyav 8 years ago | |

not able to edit the title now :-(