Memory Allocators 101 – Write a simple memory allocator (2016) (arjunsreedharan.org)

271 points by DonbunEf7 7 years ago | 51 comments

FreeRTOS also has a nice collection of different heap implementations with varying degrees of sophistication. They are a good read for those interested in such things.

https://www.freertos.org/a00111.html

https://github.com/aws/amazon-freertos/tree/master/lib/FreeR...

WalterGR 7 years ago | |

For the uninitiated, RTOS == Real-Time Operating System. Real-time OSes have constraints that don’t exist in non-real-time OSes.

RTOS on Wikipedia: https://en.m.wikipedia.org/wiki/Real-time_operating_system

q3k 7 years ago |

Please don't write custom memory allocators for production code without seriously considering the security implications of doing so. The glibc allocator has benefited from years of security hardening against very creative attack vectors.

You don't want a simple double-free to lead to an RCE bug, do you...

pjc50 7 years ago |

K&R includes an example memory allocator as well: https://stackoverflow.com/questions/13159564/explain-this-im...

To me, the K&R one seems much less readable and also doesn't include the global malloc lock, since even the updated edition of K&R predates standardised threading.

I note it also does the "bp = (Header *)ap - 1;" trick, so if that's undefined behaviour then it's a good example of how hard it is to write C without relying on UB.

01100011 7 years ago |

Good timing. I literally had to do this on a whiteboard in an interview about 11 hours ago.

Interesting things to consider: fragmentation prevention, real-time performance, minimizing locking (lock-free techniques, or per-thread free lists), and reusing the freed memory to contain the free list structure. I basically started out whiteboarding what the article lays out and by the end of the interview realized everything wrong with it. It's a good starting point though.

sbmthakur 7 years ago | |

If I may ask, what position/designation were you interviewing for?

dvdbloc 7 years ago | | |

I was asked this as well a few months ago for a New Graduate Software Engineer position to work on embedded avionics.

01100011 7 years ago | | |

Embedded/Platform engineering, working on a new OS based around a microkernel.

DivisionSol 7 years ago |

I gotta say this is a very topical read. Was messing around last week or so trying to learn some x86 Assembly from scratch (Linux subsystem on Windows), and memory was very much a sticking point. Seeing this is helping me grok just what is sorta going on, which is the best way for me to learn I think.

slashvar2701 7 years ago |

While I strongly support the idea that no one should write a generic allocator in production, writing it as an exercise is a very good idea.

The article looks a lot like the tutorial I wrote a long time ago ... (Every now and then, I see my old PDF in posts about implementing allocators, which is disturbing since I wrote it in a hurry as a quick support for a lecture and I found it very poorly written ... )

I think it's interesting to note that using sbrk(2) is a bit deprecated, it's way easier to implement malloc(3) using mmap(2) ...

There are also better ways to handle pointer arithmetic and list management. Someday I'll put a cleaner version online ... Someday ...

apankrat 7 years ago |

It's worth noting that

    realloc(ptr, 0)

behavior is undefined. The vast majority of modern C libraries will implement it as

    return free(ptr), NULL;

and it will be documented on man pages as such, but there are systems where this will be equivalent to

    return free(ptr), malloc(0);

Furthermore, in theory, this is also permitted:

    return NULL;

so as tempting as realloc() might be as a single override for implementing custom allocators, there are some worms.
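One way to sidestep the ambiguity is to never let a zero size reach realloc() at all. A minimal sketch, assuming a hypothetical wrapper named `xrealloc` (not part of any standard library) that picks the "free and return NULL" interpretation:

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical wrapper: gives a zero-size request one fixed meaning
 * (free the block, return NULL) regardless of the underlying C library. */
static void *xrealloc(void *ptr, size_t size)
{
    if (size == 0) {
        free(ptr);              /* free(NULL) is a no-op, so ptr may be NULL */
        return NULL;
    }
    /* Non-zero size: returns the resized block, or NULL on failure
     * (in which case the old block is left untouched). */
    return realloc(ptr, size);
}
```

With this wrapper, a NULL result for a non-zero size always means failure, and a NULL result for a zero size always means the block was released.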

tom_ 7 years ago | |

Realloc's behaviour is well-defined: https://port70.net/~nsz/c/c11/n1570.html#7.22.3.5p3

If the size of the space requested is zero, the behavior is implementation-defined: https://port70.net/~nsz/c/c11/n1570.html#7.22.3

richardwhiuk 7 years ago | | |

Being implementation defined is hardly well defined.

rv11 7 years ago |

Knuth described this very nicely in one of his volumes (Vol. 1, I think). Way better than what was in K&R.

paavoova 7 years ago |

    header = (struct header_t*)block - 1;

Isn't this UB?
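For context, the quoted line recovers a bookkeeping header that sits immediately before the pointer handed to the caller. A minimal sketch of that layout, assuming a simplified header with only a size field and malloc() as the backing store (the article itself uses sbrk); the names `toy_alloc`, `toy_size`, and `toy_free` are hypothetical. It only illustrates the layout and does not settle the standards question:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Simplified header. The union pads it to max_align_t so the payload
 * that follows the header stays suitably aligned for any type. */
typedef union header {
    struct { size_t size; } s;
    max_align_t align;
} header_t;

static void *toy_alloc(size_t size)
{
    if (size > SIZE_MAX - sizeof(header_t))
        return NULL;                    /* header + payload would overflow */
    header_t *header = malloc(sizeof(header_t) + size);
    if (header == NULL)
        return NULL;
    header->s.size = size;
    return header + 1;                  /* payload starts right after the header */
}

static size_t toy_size(void *block)
{
    /* Step back one header from the payload pointer. */
    header_t *header = (header_t *)block - 1;
    return header->s.size;
}

static void toy_free(void *block)
{
    if (block != NULL)
        free((header_t *)block - 1);    /* release from the true start */
}
```

Here the payload pointer was produced as `header + 1` inside a single allocation, so stepping back by one header lands on the start of that same allocation.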

anon49124 7 years ago |

Is there any effort to extract ORCA from Pony (which is better than Erlang's and Azul's C4)?

badminton1 7 years ago |

Good for didactic purposes but in the real world you may want to try an allocator like tcmalloc or jemalloc.

wongarsu 7 years ago | |

In the past I've sometimes achieved 20% or better speedups by writing custom allocators (speedup on overall performance, not just malloc performance). tcmalloc or jemalloc are great for the general case, but sometimes you know invariants about object sizes, alloc patterns and free patterns that allow much more performant allocation.

The simplest case is if you know that you will free everything at once, or nothing at all. This allows you to eliminate most bookkeeping and allows a completely lock-free architecture. But there are also more complex cases where you can still get big benefits from exploiting known invariants.
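The "free everything at once" case is often called an arena or bump allocator. A minimal sketch, assuming a single fixed-size buffer obtained from malloc() and hypothetical names (`arena_t`, `arena_alloc`, and so on); it has no locks because it assumes one arena per thread:

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

typedef struct {
    unsigned char *base;    /* start of the backing buffer */
    size_t capacity;        /* total bytes available */
    size_t used;            /* bytes handed out so far */
} arena_t;

static int arena_init(arena_t *a, size_t capacity)
{
    a->base = malloc(capacity);
    a->capacity = (a->base != NULL) ? capacity : 0;
    a->used = 0;
    return (a->base != NULL) ? 0 : -1;
}

/* Bump allocation: round the offset up to the maximum alignment,
 * then advance it. There is no per-object bookkeeping. */
static void *arena_alloc(arena_t *a, size_t size)
{
    const size_t align = _Alignof(max_align_t);
    if (a->base == NULL)
        return NULL;
    size_t start = (a->used + align - 1) & ~(align - 1);
    if (start > a->capacity || size > a->capacity - start)
        return NULL;                    /* arena exhausted */
    a->used = start + size;
    return a->base + start;
}

/* "Free everything at once": a single store resets the arena. */
static void arena_reset(arena_t *a) { a->used = 0; }

static void arena_destroy(arena_t *a)
{
    free(a->base);
    a->base = NULL;
    a->capacity = 0;
    a->used = 0;
}
```

Individual objects are never freed; the only bookkeeping is one offset, which is where the speedup in this pattern tends to come from.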

flurrything 7 years ago | | |

> but sometimes you know invariants about object sizes, alloc patterns and free patterns that allow much more performant allocation.

jemalloc allows you to query these invariants if you don't know them, and to use the information to re-configure the allocator to match them :/

I'm pretty sure most modern allocators allow you to do this as well.

> The simplest case is if you know that you will free everything at once, or nothing at all.

That's pretty much a one liner with jemalloc.

saagarjha 7 years ago | |

Or even the standard memory allocator provided by your system. I'm pretty sure this article was meant as a way to understand how malloc works and not as a high-performance replacement for the one you're currently using.

lasagnaphil 7 years ago | |

I’ve heard many gamedevs make specialized allocators for some of their code (the most common being arena allocators)...

01100011 7 years ago | | |

It's fairly common to avoid naked malloc()/free() in systems with real-time requirements. Memory pools are a great way to go if you want deterministic behavior and better reliability.
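A fixed-size memory pool can be sketched in a few lines. The version below assumes a statically sized pool and hypothetical names and constants (`pool_t`, `POOL_BLOCK_SIZE`, `POOL_BLOCK_COUNT`); it is single-threaded and keeps the free list inside the free blocks themselves, so allocation and release are both constant-time:

```c
#include <assert.h>
#include <stddef.h>

#define POOL_BLOCK_SIZE  64     /* assumed fixed block size in bytes */
#define POOL_BLOCK_COUNT 128    /* assumed number of blocks in the pool */

/* While a block is free, its own memory holds the link to the next
 * free block, so the free list needs no extra storage. */
typedef union pool_block {
    union pool_block *next;
    max_align_t align;
    unsigned char payload[POOL_BLOCK_SIZE];
} pool_block_t;

typedef struct {
    pool_block_t blocks[POOL_BLOCK_COUNT];  /* storage lives in the pool itself */
    pool_block_t *free_list;                /* head of the free list */
} pool_t;

static void pool_init(pool_t *p)
{
    /* Chain every block into the free list. */
    for (size_t i = 0; i + 1 < POOL_BLOCK_COUNT; i++)
        p->blocks[i].next = &p->blocks[i + 1];
    p->blocks[POOL_BLOCK_COUNT - 1].next = NULL;
    p->free_list = &p->blocks[0];
}

/* O(1): pop the head of the free list. */
static void *pool_alloc(pool_t *p)
{
    pool_block_t *b = p->free_list;
    if (b == NULL)
        return NULL;            /* pool exhausted */
    p->free_list = b->next;
    return b;
}

/* O(1): push the block back onto the free list. */
static void pool_free(pool_t *p, void *ptr)
{
    if (ptr == NULL)
        return;
    pool_block_t *b = ptr;
    b->next = p->free_list;
    p->free_list = b;
}
```

Because every block has the same size, the pool cannot fragment, and the worst-case cost of each call is known up front.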

Thaxll 7 years ago | | |

All major game engines have custom memory allocators.

apankrat 7 years ago | |

Not sure why you are downvoted, but this is a toy allocator.

typon 7 years ago | | |

It's downvoted because it's missing the entire point of this article?