Redis on the Raspberry Pi: Adventures in unaligned lands

Redis on the Raspberry Pi: Adventures in unaligned lands(antirez.com)

201 points by bjerun 8 years ago | 59 comments

drej 8 years ago |

I never deal with such low level issues, so I don't have to read this, but... reading these posts by antirez is such a joy. He makes this topic so clear and understandable, he doesn't assume much, he doesn't use overly complex explanations, he just "says it like it is" :-)

Thanks!

hellwd 8 years ago | |

++ :)

drewg123 8 years ago |

I fondly remember unaligned access faults "back in the day" with FreeBSD/alpha. We implemented a fixup for applications, but not for the kernel. I seem to recall that even though x86 could deal with unaligned accesses, it caused minor performance problems, so fixing alignment issues on alpha would benefit x86 as well.

Most (definitely not all) of the mis-alignment problems were in the network stack, and were centered around the fact that ethernet headers are 14 bytes, while nearly all other protocols had headers that were a multiple of at least 4 bytes.

I've said it before, and I'll say it again: If I had a time machine, I would not kill Hitler. I'd go back to the 70s and make the ethernet header be 16 bytes long, rather than 14.

IgorPartola 8 years ago | |

Why in god's name did they make it 14?!

pjc50 8 years ago | | |

Ethernet was invented in 1973 and the first 32-bit processors were available in 1979.

While you've got the time machine, can you fix it so that "network byte order" and Intel endianness are the same too?

jandrese 8 years ago | | |

It's all they needed. 6 bytes per address, and 2 more bytes to mark the protocol. Back in the 70s and 80s memory was very expensive and developers bent over backwards to save bytes everywhere. This is also why IP addresses are only 32 bits long, even though they knew that it wouldn't be enough if the protocol went global.

Hindsight is 20/20, and a lot of times people don't appreciate the constraints these old systems had. This was being developed decade before the Commodore 64 came out with its luxurious 64 kilobytes of memory (39k usable).

jimktrains2 8 years ago | | |

They didn't feel like they needed those 2 bytes and, hey, why waste space?

Also, was a "byte" standardized at the time? Didn't they still have systems working in not-8-bit "byte", nibbles, byte, and 2-byte boundaries?

blattimwind 8 years ago |

There is a funny mode on ARM processors (turned on in some images, by default) which causes unaligned reads to silently return bogus data (just increasing a kernel counter).

PowerPC, and really, most non-x86 architectures, do this one way or another.

faragon 8 years ago | |

PowerPC (and POWER) has reasonable hardware support for unaligned memory access, at least for 32-bit data, and if the data is in the data cache. Depending on the processor, the exceptions that reach the OS can be more or less frequent.

ARM v6-A and later (except for some microcontrollers, like Cortex M0/R0, that don't support hardware unaligned access at all, triggering a exception) is similar to the Intel x86 case (reference in transparent unaligned memory access -except for SIMD, where x86 can raise exceptions, too, in the case of unaligned load/store opcodes-), where there is hardware support for unaligned memory access.

For software that uses intensive non-aligned data access, e.g. data compression algorithms doing string search, PowerPC, ARM v6-A (and later ARM Application processors), new MIPS with HW support for unaligned memory access, and Intel are pretty much the same (i.e. b = * (uint32_t * )(a + 23) will take 1-2 cycles, not requiring doing a memcpy(&b, a + 23, sizeof(uint32_t))).

For SIMD, though, there is no transparent fix, although there are specific opcodes for aligned and unaligned memory access (e.g. load/store, unaligned load/store).

antirez 8 years ago | | |

I would say that ARM v6 and later is a major step forward, but is v8 that really seems to be similar to Intel finally. The v6 was able to deal only with single fetch/store unaligned instructions, but things like accessing a double or multiple words with the same instruction would raise an exception.

throwaway000002 8 years ago |

I'm probably the only weirdo that thinks this, but if you support byte-addressing you'd better as well be happy with byte-alignment. Atomics being the only place where it's reasonable to be different.

Which brings me to padding. I wonder what percentage of memory of the average 64-bit user's system is padding? I'm afraid of the answer. The heroes of yesteryear could've coded miracles in the ignored spaces in our data.

MrBuddyCasino 8 years ago |

Accessing memory locations ending in 0x7? Gather round the campfire folks, James Mickens has a story to tell: https://www.usenix.org/system/files/1311_05-08_mickens.pdf

luhn 8 years ago |

> Redis is adding a “Stream” data type that is specifically suited for streams of data and time series storage, at this point the specification is near complete and work to implement it will start in the next weeks.

This sounds like it could be really exciting. Is there anywhere I can find out more?

Specifically, I've been struggling to find an appropriate backend for HTTP Server-Sent Events, could this feature help with that?

antirez 8 years ago | |

Hello, please check my two Redis Conf 2017 talks on youtube. There is info about Streams.

luhn 8 years ago | | |

Thanks antirez! This looks exactly like the feature I've been searching for. :)

For posterity, here's the referenced videos:

General overview: https://youtu.be/U7J33pd3hLU?t=23m54s

Implementation details: https://youtu.be/Wzy8dIjsY6Y

fancy_pantser 8 years ago | | |

Did my enhancement make it into the skip list implementation being used for the STREAM type? I am hoping it would be in place before you publish benchmarks for it.

https://github.com/antirez/redis/pull/3889

yeswecatan 8 years ago | |

Here's a discussion on reddit. There's a link to the proposal on github, too.

https://www.reddit.com/r/redis/comments/4mmrgr/stream_data_s...

johnny22 8 years ago | |

I'm pretty sure I saw implementations that used the existing publish subscribe mechanism in Redis to handle it and seemed happy with it. I have no personal experience with it though.

msarnoff 8 years ago |

Recently I've been doing a lot of low-level work with ARMv7-M microcontrollers (specifically, NXP's Kinetis Cortex-M4 chips) and was quite pleased to find out that they are pretty lenient about unaligned accesses. To quote from the ARM Cortex-M4 Processor Technical Reference Manual:

"Unaligned word or halfword loads or stores add penalty cycles. A byte aligned halfword load or store adds one extra cycle to perform the operation as two bytes. A halfword aligned word load or store adds one extra cycle to perform the operation as two halfwords. A byte-aligned word load or store adds two extra cycles to perform the operation as a byte, a halfword, and a byte. These numbers increase if the memory stalls."

However, multi-word memory instructions (LDRD, STRD, LDM, STM, etc.) always require their arguments to be word-aligned.

type0 8 years ago |

Great article, this project just begs the name of Redisberry Pi

JefeChulo 8 years ago |

In future project I might be interested in the use of Redis for queuing jobs, this comes very handy to now early the main issues I could get when developing.

amelius 8 years ago |

Could Rust's typesystem catch unaligned pointer dereferences?

bbatha 8 years ago | |

Sort of, Rust is supposed to make references to packed structure members unsafe, but currently doesn't. An RFC was accepted to change the behavior but it has not been fully implemented. Here's the tracking issue: https://github.com/rust-lang/rust/issues/27060

wofo 8 years ago | |

Considering dereferencing a pointer after doing some arithmetic on it can only be done within unsafe blocks, I would say you are at least warned about it. But it will happily compile.

dis-sys 8 years ago |

wondering what kind of performance overhead it is going to cause by letting the kernel to handle unaligned access vs. fixing the software to actually always use aligned access?

crncosta 8 years ago |

Nice article!

k__ 8 years ago |

OT: Is blattimwind shadow banned?

make3 8 years ago | |

? I see his post

yorwba 8 years ago | | |

Probably someone vouched for it.

retox 8 years ago | |

No, but posting while green will usually get your comment downvoted to oblivion, even if you are erudite and contribute to the conversation.

Turn on "show dead comments" and see how many greens are deleted. I screenshot many examples.

taneq 8 years ago | | |

Is this cause (ie. people downvote greens out of prejudice) or effect (greens are often created to shitpost?

And to concentrate all my meta in one place... Is shadow banning a thing at HN? I thought they just, well, banned you.

icebraining 8 years ago | | |

I doubt it's downvotes. Probably cases of the spam/ringvoting detector gone wrong.