Sandsifter: find undocumented instructions and bugs on x86 CPU

Sandsifter: find undocumented instructions and bugs on x86 CPU(github.com)

447 points by argorain 8 years ago | 91 comments

mcculley 8 years ago |

This is great. That a program can learn about and exploit the CPU on which it is running from unprivileged userspace reminds me of the notion in Charlie Stross' Accelerando of running a timing attack against the universe to learn about the virtual machine in which we are being simulated.

michaelmior 8 years ago | |

It would be even better if there was a web service that would collect these logs for different processors so everyone didn't have to invest the time to run the analysis.

tux1968 8 years ago | | |

Well they do mention in the description:

The results of a scan can sometimes be difficult for the tools to automatically classify, and may require manual analysis. For help analyzing your results, feel free to send the ./data/log file to xoreaxeaxeax@gmail.com. No personal information, other than the processor make, model, and revision (from /proc/cpuinfo) are included in this log.

qubex 8 years ago | |

I'd never heard of Charlie Stross or his Accelerando book. Thanks for mentioning that, it looks right up my hard-sci-fi alley.

vsviridov 8 years ago | | |

Then definitely also check out the Quantum Thief trilogy, by Hannu Rajaniemi.

moomin 8 years ago | | |

He actually posts on here from time to time...

He's fairly prolific and very talented. The only one of his books I wouldn't particularly recommend is his first. The sequel's great, though.

Many of his straight up thrillers e.g. Saturn's Children have well realised universes they inhabit.

pmarreck 8 years ago | |

I believe there was also a Rick & Morty episode about this (of course...)

winter_blue 8 years ago | | |

You mean the first episode of the third season where Rick breaks out of the virtual reality interrogation room he was in, by simply presenting some data (i.e. equations) that turned out to be code that took control of the system?

_wmd 8 years ago |

tl'dr of the slides:

    Found on one processor... instruction
    Single malformed instruction in ring 3 locks
    Tested on 2 Windows kernels, 3 Linux kernels
    Kernel debugging, serial I/O, interrupt analysis seem to confirm
    Unfortunately, not finished with responsible disclosure
    No details available [yet] on chip, vendor, or instructions

He's found a new f00f bug, winter 2017 is going to be interesting :)

tempay 8 years ago | |

For those not aware: https://en.wikipedia.org/wiki/Pentium_F00F_bug

Can these kind of bugs possible to exploit to cause anything more than minor annoyance?

hexadecimated 8 years ago | | |

If it works inside a VM, an attacker could potentially cause a widespread denial of service on cloud computing platforms like Azure and AWS.

viraptor 8 years ago | | |

Use them to exploit the system itself - not likely. (Unless they cause some specific bad behaviour rather than a crash) But you can definitely use a DoS issue for other effects. For example if someone is using an auth revokation system which fails open, you could kill that part to use expired credentials. Or if you're able to sometimes inject data, you can keep killing the caching systems until your response is the saved one. (Like in DNS hijack)

qb45 8 years ago | |

Observation: the length of the censored "XXX hardware bug" text on the slides matches neither Intel, AMD nor Transmeta. Unlikely to be VIA too.

Either it's deception or perhaps some obscure low-end embedded vendor.

edit: for the curious, it's "(redacted) hardware bugs" :)

Veedrac 8 years ago | | |

You mean the black bar on the PDF? That just says "(redacted)".

rst 8 years ago | | |

Or they were smart enough to change the size of the box so that it can't be used to easily identify the vendor (from among a very small set of candidates).

ericfrederich 8 years ago | | |

"Obscure" enough to run Windows though

duskwuff 8 years ago | | |

Possibly something weird like Vortex86?

rfth 8 years ago | |

If I was a betting man I would say ARM.

adrianmonk 8 years ago | | |

Isn't this fuzzing tool x86-only?

hellbanner 8 years ago |

"Everybody hates the golden screwdriver upgrade approach, where a feature is either hidden or activated through software, but the truth of the matter is that chip makers have been doing this sort of thing for decades – and charging extra for it."

""We are moving rapidly in the direction of realizing that people want unique things and they are going to want them in silicon. In some cases, it will be done in software," said Waxman."

Also, Github says "several million" undocumented instructions.. is that right? I don't know much about assembly but that number sounds absurdly high.

dtx1 8 years ago |

This is highly interesting. I assume a lot of those are going to be debug and instructions to help the binning process. Some of these might even unlock access to parts of the CPUs we aren't supposed to have access too, opening the doors to custom microcode (unlikely that anyone outside the CPU OEM can do that though) but may allow us to disable "security features" such as the Management Engine. This is a really interesting approach and i would love to see the results ported to other hardware/vendors. The same could potentially be done with GPUs, ARM-CPUs, etc.

abainbridge 8 years ago | |

I expect Intel burn a fuse bit at the end of the binning process to prevent such features being accessed in the finished product.

duskwuff 8 years ago | |

Separate research has been done on microcode. The general consensus is that Intel's microcode binaries are encrypted, and are secured with a RSA2048-SHA256 signature.

http://inertiawar.com/microcode/

webreac 8 years ago | | |

I am surprised private keys have never leaked.

fovc 8 years ago |

Here's a link to the slides [pdf]: https://github.com/xoreaxeaxeax/sandsifter/raw/master/refere...

badminton1 8 years ago |

Also from the same author https://sites.google.com/site/xxcantorxdustxx/visual-re

chungy 8 years ago | |

That looks fun, but the site doesn't seem to have any downloads available?

badminton1 8 years ago | | |

There was a demo around somewhere. Not hard to find.

There is a similar --but less featured-- open source project. https://github.com/wapiflapi/veles

cornchips 8 years ago | | |

https://www.reddit.com/r/ReverseEngineering/comments/1izity/...

SAI_Peregrinus 8 years ago |

Christopher Domas does some very cool work. His System Management Mode exploit a few years back was quite nice. It will be interesting to see which processor it is that he found the ring 3 hard lockup instruction in...

rasz 8 years ago | |

He works for spooks - Battelle Memorial Institute, a long-time NSA/CIA contractor. One of the places that hires officially retired spies.

azinman2 8 years ago | | |

and therefore.... ?

d33 8 years ago |

...isn't the usability of the tool limited because it's running in userspace, which has fewer privileges in terms of what instructions can be ran?

c12 8 years ago | |

I was wondering this myself until I read the pdf:

> For effective results, the injector should be able to identify instructions in more privileged rings, even if it cannot actually execute those instructions.

>This approach allows the injector to detect even privileged instructions: whereas a non-existing instruction will throw a #UD exception, a privileged instruction will throw a #GP exception if the executing process does not have the necessary permissions for the instruction. By observing the type of exception thrown, the injector can differentiate between instructions that don’t exist, versus those that exist but are restricted to more privileged rings. Thus, even from ring 3, the injector can effectively explore the instruction space of ring 0, the hypervisor, and system management mode.

michaelmior 8 years ago | | |

So basically the same as throwing a 403 instead of 404 for authenticated resources in HTTP :)

poizan42 8 years ago | | |

When it comes to discovering possible bugs then there is really no guarantee that the instructions are acting as they should though.

askvictor 8 years ago | |

As the slides say, this approach prevents the system from falling over entirely, while still resolving instructions from deeper rings.

d33 8 years ago | | |

Makes sense. I was thinking if there could be a bootable fuzzer of this kind, but you're right that it would be very difficult for it to be both usable and not crash very quickly.

badminton1 8 years ago |

Lot of weird stuff done happening nowadays in CPUs.

There's a lot of mystery in microcode (equivalent to the CPU firmware), the "system management mode" aka protection ring -2, and the infamous management engine.

tonyg 8 years ago |

I wonder what dbe0, dbe1, and df{c0-c7} do? They are present and undocumented in all of Intel, AMD and VIA's variations (see p4-p5 of the paper).

pbsd 8 years ago |

For what it's worth, the size-prefixed jcc/call binutils bug had already been fixed a couple of years ago: https://sourceware.org/bugzilla/show_bug.cgi?id=18386

pwdisswordfish 8 years ago |

The slides mention an 'apicall' opcode 0ffff0; searching the web turns up nothing but these same slides. Does anyone know anything about it?

wmu 8 years ago | |

It seems to be a MS antivirus bug: http://securityaffairs.co/wordpress/60434/hacking/microsoft-...

rurban 8 years ago |

Regarding the ring 3 hard lockup he didn't disclose yet: isn't that the recent kaby lake/skylake error, released about a month ago?

ngneer 8 years ago |

Chip vendors do the same in the course of validation, and technically even before any silicon has been fabricated, using simulators.

shdon 8 years ago |

No instructions there to disable the IME?

pgeorgi 8 years ago | |

If anything, I'd expect such a flag to hide behind MSRs (http://wiki.osdev.org/Model_Specific_Registers)

That's a mostly unused namespace of 2^32 64bit registers. To hide things even better, it would also be possible to change behavior based on officially unrelated registers (eg. MSR $x only acts as IME-switch if the calling address also ends in $y and esi is $z)

cesarb 8 years ago | | |

They could also be multiplexed (MSR $x is address/command, MSR $y is data). Or require a sequence of operations (write this magic sequence of numbers to MSR $z). Or memory-mapped/IO-mapped (with the mapping enabled/disabled by MSR or PCI registers). Or be locked by the BIOS during the boot sequence.

But IMO, it probably can't be disabled at all. The "disabling" would be to change the program it runs to a program which does nothing. So there wouldn't be a "disable IME" bit; there would be bits to either make its memory visible to the main CPU cores, or to read/write to its memory, and it's possible that these bits are accessible only from the IME side, or from SMM.

__jal 8 years ago | |

Given Intel's behavior around the IME, I rather doubt there's an instruction for that. The only verifiable way to do so that I know of, on some chips, is here:

https://hardenedlinux.github.io/firmware/2016/11/17/neutrali...

YMMV, not responsible for bricked chips, and note the caveats at the end.

egberts1 8 years ago |

found another that is QEMU-specific.

https://github.com/unicorn-engine/unicorn/issues/364

egberts1 8 years ago | |

It is more about modifying executable code space and not making it stick. Good enough for fooling AV.

purpleidea 8 years ago |

wow... anyone have a link to the video of his talk?

pmarreck 8 years ago |

Is this basically a CPU fuzzer?

deathanatos 8 years ago | |

The subtitle at the very top of both the page and the README…

> The x86 processor fuzzer

brawny 8 years ago |

Out of curiosity, are there any toy compiler projects out there that try and make use of the incedental instructions? Could you possibly expect to see a with while performance boost (I'm thinking it would be unlikely...)

m00dy 8 years ago |

Someone built a fuzzer for cpus