OpenBSD disables Intel's hyperthreading due to security concerns

OpenBSD disables Intel's hyperthreading due to security concerns(mail-archive.com)

478 points by mereel 8 years ago | 145 comments

jimrandomh 8 years ago |

> We really should not run different security domains on different processor threads of the same core. Unfortunately changing our scheduler to take this into account is far from trivial.

This suggests a long-term compromise solution where threads within a process can use hyperthreading to share a core, but threads in different processes can't. Given that hyperthreads share L1 cache, this might also be better for performance.

mww09 8 years ago | |

>This suggests a long-term compromise solution where threads within a process can use hyperthreading to share a core, but threads in different processes can't. Given that hyperthreads share L1 cache, this might also be better for performance.

Intuitively this may sound logical, however in practice it's often not the case. For many workloads putting two threads of the same program on a core ends up being worse than co-locating with threads from different programs. The reason is that two threads of the same program will often end up executing similar instruction streams (a really good example is when both are using vector instructions (these registers are shared between the two hyperthreads)).

ajross 8 years ago | | |

In practice it sometimes is the case, though.

SMT/hyperthreading is complicated. If you have a workload dominated by non-local DRAM fetches, it's a huge win because when the CPU pipeline is stalled on one thread it can still issue instructions from the other.

If you have a workload dominated by L1 cache bandwidth, the opposite is true because the threads compete for the same resource.

On balance, on typical workloads, it's a win. But there are real-world problems for which turning it off is a legitimate performance choice.

amelius 8 years ago | | |

> The reason is that two threads of the same program will often end up executing similar instruction streams

Why is that bad?

classichasclass 8 years ago | |

I'm not sure that would necessarily fix the problem definitively. Say you had a browser running web-exposed JavaScript on a thread. You could still finagle a Spectre-type information leak that way by having the JavaScript thread snoop other browser threads, assuming no other mitigations.

endianswap 8 years ago | | |

Don't most browsers run one process per page/tab nowadays?

stormbrew 8 years ago | | |

In theory marking threads even within the same process as part of a different 'security domain' shouldn't be impossible, though obviously it'd involve proprietary interfaces to the kernel at first.

Tepix 8 years ago | | |

Once operating systems offered this mitigation mechanism, I'm sure browser vendors would use them.

chasil 8 years ago | |

Perhaps it makes more sense to require that all processes on an individual core share the same UID.

Browsers are particularly problematic, and it would be nice to alert the scheduler that a particular process is untrusted and extra care should be taken to sanitize caches before and after its time slice.

the8472 8 years ago | |

> Given that hyperthreads share L1 cache, this might also be better for performance.

If userspace thread writes something into a buffer, does some syscall initiating asynchronous work in the kernel wouldn't it be better for the kernel thread to be located on the same core instead of shuffling the data into another cache?

keldaris 8 years ago |

So... they "strongly suspect" (but don't know and haven't shown) there may be a Spectre-class bug enabled by current HT implementations and improving their scheduler is hard, so they'll pre-emptively disable HT outright on Intel CPUs now and others in the near future?

I'm not an OpenBSD user (and glad for it, if this is anything to go by), but I'm curious - is this really how they operate, or does this decision stand out?

Scramblejams 8 years ago |

I've never trusted hyperthreading for workloads I haven't tested. Sometimes it's faster, often it's slower. Beyond that, I've been suspicious of its security implications from day one. My first trip through the BIOS on a personal machine always includes turning it off.

ailideex 8 years ago | |

Can you give example of where it was slower for you with HT enabled?

Scramblejams 8 years ago | | |

Running finite element models with MSC NASTRAN, basically heavy matrix math. Matrices were NxN, N was around 10 million. This was on a server with 36 cores and a half terabyte of RAM, purchased in 2014.

Also seen Erlang workloads where you could get a bit of throughput increase with your VM scheduler scheduling more threads than your physical cores (so starting to use HT) but the latency would spike and become very unpredictable, which was a bad tradeoff for the use case.

gnufx 8 years ago | | |

HPC workloads normally at least won't benefit and probably take a hit from HT on Xeon-ish hardware at least. It's normally turned off on HPC compute nodes (perhaps in software so the resource manager can enable per-job if necessary). There are exceptions, particularly with KNC and, perhaps, KNL. The situation is likely different for POWER, but I don't have experience of it.

rythie 8 years ago | |

You could just buy i5 based machines instead which don't have hyperthreading.

krylon 8 years ago | | |

From what I have seen, I think many dual-core i5 CPUs for notebooks support hyperthreading.

GrayShade 8 years ago |

There are some Linux HT benchmarks here: https://www.phoronix.com/scan.php?page=article&item=intel-ht...

mehrdadn 8 years ago |

Do you get the exact same performance characteristics by ignoring the extra virtual cores as you would have gotten if you could actually disable hyperthreading in the CPU via the firmware setup? Or does it result in some CPU resources becoming unusable that would otherwise be usable if HT were truly disabled?

notaplumber 8 years ago | |

Operating systems can't disable HT/SMT in the same way as the BIOS/firmware can, but presumably it will be fine if the kernel only schedules the idle process on HT "cores", it will spend much of, or all its time in a lower power state (MWAIT? C-states?), presumably the CPU is smart enough to handle that.

mehrdadn 8 years ago | | |

I guess figuring out whether it's only "presumably" or actually "actually" was why I asked the question in the first place.

blattimwind 8 years ago | |

At least in previous generations some resources where statically shared, but most were dynamically shared.

Someone1234 8 years ago |

Ouch. I will say though, Hyper-Threading is a lot less valuable these days than it was when it was first introduced (except for the few dual core CPUs still available).

When you have four-six-eight or more cores, there's less value in doubling that number. The gain is lower.

hermitdev 8 years ago | |

Except the performance of hyper-threading today is far better than it was first introduced. I had a dual-socket P4 Xeon box w/ HT around 2003. Single-threaded performance with HT enabled was around 70% of what it was with HT disabled. Today, I think you'd see only about 95-98% of enabled vs disabled performance.

I don't have hard numbers to back this up, it's purely my personal experience/recollection. On my 2 socket P4 Xeon box, I disabled HT. On my current I7 6-core box, I have HT on.

derekp7 8 years ago | |

On the other side, a hyperthreaded CPU used to be about 10 - 30% gain, but in tests I've ran on recent hardware (HP DL380 Gen 10) hyperthreading gives around 70% more performance (the test I used was running pigz [parallel gzip] on a large file).

greglindahl 8 years ago | | |

That's a great example of how hyperthreading's performance effects are extremely workload dependent.

moab 8 years ago | |

It's still important to hide latency and saturate the memory controllers for programs with irregular memory accesses (e.g. graph algorithms), although the difference is not 2x, but something more like 10-15% over running without hyper-threading.

garganzol 8 years ago | |

Depends on load. I run parallel integration tests on hyper-threaded machines and usually see 80% gains.

classichasclass 8 years ago |

The implication seems to be that other architectures are also soon to have SMT disabled by default. That would definitely hurt POWER, for example.

mrpippy 8 years ago | |

I think the only other OpenBSD architecture that supports any SMT chips is sparc64 (like the US T1/T2). Unless an actual vulnerability is found, I don't see other OSes following this lead

temprature 8 years ago | | |

An "actual vulnerability" has been found. It's amazing that even after the lazy FPU fiasco, people think OpenBSD did this on a complete whim.

aade 8 years ago | | |

Why not? It’s arguably a way to make it slightly safer to run on Intel.

andreiw 8 years ago | |

Also 64-bit Arm...

joesavage 8 years ago | | |

As far as I’m aware SMT in Arm cores is pretty uncommon actually.

equalunique 8 years ago |

I was going to submit this news from the source I learned it from, which has the novel peculiarity of coming from a site that's name is similar to this one: https://thehackernews.com/thn/2018/06/openbsd-hyper-threadin...

tynecomputers 8 years ago |

Does anyone know when they are going to patch this or is it a permanent fix?

epynonymous 8 years ago |

i didnt see this posed in the comments, but it was certainly tops on my mind. is this the same issue for linux kernel?

petee 8 years ago | |

If they are using Hyper Threading, then yes, unless they already have a different architecture:

"We really should not run different security domains on different processor threads of the same core. Unfortunately changing our scheduler to take this into account is far from trivial."

aade 8 years ago | | |

The (recent) SPARC Hypervisor does a fair job at this. Fujitsu has an interesting implementation. But it would be conceivably difficult to do this with time sharing on Intel chips without exposing side channels. That kind of control should be supervisory and in control of the chip. I haven’t yet seen that on Intel, but I’ve heard there are some hardware manufacturers that are looking to do something like that.

kojon99 8 years ago |

They should make it easier to find the diff behind all openbsd emails. I can’t find this one.

foodstances 8 years ago | |

https://github.com/openbsd/src/commit/96c11352863a7f6240b4e5...

petee 8 years ago | |

Although not ideal, and there is likely an easier way to do it in full CVS (but i lack those skills), but you can always go to their Web CVS and manually check the files listed in the commit:

https://cvsweb.openbsd.org/cgi-bin/cvsweb/

https://cvsweb.openbsd.org/cgi-bin/cvsweb/src/sys/arch/amd64...

DSingularity 8 years ago |

Ouch. Huge hit for performance.

Forbo 8 years ago | |

From the commit message, it sounds like that might not necessarily be the case: "Note that SMT doesn't necessarily have a posive effect on performance; it highly depends on the workload. In all likelyhood it will actually slow down most workloads if you have a CPU with more than two cores."

AHTERIX5000 8 years ago | | |

Wonder why, because of poor SMP scalability and coarse locking?

I've encountered some cases where SMT made performance worse such as with very optimized HPC libs but in general SMT can really help. Compiling projects got a nice boost when enabling HT on Intel's recent arch for example (all of this on Linux though, last time I checked OpenBSD its SMP perf was abysmal)

creo 8 years ago |

What scares me is that they do OS wide change based of wording "This can make", "And since we suspect" and "In all likelyhood" instead of doing actual tests. I know that open systems doesn't have required workforce, but doing changes based on subjective reasoning is slippery slope.

flurrything 8 years ago | |

They care about making OpenBSD secure, not about producing security exploits.

Many OpenBSD devs are security researchers in academia. If they hear whisphers over beers that there are new Spectre attacks coming that exploit this or that, they might not be able to reproduce the exploit without putting a lot of work into it (it's research after all), but they might be able to prevent it by making a simple change, like disabling hyperthreading.

OpenBSD cares more about security than basically any other trade-off in OS design (performance, usability, ...), so it makes sense to me that they went this way. If you want a balance of security and performance, OpenBSD is not for you any ways.

detaro 8 years ago | |

Did it scare you when your operating system started to support it, on the basis that it would "in all likelyhood" be fine?

For a system aiming at security, it's a completely valid choice to disable things that start to look questionable, even if it's not conclusively proven yet. Just like potential software vulnerabilities are patched even if nobody has demonstrated that they actually are exploitable yet.

monort 8 years ago | |

If it's a response to LazyFP bug, then it's under embargo, you can't have a test yet.

gerdesj 8 years ago |

FFS: so far I've seen shit loads of "oooo - stuff <wave hands>" here from people who are clearly not experts or even understand the issues properly in this. Neither am I.

OP (and environs) has names on it that I have seen before and respect as knowing what the hell they are on about.