How Does an Intel Processor Boot?

How Does an Intel Processor Boot? (binarydebt.wordpress.com)

178 points by bytefire 7 years ago | 47 comments

burfog 7 years ago |

This part is a bit of a mess:

"16-bit Real Mode with insruction pointer pointing to address 0xffff.fff0, the reset vector. In this initial mode, the processor has first 12 address lines asserted, so any address looks like 0xfffx.xxxx. This fact combined with how addressing using segment selector (SS) register works, allows the CPU to access instruction at reset vector address 0xffff.fff0"

SS is the Stack Segment register. CS is for code.

In real mode, the instruction pointer is only 16-bit and cannot hold a value in excess of 0xffff.

Ignoring those issues, the explanation still doesn't match what I've seen before in the documentation. If things have changed, when did that happen? The old explanation:

The CS base is set to something like -16, that is with all but the lower 4 bits set. This covers all of physical address space, with any higher bits just being ignored. The instruction pointer is set to 0. The result is execution that starts 16 bytes below the first address that is beyond the end of the physical address space. For example, with 44-bit physical addresses this would be at 0x00000ffffffffff0.
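A rough sketch of the arithmetic in that older explanation, assuming the CS base really is -16 truncated to the physical address width and the instruction pointer is 0. The function name and the list of widths are illustrative only and are not taken from any manual.

```python
def reset_fetch_address(phys_addr_bits: int) -> int:
    """Address of the first fetch if the CS base is -16 and IP is 0 (assumed model)."""
    mask = (1 << phys_addr_bits) - 1  # keep only the implemented address bits
    cs_base = -16 & mask              # all bits set except the lower 4
    ip = 0                            # instruction pointer, per the old explanation
    return (cs_base + ip) & mask


if __name__ == "__main__":
    for bits in (32, 44, 48):
        # Zero-pad to 16 hex digits so the ignored upper bits are visible.
        print(f"{bits}-bit physical addresses: 0x{reset_fetch_address(bits):016x}")
```

With 44-bit physical addresses this gives 0x00000ffffffffff0, matching the example above.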

bytefire 7 years ago | |

yes CS not SS. i should fix that.

how the CPU addresses 0xffff.fff0 is not exactly specified in the post. the CS register is actually loaded with 0xf000, and normally this would yield a segment base address of 0x000f.0000 (CS left-shifted by 4 bits). but on a reset, as the post mentions, the first 12 address lines are asserted, so the base address ends up being 0xffff.0000. these address lines remain asserted until a long jump is made, after which they are de-asserted and the normal CS base calculation resumes.

instruction pointer contains -16 as you mentioned, the resulting address is:

base address + IP = 0xffff.0000 + 0xfff0 = 0xffff.fff0

i am not sure if this is worth adding to the post but it is definitely useful.

atq2119 7 years ago | | |

I recall reading that it's not that those 12 bits are explicitly asserted, but rather that the CS descriptor after reset is in an "unreal mode". After all, x86 segment descriptors consist not just of their numeric value, but also of a base address, segment size, and privilege information.

So at reset, CS is set to a descriptor whose numeric value is 0xf000 and whose base address is 0xffff0000, or something to that effect. All the rest follows naturally -- there's no special case logic that asserts lines of the address bus until the first long jump, it's simply that the reset value of the CS descriptor is rather magical, and that long jumps by their nature load a new CS segment descriptor which isn't magical.

burfog 7 years ago | | |

The instruction pointer is the IP register. It is zero. It does not contain -16 or 0xfffffff0. The linear address is a different thing (computed from the CS base plus the IP/EIP/RIP content), as is the physical address.

Unless something has changed in recent hardware, there aren't 12 address lines just asserted. This is a side effect of the CS base being a particular value.

An important thing to realize is that x86 has hidden registers associated with segments. These registers get set when a segment selector register is loaded, not when it is used. The CS base is one of these hidden registers. If CS is loaded in protected mode, the base comes out of the descriptor table, and it remains when switching back to real mode. (this is the "unreal mode") If CS is loaded in real mode, the base comes from the selector shifted left, and this base remains even if you switch to protected mode. Switching modes doesn't change a segment base. Loading segment registers is what changes a segment base.
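A toy model of the hidden-register behaviour described in the paragraph above, as a minimal sketch. The class and method names are made up for illustration, the descriptor table lookup is reduced to passing the base in directly, and limits and access rights are left out entirely.

```python
from dataclasses import dataclass


@dataclass
class SegmentRegister:
    selector: int = 0  # visible 16-bit selector
    base: int = 0      # hidden cached base, changed only by a load

    def load_real_mode(self, selector: int) -> None:
        # Real-mode load: the base is the selector shifted left by 4 bits.
        self.selector = selector & 0xFFFF
        self.base = self.selector << 4

    def load_protected_mode(self, selector: int, descriptor_base: int) -> None:
        # Protected-mode load: the base comes from the descriptor
        # (here handed in directly instead of read from a table).
        self.selector = selector & 0xFFFF
        self.base = descriptor_base & 0xFFFFFFFF

    def linear(self, offset: int) -> int:
        # Linear address = cached base + offset, whatever mode the CPU is in now.
        return (self.base + offset) & 0xFFFFFFFF


if __name__ == "__main__":
    cs = SegmentRegister()
    cs.load_protected_mode(0x0008, 0x12340000)
    # A mode switch is deliberately not modelled: it would not touch cs.base.
    print(hex(cs.linear(0x10)))  # still uses the descriptor base ("unreal mode")
    cs.load_real_mode(0x1000)    # only a new load replaces the cached base
    print(hex(cs.linear(0x10)))
```

The point of the sketch is that `linear` never looks at the current mode; only the two load methods change `base`.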

So initially, the CS base is not set in a way that matches what you would get if you loaded the CS selector value that is seen. It is set to a value that is possibly 0xfffffff0, 0x00000ffffffffff0, 0x0000fffffffffff0, or 0xfffffffffffffff0. The older documentation I've seen would use the largest of those values. I suppose it could then be cut down to 32-bit by the bottleneck that is normally a part of addressing when not in long mode. This is the sort of area where Intel, AMD, and others may differ.

Perhaps there is a hardware debugger for x86 (like a JTAG debugger) that would show the initial CS base. One could also guess that Simics or VMware might be correct, disassembling them to find out what they use. Another idea is to examine the badly-documented state used by the virtualization instructions.

Sir_Cmpwn 7 years ago | | |

By "asserted" do you mean "pinned" or "fixed"?

bytefire 7 years ago | |

@burfog i have updated the post with explanation of how the reset vector address is calculated. thanks for pointing out :)

JdeBP 7 years ago | | |

As others here, I strongly recommend reading the IA manuals on this subject, as well as the equivalent AMD doco. Most of the processor part (but not the firmware part) of this subject is in the manufacturer doco.

And yes, one has to be careful about outdated information.

* https://superuser.com/a/347115/38062

* https://superuser.com/a/695716/38062

* https://superuser.com/a/345333/38062

* https://unix.stackexchange.com/a/461774/5132

garganzol 7 years ago |

Hey, thanks for the great article on an intricate topic. It is nice to see the resurgence of WordPress blog posts recently. Back in the day it was the primary source of sacred knowledge for me and many other people around the world.

hlandau 7 years ago |

It's a bit light on details in the ME and pre-real mode stages. For example, it omits to mention the 'Authenticated Code Module' which now executes before the BIOS reset vector.

ajross 7 years ago | |

Lots of those details are vendor-specific. Really the specified behavior exists in the modern world not to enable "boot" (which as you point out is mostly the job of firmware running on the ME and not the application CPUs), but to clearly define the initialization sequence for SMP startup (i.e. how do the auxiliary CPUs start -- Yes, that's right, they start in real mode) and some other minutiae like mode switching for legacy interaction with BIOS calls.

userbinator 7 years ago | |

Emphasis on the before.

AFAIK this is a completely new behaviour and only for newer versions of ME; the older versions still boot the main CPU like a 386 did, and the ME processor is a separate thing (Don't quote me on this; just information I gathered from brief research.)

bytefire 7 years ago | |

yes true. the article deliberately focuses on reset-vector onwards steps. perhaps it should make that clear in the beginning. the ME mention is just to create a context for the starting point. of course ME is a huge topic in itself

sebazzz 7 years ago |

Doesn't this article confuse BIOS and UEFI? UEFI starts the bootloader in 32-bit mode, however, the classic BIOS still jumps through the bootloader in 16-bit mode.

hyperman1 7 years ago | |

That's a hard question to answer. On the original PC, ROM BIOS (Basic I/O System) was the name for both the (EEP)ROM-based firmware sitting at address F0000 and the INT based interface it provided to the OS.

This included how the system handed off control to the OS. BIOS just loaded the first disk sector and executed whatever it found there. MBR-based partition tables were a DOS convention; the BIOS couldn't care less what the first disk sector did once it was in control.

When UEFI, the new boot interface, was invented, we needed a name for the old boot conventions. So we called them BIOS.

It's hard to claim the article confuses anything if the original word BIOS has such a confused meaning to begin with. If you say BIOS=PC firmware except the option ROMs, the article is correct.

JdeBP 7 years ago | | |

The use of "BIOS" to name the PC, PC/AT, PC98, and suchlike firmwares long pre-dates the existence of EFI, and was not a reaction to it.

And the name is even more confusing than you paint it to be. The "BIOS" was also the bottom-half of MS/PC/DR-DOS, contained in IO.SYS in (pre version 6) MS-DOS and in IBMBIO.COM in PC-DOS and (post version 3) DR-DOS.

JoshTriplett 7 years ago | |

> UEFI starts the bootloader in 32-bit mode

Or 64-bit mode, on most current systems.

bytefire 7 years ago | |

it's a good point. the article doesn't say classic BIOS hands off to the bootloader in 32-bit mode. most boot firmware nowadays leaves the system in 32-bit mode at hand-off, and that's the point this article makes. the only mention of classic BIOS is as an example of boot firmware.

having said that it is perhaps worth clarifying in the article that classic BIOS would hand off in 16-bit mode :)

usr1106 7 years ago |

It would be interesting to understand the fundamental difference why ARM systems can even boot without a BIOS and a BIOS chip. Of course they also need to have their DRAM configured, but when compiling U-Boot I don't recall any complication like stackless code. Just that your address space is extremely small in the beginning.

zaarn 7 years ago | |

IIRC from my Embedded Linux course, ARM has a small amount of SRAM embedded in the chip that is available during boot and is later disabled in favor of the bigger DRAM chips once the controller is initialized.

U-Boot is responsible for this on some chips.

usr1106 7 years ago | | |

Yes, ARM has this small amount of SRAM. Having a bit of memory available was such a trivial concept that it did not even occur to me that the much more complicated x86 processor could just lack it.

But I guess this might not be the only fundamental difference.

(I have thought to utilize the SRAM for some optimization later at system runtime because it should be incredibly fast. At least if you don't have to care about power consumption that should be possible or does the specification require to turn it off?)

bogomipz 7 years ago |

I have a suggestion and a question. The article makes numerous references to "memory initialization" but I didn't see anywhere where it's explained what it means to "initialize" DRAM hardware. If I overlooked this I apologize.

My understanding has always been that initializing DRAM consisted of two things:

The BIOS has to enumerate how much physical memory the motherboard has installed, and then test that the memory is working by writing a bit to each location and reading it back.

Would this be accurate?

Would also be worth noting that many BIOSes allow you to hit the space bar to skip memory initialization, presumably because it is somewhat time-consuming.

bytefire 7 years ago | |

no you didn't overlook it. the article doesn't discuss the actual mechanics of DRAM init, so thank you for adding this info :) i know there is a process of memory training whose aim is to arrive at the right parameters for that DRAM. the way i see it, it is sort of in-field calibration. boot firmware can then store those parameters inside the BIOS chip and on the next reboot just use those parameters, because memory training is a time-consuming process.

asianpopupwork 7 years ago |

If the memory initialization code is released as a binary blob, it is called FSP (which is usually used with Coreboot). Why not just call it by its original name, memory reference code (MRC)?

bytefire 7 years ago | |

you're right, MRC is a major part of FSP but i think FSP does more work than just initialise memory. it also performs some CPU init and ICH init.

The address FFFFFFF0H is beyond the 1-MByte addressable range of the processor while in real-address mode. The processor is initialized to this starting address as follows. The CS register has two parts: the visible segment selector part and the hidden base address part. In real-address mode, the base address is normally formed by shifting the 16-bit segment selector value 4 bits to the left to produce a 20-bit base address. However, during a hardware reset, the segment selector in the CS register is loaded with F000H and the base address is loaded with FFFF0000H. The starting address is thus formed by adding the base address to the value in the EIP register (that is, FFFF0000 + FFF0H = FFFFFFF0H).
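The arithmetic in the paragraph above can be checked with a few lines of Python. This is only a worked example using the values quoted there (selector F000H, base FFFF0000H, EIP FFF0H); the constant and function names are made up for illustration.

```python
RESET_CS_SELECTOR = 0xF000   # visible selector after reset
RESET_CS_BASE = 0xFFFF0000   # hidden base after reset
RESET_EIP = 0xFFF0           # EIP value used in the quoted sum


def real_mode_base(selector: int) -> int:
    """Normal real-mode rule: 16-bit selector shifted left by 4 bits."""
    return (selector & 0xFFFF) << 4


# Address actually used for the first fetch: hidden base + EIP.
reset_vector = RESET_CS_BASE + RESET_EIP

# What the same selector would give if the base were computed the normal way.
normal_address = real_mode_base(RESET_CS_SELECTOR) + RESET_EIP

if __name__ == "__main__":
    print(hex(reset_vector))    # 0xfffffff0
    print(hex(normal_address))  # 0xffff0
```

The two results differ only because the hidden base after reset does not match the selector; once CS is reloaded in real mode, the shift-by-4 rule applies and addresses fall back under 1 MByte.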