Zelda Screen Transitions Are Undefined Behaviour

Zelda Screen Transitions Are Undefined Behaviour(gridbugs.org)

366 points by Kaali 6 years ago | 100 comments

yc-kraln 6 years ago |

I don't think I would call this undefined behavior... the behavior is defined by what happens on the hardware when it's executed!

There are many, many classical effects on raster hardware which are accomplished by changing registers within the horizontal blanking period... copper bars, mode 7, certain paralax scrolling. When you're on a resource limited system it becomes an art to get the most out of the platform. Look at the difference between Mario 64 and Conker's Bad Fur Day... or Genji: Days of the Blade (PS3) vs Persona 5 (PS3) Even with modern consoles, there is a marked improvement in the apparent visual quality over the lifetime of the device, as developers learn how to squeeze more and more out of the platform.

papln 6 years ago | |

"Unefined" refers to the spec, not the hardware.

https://en.wikipedia.org/wiki/Undefined_behavior

This case appears to be "undocumented scenario" or "unsupported use-case", though.

klodolph 6 years ago | | |

This definitely does not appear to be an “unsupported use-case”. This technique is used in many first-party launch titles. As far as I can tell, this is the reason that sprite zero tests exist in the first place.

If you want an example of something that’s really an unsupported use case, consider mid-frame palette changes—in order to do this, you actually have to disable and re-enable rasterization during the horizontal blanking interval. It works, but it is difficult to get right and the PPU is clearly not designed with this in mind.

Short of having an actual spec in hand, it would appear that mid-frame scroll register updates were entirely normal and an intended way to use the device, based on the evidence (no other reason for sprite zero check, lots of first-party titles using this feature, other mid-frame updates are much more difficult).

kazinator 6 years ago | | |

Do we know that the developers didn't confirm this hack with the hardware people?

It could be "defined by e-mail".

dllthomas 6 years ago | |

> developers learn how to squeeze more and more out of the platform

That's a part of it, to be sure, but another dynamic is that developers have to squeeze more out of the system. Dropping old techniques on new hardware will probably give you a game that looks "better" than what's already out. Once everyone has done that, to look "better" you need something new other than hardware.

furyofantares 6 years ago | | |

That’s another part of it, to be sure, but yet another dynamic is that it being a mostly fixed system, rather than a wide array of target hardware, is what makes it feasible to implement these tricks and accumulate knowledge in the first place. It all kind of goes hand-in-hand — the fixed nature of the hardware makes it so you have to find these tricks in order to compete with games that play on newer hardware, and also makes it so that it’s possible to do so. I find it fascinating. But I suppose the market would have just found different stabilizing points were this not the case.

0xcde4c3db 6 years ago | |

Also, the specific phrase "undefined behavior" is typically invoked in a context related to the C programming language. In the C standard and some standards related to it, undefined behavior is just one point on a spectrum of (un)definedness:

- implementation-defined: Not defined by the standard, but implementations must choose a consistent behavior and document it (e.g. what's shifted in when right-shifting signed integer types)

- unspecified: Not defined by the standard; implementations must choose some way of addressing the situation, but need not document it (e.g. the order in which function arguments are evaluated)

- undefined: Entirely outside the scope of the standard; implementations may assume that such situations never occur, and need not have any sensible or consistent behavior (e.g. dereferencing NULL)

In that taxonomy, I think this is much closer to "unspecified" than "undefined". The latter is usually used in scarier contexts like random memory corruption or crashes, not consistent behaviors that rely on deliberate implementation choices as we see here.

bluedino 6 years ago | |

Super Mario Bros, vs Super Mario 3 (NES) Combat vs Keystone Kapers (Atari 2600)

reificator 6 years ago | |

I'd love to compare but I don't know what Persona's giant enemy crabs look like.

daniel5151 6 years ago |

I actually had to wrestle with this exact effect while working on wideNES [1]. By saving a screenshot of the screen at each frame alongside with it's PPUSCROLL value, it's possible to gradually build-up a map of the level as it's explored. Moreover, on subsequent playthroughs of the same level, it's possible to sync the map with the on-screen action, effectively enabling a "widescreen" mode for old NES games (with certain limitations).

Lots of games used funky scrolling mechanics, typically to create status bars, but of all the different games I tested with, TLOZ was by-far the weirdest, requiring an entire special case to get working!

I don't have any screenshots of my own, but some japanese website recently covered wideNES, posting screenshots of it working with the original Legent of Zelda.[2]

[1] http://prilik.com/blog/2018/08/24/wideNES.html

[2] https://emulog.net/fc-nes-emulator-anese-how-to-use-widenes/

anon_cow1111 6 years ago | |

*Note to mobile/metered internet users: first link contains 30MB+ of gif images, click at own risk.

Avamander 6 years ago | | |

I got flashbacks of 2000s with this comment

OJFord 6 years ago | | |

Why isn't there an HTTP request header like 'Accept-Content-Length' to limit maximum response size?

soulofmischief 6 years ago | |

That was a great write-up. Thanks for sharing.

You could probably run an async loop which slices up painted frames and compares hashes of the slices to find identical slices to anchor and stitch similar frames together, still maintaining separate layers in case a better match is found later on. Something like that should solve for games like SMB.

jedberg 6 years ago |

It feels like programming used to be a much harder job in the past. You not only had to figure out the program logic, but you had to work within very tight hardware constraints.

Reading articles like this, or about the Atari and how the code would double as a sprite in pac-man, or how 3D was rendered in Wolfenstein, makes me think one had to be much more clever back then.

MobiusHorizons 6 years ago |

Undefined behavior in C or C++ is possible because the language specification is built to deal with different hardware architectures, so it's not possible to build portable code. In the case of something like NES game development, there is only one hardware target, so the actual observed hardware behavior can be relied upon when doing things that aren't explicitly documented. In this case it is possible to know before hand exactly what will happen because there is only one hardware target. Undefined behavior in a language like C is unknowable at compile time because its behavior has not been specified by the language, and it can't be specified by the hardware. Technically I guess you are correct that it is undefined behavior, but in practice it's pretty different IMO.

codebje 6 years ago | |

I'm not sure it's really true that the NES represents a single hardware target. There's two families of CPUs (2A03, 2A07) with different clock speeds depending on whether the device is NTSC or PAL, within each family there's a half dozen or more revisions, DRAM controller chips changed frequently causing many interesting variations in how and when the object attribute memory could be read (or not read, as the case may be).

And that's just the NES devices! The Famicoms were different again, there were at least two licensed clones, and dozens of unlicensed clones (if you, eg, wanted your game to sell in Russia, you'd care about being compatible with the Dendy as well as the genuine NES).

Bluecobra 6 years ago |

Thanks for posting this... as a non-programmer, I really enjoy reading how the games I grew up with worked. If you enjoyed reading this, there's a great Youtube channel called Retro Game Mechanics Explained:

https://www.youtube.com/channel/UCwRqWnW5ZkVaP_lZF7caZ-g/vid...

cableshaft 6 years ago | |

I didn't know this channel existed and it's very interesting. Thanks for sharing it.

klodolph 6 years ago |

There are a few games that do diagonal scrolling. In general it's very difficult to do well on the NES, and you will likely have to live with some amount of glitching--unless you have extra name table RAM on the cartridge, which is fairly rare.

See http://bootgod.dyndns.org:7777/ for a database of the hardware inside each cartridge.

einr 6 years ago | |

There are a few games that do diagonal scrolling.

Paperboy comes to mind. It's smooth and looks good without any noticable glitching, and without using custom chips, too. Not sure how they did it.

klodolph 6 years ago | | |

Paperboy draws a bunch of black sprites along the left side of the screen to cover up the glitches. Depending on the game this can be acceptable, but you only have a budget of 8 sprites per scanline and 64 total sprites, and this technique can eat up a lot of that budget.

Edit: as sibling comment noted, Paperboy does have custom logic on the cartridge, a 74HC161 4-bit counter. I think this is just used to switch between CHR banks.

eropple 6 years ago | | |

I read this and was shocked - surely this is incorrect!

Nope. Paperboy uses a CNROM, which is basically an MMC3. Maybe they're doing some clever edge-replacement stuff on the mirrored table, like SMB3 does with the same hardware? But you can see glitching when SMB3 does it.

(edit: my sibling post answers it. That's slick.)

airstrike 6 years ago | |

> See http://bootgod.dyndns.org:7777/ for a database of the hardware inside each cartridge.

Wow, this is chock-full of content. I wish there was one for the Sega Master System too

Off-topic, but thought I'd link to this very fun comparison of various games across the two platforms: https://huguesjohnson.com/features/nes-vs-sms/

coldpie 6 years ago |

This is a great description of a commonly-used technique for splitting the screen in NES games that scroll smoothly. It may or may not have been intended, but this is a common technique for games that have a "status bar". Super Mario Bros 3 is another obvious example, but even Super Mario Bros uses it long before then for its top status bar. I first read about it in the excellent "I Am Error" book by Nathan Altice, but googling around for "nes sprite zero split" turns up plenty of other articles, too.

einr 6 years ago | |

Nitpicking, but SMB3 uses the MMC3 chip which "adds an IRQ timer to allow split screen scrolling without the sacrifice of sprite 0" (Wikipedia) so it does not use this technique.

SMB1 actually also does not use the sprite zero split technique because it never scrolls vertically. Its status bar is just a bunch of fixed background tiles.

coldpie 6 years ago | | |

Ah, didn't know that about the MMC3!

Regarding SMB1, I'm quite sure it uses the sprite 0 thing to keep the status bar stationary while the level scrolls smoothly beneath it by setting the scroll register only after when the status bar is done drawing. See more thorough description here: https://retrocomputing.stackexchange.com/questions/1898/how-...

pubby 6 years ago |

The NES designers goofed and made the size of the view window (nametable) 240 pixels tall. This makes vertical scrolling awkward as it throws a non-power-of-two divisor into the math. The NES doesn't have a division instruction - only bit shifts, so having to divide by 240 is a real pain!

Also, Y-scrolling wasn't completely figured out until late in the NES's life. The register writes needed to do so are very strange, and Zelda certainly doesn't do it correctly!

simcop2387 6 years ago | |

I believe that's one of the reasons that games such as Super Mario Bros 3 used additional hardware in the cartridge to do the y scrolling. The memory mapper had special support for just y scrolling and scanline counting.

http://wiki.nesdev.com/w/index.php/MMC3

pubby 6 years ago | | |

Oh, you don't need special hardware to do y-scrolling correctly. It's just a strange set of writes: $2006, $2005, $2005, $2006. MMC3 is for the scanline counter, which allowed SMB3 to have the score bar on the bottom of the screen.

bonzini 6 years ago | |

You can probably do it without divisions if you use the name table creatively... You can place line 720 of the input at line 192 of the name table (720 modulo 256 is 192) as long as everything above and below it is displayed correctly.

and0 6 years ago |

Vertical scrolling, and emulating the weird side-effects of the registers being written to, was the hardest part of recreating the NES using 3D meshes. It took me a few weekends to get Zelda 2's intro working reliably. I wrote about it a bit myself (probably got a few details wrong or simplified them) here:

http://n3s.io/index.php?title=How_It_Works

maaaats 6 years ago | |

What a cool project!

tinus_hn 6 years ago |

Weird to have this limitation that you can’t vertically scroll mid-frame, when it turns out you can if you just circumvent the blockade.

baruchthescribe 6 years ago |

This reminds me a lot of Mode X which, although a funky 320x240 mode with square pixels built in to standard VGA, only became popular after Michael Abrash popularized it in Dr Dobbs. And then there was the utterly gorgeous mode Q - 256x256 with 256 colors. No muls or shifts - high byte is Y and low byte is X.

bloopernova 6 years ago | |

That "Graphics Programming Black Book" is available online here: http://www.drdobbs.com/parallel/graphics-programming-black-b...

Mode X or Q reminds me of the amazing Mode 7 SNES graphics used to great smooth effect in F-Zero and many other titles.

raverbashing 6 years ago | |

It's funny, you get Mode X by not picking a mode "the easy way" (like mode 0x13 for 320x200x256) but by setting the lower level registries in the VGA controller, it's an unofficial mode

duxup 6 years ago |

It's always interesting how the NES cartridges had their own hardware that could expand the system's capability. Allowing for simple cartridges for simple games and more expensive cartridges for more advanced games.

jordanmorgan10 6 years ago | |

The same idea extended to the SNES too if I recall, a quick dig up on Wikipedia:

"The system was designed to accommodate the ongoing development of a variety of enhancement chips integrated in game cartridges to be competitive into the next generation."

whermans 6 years ago | | |

Going as far as co-processors for 3D rendering[0] - image buying a game today that comes with its own GPU!

[0] https://en.wikipedia.org/wiki/Super_FX

duxup 6 years ago | | |

I wonder if we'll ever get back to that sort of thing... I guess with digital downloads not so much, but i really like the idea.

shultays 6 years ago |

It is not really undefined behavior when you have single hardware that will behave in a very well defined way. Old console games have all sort of hacks that allows them to do stuff that the system is not designed for. Having such a basic hardware with no security checks allows a lot of potential!

Also isn't vertical split quite common? I would assume this is something the hardware designera thought of, not a game company figuring it out. They even put stuff like sprite 0 hit bit for this kind of tricks

llao 6 years ago |

Warning, 33 megabytes of (great) GIFs.

chungy 6 years ago | |

It's probably about time that WebP should get promoted, especially instead of animated GIF. the libwebp library comes with a gif2webp program to make the conversion especially easy.

Just doing it now, converting all the animations to WebP makes it 1.6MB. and it works in all current browsers.

EGreg 6 years ago |

meta-irony: "in a manor likely that was unintended by its designers"

msla 6 years ago |

Is anyone else getting a blank white page?

penagwin 6 years ago |

Thanks for the great visuals!