Google Books Ngram of George Carlin's Seven Dirty Words(ngrams.googlelabs.com) |
Google Books Ngram of George Carlin's Seven Dirty Words(ngrams.googlelabs.com) |
In handwriting or italic, the tall 's' was rather like the integral symbol, but when setting serifed font it looks pretty much like an 'f', but missing half or all the crossbar.
Even with relatively unique names, it can be tricky. The case of completely or almost completely unique last names (like "Nietzsche") is easy, but with the available interface to the data, it's difficult to handle cases where First+Last is unique, but last alone isn't. You need to count things like "First Last" and "Last, First", plus variants like "First M. Last", without double-counting.
(And most of it does in fact seem to be for wrongly-OCRed "suck", unsurprisingly.)
I don't think I'm suddenly going colour-blind but it strikes me as odd that google would pick two colours that are so close to each other for a graph that needs so few separate colours...
"first" looks like "firft", but "This" and "that" look pretty standard.
(http://ngrams.googlelabs.com/graph?content=shit,piss&yea...)
Maybe life just really fucked back then.