Python 2 vs. Python 3: A retrospective

Python 2 vs. Python 3: A retrospective(dropbox.com)

114 points by mitchelllc 12 years ago | 109 comments

justinmk 12 years ago |

I'd really like to see the video for these slides. But here's what caught my interest:

    Set and dict comprehensions
    {x**2 for x in range(10)}
    {x: x**2 for x in range(10)}

    Why reduce() must die:
    ... the applicability of reduce() is pretty much limited
    to associative operators, and in all other cases it's 
    better to write out the accumulation loop explicitly.

    int [divided by] int should return float

    nonlocal

Explicit nonlocal variable modifier which (I guess) "promotes" the variable outside its local scope. Kind of the inverse of Java requiring 'final' to bind a variable to a closure.

jzwinck 12 years ago | |

You know what's funny? All of those things could have been done in Python 2.8, apart from the int division change. And the division change does as much harm as good, because lots of people use Python and also use another language where int division works the "old fashioned way"; for them (me) this change is counter-productive because it adds a pointless distinction. It is a great change for programming novices, for sure, but that's only part of Python's audience, and probably won't be the longest-lived part.

justinmk 12 years ago | | |

> another language where int division works the "old fashioned way"; for them (me) this change is counter-productive ... It is a great change for programming novices

Douglas Crockford made a point in a recent interview[1] (not the first time, I'm sure) that this is exactly the wrong reason to keep doing things "the way it's always been". Other examples he mentions: line endings (CR/LF), integer overflow, short vs long. Big vs little endian would be another obvious example.

Fred Brooks (Mythical Man Month) calls this "accidental complexity".

> [novices are] only part of Python's audience, and probably won't be the longest-lived part

By definition. But, it's not really a good use of anyone's time to be dealing with truncation in a time when it no longer has any reason to be the default except historical accident.

[1] http://hanselminutes.com/396/bugs-considered-harmful-with-do...

unfamiliar 12 years ago | | |

I have tracked down many confounding bugs caused by people accidentally using integer division. This change is one of the main reasons I use python 3; it gives me one less thing to worry about, because integer division (which is only correct occasionally) stands out clearly with //.

masklinn 12 years ago | | |

> All of those things could have been done in Python 2.8

The first one is in 2.7 (it's completely backwards compatible). The division is in 2.5 or 2.6 imported from __future__. #4 skirts the line, new keywords have been added in point release in the past. #2 wouldn't fly.

But here's the thing, P3 was not about these (or not only about them), they got bundled in because P3 was allowed to be significantly backwards-incompatible and thus a lot of changes became acceptable which were not justifiable or much harder to justify in a point release. The primary breakage point of P3 is not any of these, it's the string changes.

anonymous 12 years ago | | |

All of these things, including the int division "thing" are present in python 2.7 already. For division, you need to import it:

    from __future__ import division

And then dividing numbers works a bit more logical (well I think it's more logical). You can do division with rounding with the // operator: a // b, and it even works with floats: (3.9 // 1.2) == 3

yen223 12 years ago | | |

The division change is good. I have a hard time understanding why you'd want

    3/2 = 1

as the default behaviour.

yeukhon 12 years ago |

The one thing I wish they could change in the future is forcing list and dictionary iterable same syntax:

instead of writing

for index, element in enumerate(some_list)

for key, value in my_dict.items()

they should unify and make items and enumerate default behavior. i.e.

for index, element in my_list:

for key, value in my_dict:

I really don't see the benefit of not doing this as default behavior. I always find if I need to loop a list there is a good chance the index can help, and even if I don't need it it doesn't hurt to have one either. Simple is better. And the whole looping dict and get back the key only sucks too because you often need the value as well so you essentially do dict["key"] but why not just default return both key and value?

MBlume 12 years ago |

Can anyone else not read the last lines of some of the slides?

goronbjorn 12 years ago | |

Here is a version with no problems (I used the Box View API): https://view-api.box.com/view/VoRxuIIQel26CLNAgt8KskrQxgUpwD...

ChronosKey 12 years ago | |

Dropbox's powerpoint viewer isn't perfect. I had to download the pptx. Works fine in Keynote.

jzwinck 12 years ago | | |

Thanks for pointing out it's a Dropbox thing--at first I thought it might be a Chrome problem. Specifically the viewer seems not to use the proper font for this presentation. Preview on Mac OS works fine.

gsnedders 12 years ago | | |

It's some PPTX -> PDF converter and then it's just pdf.js.

blaze33 12 years ago | |

Ubuntu here, tried with LibreOffice, same issue. Had to reupload it to google drive, problem solved: https://drive.google.com/file/d/0B8MSXu_W6_e4ZE02ZUQ5dU03cXM...

shocks 12 years ago | |

I am also having this problem.

_random_ 12 years ago |

"People positively hate incompatible changes – especially bad for dynamic languages", "Never again this way – the future is static analysis and annotations".

Wouldn't it be better to pick a better-suited language then?

John Carmack put it nice way:

"One of the lessons that we took away from Doom 3 was that script interpreters are bad, from a performance, debugging, development standpoint. It’s kind of that argument “oh but you want a free-form dynamically typed language here so you can do all of your quick, flexible stuff, and people that aren’t really programmers can do this stuff”, but you know one of the big lessons of a big project is you don’t want people that aren’t really programmers programming, you’ll suffer for it!"

LBarret 12 years ago | |

Epic might disagree, the UnrealEngine is heavily scriptable. This was one of its major selling points in the last generation.

I have a huge respect for Carmack but some other people prooved him wrong in the past. His opinions are often taken as gospel but more discreet people (like Sweeney) may have different and a s worthy points of view.

tomp 12 years ago |

Why does Guido think that slices syntax is screwed up? I mean, it's not exactly natural, but at least it's consistent (first bound is included, second is excluded):

  a = '12345'
  a[0:-1] == '1234'
  a[-1:0:-1] == '5432'

Personally, I think that "downcounting" slices are rarely used. For code clarity, I prefer reversing the string/list first.

JeffJenkins 12 years ago | |

This came up on python-ideas recently, there was a long thread: https://mail.python.org/pipermail/python-ideas/2013-October/...

midgetjones 12 years ago |

I would have been more interested in learning Python if there wasn't such a great divide. I read the first chapter of several books that said "Python 3 is out, but we're going to stick with 2.7 because too much shit is broken".

pdonis 12 years ago | |

That was true early on in Python 3, but it's not true now.

jzwinck 12 years ago | | |

People have been saying this for three years, but it's not true now.

jzwinck 12 years ago | |

As someone who learned Python when 3.2 came out, I completely agree with you. I have only really used Python 2.7!

Because too much shit is broken (NumPy, hello). Because Python 3 has been the default on basically no system ever (OK, maybe this is changing right now, slowly).

As Guido says, it's been five years and it will take another five. This whole experiment has been a huge misstep for Python, an absolutely massive gaffe. Some of Python's peers did it too, roughly around the same time (Perl, and to a lesser extent Ruby).

Python (Guido?) noticed its own maturity a bit too late. The damage is incredible; along with the performance stuff (which is in a way easier to overcome) this may be a key factor leading to the fall of a great language.

nron 12 years ago | | |

On the other hand, my experience has been very different: I learned Python when 3.2 was current as well, using Lutz' "Learning Python", which takes the approach of "teach Python 3, and explain how 2 is different whenever necessary". I've followed suit and taken the approach of writing Python 3 code first, and to make it work on 2.7 only when I need to, which I found fairly easy to do, though it can make the code a bit uglier sadly (writing cross-version-compatible metaclass code is the one that annoys me, since it adds some verbosity).

I'm looking forward to 2.x dying out to eliminate that retrofitting step (and it's happening: the improving dependency landscape means I find I have to do it less and less often), but I've not experienced any major pain overall. From where I'm sitting, Python 3 is a better, cleaner language, and as someone new to Python, I'm happier for it.

ac29 12 years ago | | |

NumPy works fine on python 3.

caligo 12 years ago | |

Are those books, books that came out this year?

michielvoo 12 years ago |

Question for the professional Python developers: do you (on a daily/weekly/monthly) basis switch between projects in Python 2 and Python 3? Is that hard to do (e.g. do you have to constantly and consciously remind yourself of syntax/semantic differences), or does your mind sort of automatically adjust to the new/old patterns?

pmelendez 12 years ago |

This is my problem with Python: "Rename func_name —> __name__, etc Rename .next() —> .__next__()"

Too many ugly renames, too few alternatives of doing things. To be honest the only attractive thing to me is all the libraries that they support but I don't find the language itself interesting.

rdtsc 12 years ago | |

What do you mean by "ugly renames?" How many times in using Python did you have to look at .__next__() or .func_name ?

I have been using Python full time for the last 7 years and I very rarely have to call either of those function.

> Too few alternatives of doing things.

Can you explain that as well? What do you mean by alternatives of doing things? Like say you want to read a file and you might want to use a wider variety of options when opening the file handle or say you want to parse JSON and you'd like standard library to have more parsers available?

lmm 12 years ago | |

Not having alternatives is python's greatest strength. The language is easy to read because as much as possible there is only one way to write a given concept.

mercurial 12 years ago | |

> too few alternatives of doing things

That's what you want for maintainability. I'm not interested in maintaining a codebase where every programmer have their own idea of how something should be done. Of course, you have code reviews for this kind of thing. Except when the code is already written. And when it is not, arguing over minor points is an unnecessary timesink.

sillysaurus2 12 years ago | |

__ is basically a namespace for official language extensions. How would you suggest they do it? Prevent "next()" from being a valid method name?

pmelendez 12 years ago | | |

That has been addressed in some many other ways by several languages that goes from the C++ way where you actually have namespaces to the C way where you don't worry about it and pick another name. From all of them I find this the most odd way to address it, specially when python was supposed to improve legibility by design (at least for me those underscores are very distracting)

mhenr18 12 years ago | | |

There's no need to make it not be valid. C++ uses begin() and end() for obtaining iterators to containers, but nothing's stopping you from using those method names for your own purposes.

It's just that if you want to use a few new language niceties like range-based for loops then you'll need to conform to that convention.

izzle9 12 years ago |

whoa did he just say static analysis is the future?

andreisoare 12 years ago | |

yeah, I'd love to understand the reason behind that statement.

nly 12 years ago | | |

PyPy?

bsaul 12 years ago | |

That line made me feel warm and fuzzy inside. He also mentionned mypy which i thought was a one man lonesome soon to be abandonned ( but absolutely fantastic) project.

fidz 12 years ago |

Could someone create/convert the PDF version? I don't think my computer good enough to open PPT* format

Edit: not really needed, just loaded the dropbox preview, it is still readable

mortenlarsen 12 years ago |

Angry noscript user here. Visit URL... almost blank page... with non-working download button. Enable Javascript... Get .pdf named .pptx.

justinmk 12 years ago | |

> Angry noscript user

Redundant, I think.

zwegner 12 years ago | | |

As a noscript user, I think you're probably right.

rpedela 12 years ago |

Does Python 3 fix the import system? Can I import a file from any location in the file system?

baq 12 years ago | |

you always could, see __import__ and imp module.

pstuart 12 years ago |

Now that I get to use Go, I have no desire to go back to python. I think I'm not alone, and that python will soon enough become the new perl.

U2EF1 12 years ago |

empty dict: {:} empty set: {}

Ah, the road not taken.

andyl 12 years ago |

It seems like I've been reading about the difficulties of Python V2 -> V3 for awhile. Why is that? Is this Python upgrade unusually difficult/ambitious? Or is the Python community just very reluctant to jump on new things?

mixmastamyk 12 years ago |

As a dev that hasn't been able to move to Py3 yet (but will soon), I'm wishing they'd fix the rest of the issues, bundle PyPy, and ship Python 4 instead! Make it a compelling upgrade.

Then Py3 could be nicknamed "Vista," I suppose.