Optimizing Python – A Case Study

vosper 11 years ago |

He's missing Cython, which is another good option when you're looking for speed.

My personal favourite optimisation, from needing to shave a few milliseconds off our API response times, was discovering that it's measurably slower to use * args and * *kwargs, and switching to explicitly declaring and passing arguments in the relevant parts of the code.

We also did a few other neat things:

- Rolled our own UUID-like generator in pure Python (I was surprised this helped, but the profiler doesn't lie)

- Switched to working directly with WebOb Request and Response objects rather than using a framework

- Used a background thread with a single slot queue to make sure our response was returned to the user before we emitted the event log message, but always emit the message before moving to the next request

- Heavy optimisation of memcache / redis reads and writes

Edit: Fixed formatting

jMyles 11 years ago | |

- Used a background thread with a single slot queue to make sure our response was returned to the user before we emitted the event log message, but always emit the message before moving to the next request

The crosstown_traffic API in hendrix does exactly this.

https://github.com/hangarunderground/hendrix

vosper 11 years ago | | |

Hey, that's kinda cool - thanks for the link!

sitkack 11 years ago |

The order of tactics to take is wrong. In terms of energy expended, one should use PyPy first! It is amazingly compatible with CPython and can now be embedded directly in CPython programs, https://github.com/fijal/jitpy (supports numpy arrays)

Dump your virtualenv, create a new one with pypy, reinstall libraries and test your app. Takes less than 20 minutes, even for complex applications.

lqdc13 11 years ago | |

First thing you should do is optimize data structures IMO.

This is the advantage Python has over lower level languages - easy way to implement complicated things.

Kind of like Linus's quote: "Bad programmers worry about the code. Good programmers worry about data structures and their relationships."

sitkack 11 years ago | | |

I would say it is a close second thing. If I have a slow system, I will move to faster runtime before modifying any code. Going from CPython to PyPy, if possible will almost always gain you enough perf increase while you refactor the slow parts.

ma2rten 11 years ago | |

Except if you are using python 3 or numpy or any other library written in C.

Veratyr 11 years ago | | |

A lot (but definitely not all) of Numpy is actually Pypy compatible: http://buildbot.pypy.org/numpy-status/latest.html

riquito 11 years ago | | |

If you know that you can't us PyPy you can remove it from the list, otherwise sitkack has a point.

sitkack 11 years ago | | |

I have never used it, but PyPy3 is CPython3.2.5 compatible.

ryan_sb 11 years ago | |

Tbh, the order in that list wasn't meant to be sorted by difficulty, but I can go back and swap that around.

clickok 11 years ago |

Serious question: if you have some code that really has to be fast, is it viable to keep it in Python, or should you ultimately end up rewriting it in a compiled language?

For example, I am writing code that implements networks that evolve over time for AI research. Prototyping it in Python makes it easy to test things out, but I expect that I will have to rewrite it in C++ or maybe something more fun, like Haskell[1].

1. Mostly for the sheer joy of trolling my colleagues with a learning agent monad.

thezilch 11 years ago | |

You might "just" migrate some slower parts to numpy or write a C target and interface with it over Python's CFFI.

eikenberry 11 years ago | | |

+1 numpy could probably handle this

pepijndevos 11 years ago |

> Think for a second: time is only ever going to increase

Well, most of the time at least. Think about DST and leap seconds.

ryan_sb 11 years ago | |

True, but in this particular case, the cost to time going (temporarily) backwards would only be connecting to a potentially-suboptimal disque node, and that would be remedied after the subjective-machine-time caught up with the previous subjective time.

mangeletti 11 years ago |

Isn't Jython not a JIT compiler, and isn't Jython much slower than cPython?

coldtea 11 years ago | |

Nope and Nope. Jython is made to run on the HotSpot which is a JIT compiler, and Jython should be comparable to speed to cPython and faster in some cases (used to be slower, but that was 3-4 years ago, the optimized it a lot, and added stuff in Java 7/8 helped too).

JRuby is faster than standard C Ruby too.

DasIch 11 years ago | | |

There is a huge difference between an interpreter with a JIT compiler and an interpreter running on another interpreter that has one. These are not equivalent at all.

fnord123 11 years ago | | |

Do you have a source on the benchmark results suggesting that Jython is comparable to CPython speeds?

makmanalp 11 years ago | |

If you run Jython on the hotspot JVM, that'd count as JIT. As for the speed, I'm not sure.

wtetzner 11 years ago | |

Well, it runs on the JVM, so depending on which JVM you use, it might use a JIT compiler.

aburan28 11 years ago |

Pythran is also missing

accounthere 11 years ago | |

As is Nuitka, but who keeps track of these things?