Write Fast Apps Using Async Python 3.6 and Redis (eng.paxos.com)

337 points by midas 9 years ago | 128 comments

zzzeek 9 years ago |

> we make heavy use of asyncio because it’s more performant

more performant than....what exactly? If I need to load 1000 rows from a database and splash them on a webpage, will my response time go from the 300ms it takes without asyncio to something "more performant", like 50ms? Answer: no. async only gives you throughput, it has nothing to do with "faster" as far as the Python interpreter / GIL / anything like that. If you aren't actually spanning among dozens/hundreds/thousands of network connections, non-blocking IO isn't buying you much at all over using blocking IO with threads, and of course async / greenlets / threads are not a prerequisite for non-blocking IO in any case (only select() is).

it's nice that uvloop seems to be working on removing the terrible performance latency that out-of-the-box asyncio adds, so that's a reason that asyncio can really be viable as a means of gaining throughput without adding lots of latency you wouldn't get with gevent. But I can do without the enforced async boilerplate. Thanks javascript!
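A minimal sketch of the "blocking IO with threads" point above. The `blocking_query` helper is hypothetical and only sleeps to stand in for a blocking database or socket call; the thread pool comes from the standard library. Four blocking calls overlap and finish in roughly the time of one, with no async syntax in the calling code:

```python
import time
from concurrent.futures import ThreadPoolExecutor


def blocking_query(name, delay=0.1):
    # Hypothetical stand-in for a blocking network or database call.
    # time.sleep releases the GIL, as a real blocking socket read would.
    time.sleep(delay)
    return name


def run_in_threads(names):
    # One worker per call, so all the waits overlap.
    with ThreadPoolExecutor(max_workers=len(names)) as pool:
        # pool.map returns results in the order of the inputs.
        return list(pool.map(blocking_query, names))


if __name__ == "__main__":
    start = time.perf_counter()
    print(run_in_threads(["a", "b", "c", "d"]))
    print(f"elapsed: {time.perf_counter() - start:.2f}s")
```

Each individual call still takes as long as it did before; only the total wall-clock time for the batch shrinks, which is the throughput-versus-latency distinction made above.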

bysin 9 years ago | |

I'm glad you said this. There's an async cargo cult going on, where every service must be written in "performant" async code, without knowing the actual resource and load requirements of an application.

From the last benchmark I ran [1] async IO was insignificantly faster than thread-per-connection blocking IO in terms of latency, and marginally faster only after we hit a large number of clients.

Async IO doesn't necessarily make your code faster, it just makes it difficult to read.

[1] http://byteworm.com/evidence-based-research/2017/03/04/compa...

RedCrowbar 9 years ago | | |

A ~20% improvement in throughput and latency while using 50% less memory (which could allow more workers per-box) is not a "marginal" improvement in my book.

tuxracer 9 years ago | | |

const users = await getUsers();

const tweets = await getTweets(users);

console.log(tweets);

Is async code really harder to read?

tyingq 9 years ago | |

Heh. Somebody will assemble a few of these pieces, add a package manager for async oriented libs, call it node.py, and then market it a bit.

Then you'll really be irritated.

101km 9 years ago | | |

That's... actually not a bad idea. ᕕ( ᐛ )ᕗ

zzzeek 9 years ago | | |

I know it will have an ORM called NodeAlchemy

/calls lawyers

merb 9 years ago | |

well it can also make things faster. well in your example it won't. but consider you need to load 4 requests and do operations on each of them. if you schedule them in an async fashion you can begin operating on the first one that's ready and not the first one you defined. and this is also often the case. a website does not just do one request to the database. mostly it runs multiple ones and often they don't interfere. like getting 20 rows and the count as a whole, there is just no need to start the first and wait till you have 20 rows and then start the second. you should always start both and wait till you have both.

yes it does not magically make your fetching 100 rows faster or your pbkdf2()/bcrypt() function. you still need to wait for those.
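A rough sketch of the "start both and wait till you have both" idea, assuming a hypothetical `fake_query` coroutine that sleeps in place of a real non-blocking database call. `asyncio.gather` is the standard-library way to wait on several awaitables at once (`asyncio.run` needs Python 3.7 or later):

```python
import asyncio
import time


async def fake_query(name, delay):
    # Hypothetical stand-in for a non-blocking database query.
    await asyncio.sleep(delay)
    return name


async def sequential():
    # The second query is not started until the first one has finished.
    rows = await fake_query("rows", 0.1)
    count = await fake_query("count", 0.1)
    return rows, count


async def concurrent():
    # Both queries are started up front; gather waits for both
    # and returns the results in argument order.
    rows, count = await asyncio.gather(
        fake_query("rows", 0.1),
        fake_query("count", 0.1),
    )
    return rows, count


def timed(coro_fn):
    # Run a coroutine function to completion and report wall-clock time.
    start = time.perf_counter()
    result = asyncio.run(coro_fn())
    return result, time.perf_counter() - start


if __name__ == "__main__":
    print(timed(sequential))
    print(timed(concurrent))
```

To act on whichever result arrives first rather than waiting for all of them, `asyncio.as_completed` can be used in place of `gather`.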

zzzeek 9 years ago | | |

> if you schedule them in an async fashion you can begin operating on the first one that's ready and not the first one you defined.

This type of operation is a given in any production quality webserver, whether it runs with multiple threads and blocking IO or using a non-blocking approach with greenlets. For a web application, this is an implementation detail that should not be explicit within the request handling code (a request handled in the context of a web container after all is a package of data in, a package of data out. no network reading/writing is usually exposed to the web application unless it's trying to expose IO handles to the app, which is unusual). Easy enough with something like Gunicorn.

ma2rten 9 years ago | |

> Will my response time go from the 300ms it takes without asyncio to something "more performant", like 50ms?

If you have to do 1000 queries it could, since async could make it feasible to do them in parallel. If it's a single query, maybe async would make it feasible to shard the database.

manquer 9 years ago | | |

you usually see this pattern in ORMs with N+1 queries. If a single request requires 1000 db queries it is better to optimise the query.
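For illustration, a small sketch of the N+1 pattern next to a single-query version, using the standard-library `sqlite3` module and a made-up `authors`/`posts` schema (the table and function names are assumptions, not anything from the article):

```python
import sqlite3


def make_db():
    # Tiny in-memory database with a hypothetical schema.
    db = sqlite3.connect(":memory:")
    db.executescript("""
        CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE posts (id INTEGER PRIMARY KEY, author_id INTEGER, title TEXT);
        INSERT INTO authors VALUES (1, 'ann'), (2, 'bob');
        INSERT INTO posts VALUES (1, 1, 'a1'), (2, 1, 'a2'), (3, 2, 'b1');
    """)
    return db


def titles_n_plus_one(db):
    # One query for the authors, then one extra query per author.
    authors = db.execute("SELECT id, name FROM authors ORDER BY id").fetchall()
    result = {}
    for author_id, name in authors:
        rows = db.execute(
            "SELECT title FROM posts WHERE author_id = ? ORDER BY id",
            (author_id,),
        ).fetchall()
        result[name] = [title for (title,) in rows]
    return result


def titles_single_query(db):
    # The same data fetched with a single JOIN.
    rows = db.execute(
        "SELECT a.name, p.title FROM authors a "
        "JOIN posts p ON p.author_id = a.id "
        "ORDER BY a.id, p.id"
    ).fetchall()
    result = {}
    for name, title in rows:
        result.setdefault(name, []).append(title)
    return result
```

With two authors the difference is three round trips versus one; with 1000 authors the first version issues 1001 queries, which is the case where rewriting the query tends to help more than running the queries in parallel.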

matthewaveryusa 9 years ago | |

It buys you the stack size of each thread which only matters if you have a stupid amount of connections. In this article[1] the author makes a comparison between the 2 models and 7000 concurrent users will chew up 450MB of stack space. Of course this is adjustable.

[1] http://byteworm.com/evidence-based-research/2017/03/04/compa...

hawski 9 years ago | | |

On most Linux systems the stack is allocated with mmap with overcommitting. Until the first write all those pages will share the same zeroed page AFAIK. Then only overwritten pages will be allocated.

Am I wrong?

davesque 9 years ago | | |

How do you save on stack space with asyncio? Don't you have to keep the coroutine object in memory somewhere?

flukus 9 years ago | |

> more performant than....what exactly? If I need to load 1000 rows from a database and splash them on a webpage, will my response time go from the 300ms it takes without asyncio to something "more performant", like 50ms?

Potentially, it depends on if you can do other tasks for the same request that don't depend on the data. You might be able to render most of the page for instance. It's not purely about throughput.

Please tell me that 300ms was made up too and that it's not really taking that long.

RubyPinch 9 years ago | |

https://magic.io/blog/uvloop-blazing-fast-python-networking/... from the makers of uvloop (for a toy example)

it seems the main bottleneck when using aiohttp is aiohttp itself, which practically makes the use of uvloop irrelevant

anentropic 9 years ago | |

If you have to make several requests to db backend to fulfil one response then potentially asyncio allows you to make them in parallel rather than in series. Reducing latency of your response.

est 9 years ago | |

> If I need to load 1000 rows from a database and splash them on a webpage, will my response time go from the 300ms it takes without asyncio to something "more performant", like 50ms? Answer: no

Well, actually, yes. Without async rendering, your webpage is not ready until your 1000 rows of list is placed in Python memory then rendered to HTML as a whole then returned to your browser after like 300ms of server cost.

With async rendering, your webpage's headers and such can be returned immediately, thus your first-byte-to-response time can be done under 50ms, and your page loads by enumerating the rest of 1000 rows and renders the page incrementally.

RubyPinch 9 years ago | | |

Well you can do all of that sync, can't you?

    def on_connection(send, db):
        send(headers)
        send(start_of_page)
        for row in db:
            send(row)
        send(footer)

will have the exact same effect as what you said (not like that applies regardless, I don't think jinja outputs partial renders, since it's made for flask)

The performance comparison is between python managed green threads, and OS managed actual threads. You don't get any new features
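A runnable variant of the synchronous streaming idea, written as a plain generator. The function name and the HTML it emits are made up for illustration; WSGI-style servers can consume an iterable like this and send each chunk as it is produced:

```python
def render_rows(rows):
    # Yield the page in pieces so the caller can send each chunk
    # before the remaining rows have been rendered.
    yield "<html><body><table>\n"
    for row in rows:
        yield f"<tr><td>{row}</td></tr>\n"
    yield "</table></body></html>\n"


if __name__ == "__main__":
    for chunk in render_rows(["row 1", "row 2"]):
        print(chunk, end="")
```

No event loop is involved: the first chunk is available as soon as the generator is advanced once, regardless of how many rows follow.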

zzzeek 9 years ago | | |

That's a client streaming optimization, not related to the subject at hand which is non-blocking network IO. Assume the service returns a JSON structure. It won't get to the end any faster.

Dowwie 9 years ago | |

You are the hero we need, Mike

erikcw 9 years ago |

We've just recently started using Sanic[0] paired with Redis to great effect for a very high throughput web service. It also uses Python 3 asyncio/uvloop at its core. So far very happy with it.

[0] https://github.com/channelcat/sanic

mixmastamyk 9 years ago |

Can anyone recommend a good book to get started on concurrency, with discussions of models, and a few implementations such as golang and python 3.5+?

While I can write this kind of code, I don't feel like I completely understand some of the concepts.

mrks_ 9 years ago | |

I'm not far into it, but Seven Concurrency Models in Seven Weeks is pretty good and might fit what you're looking for.

https://pragprog.com/book/pb7con/seven-concurrency-models-in...

Matthias247 9 years ago | | |

I recommend the book very much. However it doesn't have a chapter on single-threaded concurrency handling (with event loops, futures/promises and sometimes even plain callbacks), which is currently en vogue in lots of languages (JS, python asyncio, boost asio, etc). So this is something one should look up elsewhere.

dom0 9 years ago | |

Concurrency and parallelism is such a huge landscape of difficult problems and complexity that I doubt any such introduction exists. I never found one, anyway.

michaelmcmillan 9 years ago |

Ouch: https://github.com/paxos-bankchain/pastey/blob/master/app.py...

brut 9 years ago | |

https://github.com/paxos-bankchain/pastey/blob/master/app.py...

OhSoHumble 9 years ago | | |

Oh, that's hilarious.

jeromeparadis 9 years ago | |

In their defense, it seems to be an app that demonstrates usage of the library. It also seems to be used for benchmarking. That would explain why the Redis database can be easily flushed through a simple URL.

secstate 9 years ago | |

That's actually a call to a rickroll.

cdelsolar 9 years ago | | |

It still flushes the db first?

michaelmcmillan 9 years ago | | |

Hence the ouch.

cies 9 years ago |

> Write Fast Apps Using Async Python

When working with Python and Ruby I find 80ms responses acceptable. In very optimized situations (no framework) this can go down to 20ms.

Now I've used some Haskell, OCaml and Go and I have learned that they can typically respond in <5ms. And that having a framework in place barely increases the response times.

In both cases this includes querying the db several times (db queries usually take less than a millisecond, Redis shall be quite similar to the extent that it does not change the outcome).

<5ms makes it possible to not worry about caching (and thus cache invalidation) for a much longer time.

I've come to the conclusion that --considering other languages-- speed is not to be found in Python and Ruby.

Apart from the speed story there's also resource consumption, and in that game it is only compiled languages that truly compete.

Last point: given the points I make above and that nowadays "the web is the UI", I believe that languages for hi-perf application development should: compile to native and compile to JS. Candidates: OCaml/Reason (BuckleScript), Haskell (GHCJS), PureScript (ps-native), [please add if I forgot any]

dom0 9 years ago | |

You can get 2-3 ms response time (sans network) with any of Django, Flask and Pyramid. Database queries tend to eat a lot, esp. if the queries are bad (long wait in the DBMS or post-filtering in Python/whichever); sometimes ORMs can eat a fair bit as well. But it's fairly rare to get that low, most pages for me (that I cared about) will take 10-30 ms. Using the correct tools and the right approach is fruitful as always.

cies 9 years ago | | |

> You can get 2-3 ms response time (sans network) with any of Django, Flask and Pyramid.

Wow, never managed to do that. Maybe I have to try it again (last time checked on Django was some years ago).

jitl 9 years ago |

> Paxos.com

I'm confused by the relationship between Paxos, the company, and Paxos, the algorithm. Do the authors of Paxos work for Paxos?

Edit:

https://en.m.wikipedia.org/wiki/Paxos_(computer_science)

Ah; both are named for a fictional financial system.

lou1306 9 years ago | |

By the way, the author of the original Paxos paper is Leslie Lamport, who currently works at Microsoft Research.

ipsum2 9 years ago |

The title is misleading. The blog post doesn't cover how fast using async python is, it's a tutorial on how to use their ORM redis library.

mixmastamyk 9 years ago | |

There's a link on the page which digs in a bit more:

https://magic.io/blog/uvloop-blazing-fast-python-networking/

jackbravo 9 years ago | | |

And that topic was discussed in HN previously with 130 comments:

https://news.ycombinator.com/item?id=11625585

StreamBright 9 years ago |

>>> The performance of uvloop-based asyncio is close to that of Go programs.

I would prefer standard benchmarks for this. I hope they submit their framework to the TechEmpower benchmarks.

https://www.techempower.com/benchmarks/

pekk 9 years ago | |

Those benchmarks aren't any more standard than anything else.

StreamBright 9 years ago | | |

Yes, but you can see the largest number of frameworks there running on the same hardware with the same settings doing the same job. You can also see the configuration used to achieve that.

njharman 9 years ago |

> You get the benefits of a database, with the performance of RAM!

One of the benefits of modern RDBMS is that they make extremely sophisticated use of RAM, and all levels of fast to slow storage below that SSD / RAIDs / slow single spindle.

siscia 9 years ago |

Quite related, but if you want to use Redis as a SQL database I wrote an extension to do just that: https://github.com/RedBeardLab/rediSQL

It is a relatively thin layer of Rust code between the Redis module interface and SQLite.

At the moment you can simply execute statements, but any suggestions and feature requests are very welcome.

Yes, it is possible to do joins, to use the LIKE operator and pretty much everything that SQLite gives you.

It is a multi-threaded module, which means that it does NOT block the main redis thread and performs quite well. On my machine I achieved 50,000 inserts per second for the in-memory database.

If you have any questions feel free to ask here or to open issues and pull requests in the main repo.

rcarmo 9 years ago |

This is pretty neat. I've been using a plain Redis wrapper (aioredis) with uvloop and Sanic (https://github.com/rcarmo/newsfeed-corpus), but I'm going to have a peek at subconscious.

VT_Drew 9 years ago |

>One of the common complaints people have about python and other popular interpreted languages (Ruby, JavaScript, PHP, Perl, etc) is that they’re slow.

Proceeds to show an animation of posting a blog post that performs no faster than if it was built using Django.

NightlyDev 9 years ago |

> 10k pageviews took ~41s

Might be that the server is insanely slow, but I would have no problems reaching 10k page views per second with some basic PHP and even MariaDB on a low end E3-1230 server. Pretty sure more would be quite easy to...

fritzy 9 years ago |

It seems strange that they would claim that Python's libuv based event loop is twice as fast as Node.js's libuv based event loop. There's some context missing to that statement or it's flat out false.

GroSacASacs 9 years ago | |

What does it even mean? The event loop is only used when there is nothing going on. Is it faster at doing nothing?

1st1 9 years ago | | |

> The event loop is only used when there is nothing going on.

In async applications the event loop is what actually executes your code and performs IO. In essence, event loops are under load all the time.
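A small sketch of that point using only the standard `asyncio` loop API (the `demo` function and the strings are made up for illustration). Callbacks handed to `call_soon` do nothing until the loop runs, and it is the loop itself that invokes them:

```python
import asyncio


def demo():
    order = []
    loop = asyncio.new_event_loop()
    try:
        # Scheduling only queues the callbacks; nothing runs yet.
        loop.call_soon(order.append, "first callback")
        loop.call_soon(order.append, "second callback")
        loop.call_soon(loop.stop)
        before = list(order)
        # The loop now executes the queued callbacks in FIFO order,
        # then stops because loop.stop was the last one queued.
        loop.run_forever()
    finally:
        loop.close()
    return before, order


if __name__ == "__main__":
    print(demo())
```

Coroutines work the same way underneath: each step of a task between two `await` points is run by the loop as a scheduled callback, so a busy application keeps the loop busy.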

hasenj 9 years ago |

If you want performance don't use Python.

hasenj 9 years ago | |

I hope the downvotes are not due to people thinking you can actually write high performance applications in Python.

twistedpair 9 years ago | |

Sadly true. Python is great for scripting, but "high performance Python" is frequently a challenge better suited to other tools.

floatboth 9 years ago | | |

"High performance Python" is usually done by "offloading literally everything to native extensions" :D

davesque 9 years ago | |

Until JIT authors start taking advantage of the frame evaluation API that was added in 3.6.

theprop 9 years ago |

This is to get a high performance ready app out. You could probably get an app out faster in PHP or Meteor or other prototyping framework.