Serving a high-performance blog solely from memory, using Rust

jmillikin 3 years ago |

There may be further opportunities for improvement.

Chrome and Curl both report it takes about 1100ms to load the linked page's HTML, split about 50/50 between establishing a connection and fetching content. I'm not sure how the implementation works internally but that seems like a long time for a site served from memory and aiming to be "high-performance". The images bring the total time up to around 5.7s.

As a point of comparison, my site (nginx serving static content, on the 0.25 CPU GCP instance) serves the index page in 250ms. Of that, ~140ms is connection setup (DNS, TCP, TLS). The whole page loads in < 1000ms.

https://i.imgur.com/X4LDbWj.png

https://i.imgur.com/Ccwzmgz.png

One thing to remember is that when a server like nginx serves static content, it's often serving it from the page cache (memory). The author of Varnish has written at some length about the benefits of using the OS page cache, for example <https://varnish-cache.org/docs/trunk/phk/notes.html>. Some of the same principles can be applied even for servers that render dynamically (by caching expensive fragments).

xena 3 years ago | |

Author here. I wrote that post before I axed the CDN for my blog site itself. It was true at the time of writing, but it is not true anymore because I need to redo the CDN for the blog itself. All the images are CDNed with XeDN though.

georgyo 3 years ago | | |

I'm trying to parse what you are saying here.

You removed the CDN and the site got slower?

How do you know your site was the one that was fast or just the CDN? IE, the CDN should have added a lot of extra hops and made things slower.

To me, this implies the rust code is very poor at opening and closing connections, so the CDNs keep alive is pasting over that issue.

xena 3 years ago | | |

The main thing the CDN provided was nodes on basically every continent that kept the site in cache. Without those servers on every continent keeping the site in cache, it takes longer to get to the netherlands to get the site loaded. The speed of light is only so fast.

hn92726819 3 years ago | | |

If they are keeping your article cached, what's the point of saying it's a high-performance blog? Saying it's slow because the CDN down means that it's just slow... You can have a 'high performance' blog run on a raspberry pi zero if it's globally cached by someone else, but then I wouldn't say that's high performance.

Cool article though. Agree on the ructe part, and I dislike how whitespace is handled. I wish Jade/Pug templates could be done in rust but will check out Maud.

xena 3 years ago | | |

The blog itself is fast. The internet is the slow part.

progrus 3 years ago | | |

And probably more importantly, routing packets through international traffic all the way to the Netherlands takes a while too.

MuffinFlavored 3 years ago | | |

> The speed of light is only so fast.

Is the Internet not connected internationally (US -> Europe for example) via cables underneath the ocean? Speed of light would be satellite, light? Not electric current?

Or is electricity flowing through a wire also "speed of light"?

tsimionescu 3 years ago | | |

First of all, the "speed of light" is usually referring to c, the maximum speed that matter or energy can move at.

Second of all, electrical signals in cables move at speeds slightly lower than c, but very close to it, so the speed of light is still a very good approximation of the possible upper bound.

Third of all, intercontinental cables are normally fiber optic, for several reasons. That is, they directly transmit light through the cable.

Fourth, it should be noted that electricity is actually the same thing as light, since photons are the carrier particles of the electric field (when two charged particles interact, they are actually exchanging a photon). It's of course not visible light, but satellite communication also uses radio waves normally, which are not visible light either.

Finally, either through cables or through satellite communication, the distance/c minimum theoretical one-way latency is usually a significant under-estimation of the actually possible minimal latency, since the straight-line distance is significantly shorter than the actual cable/satellite-and-back distance that the signals must travel - the difference in straight-line VS physical path distance is typically much larger than the difference between the theoretical speed of light and the actual speed of the electrical signal propagation.

potatochup 3 years ago | | |

Most (all?) Intercontinental cables will be fibre optic. So it's the speed of light (in glass, not a vacuum)

xena 3 years ago | | |

I'm not an expert in cross-continent interconnects, I have no idea what cables are being used there. I'd imagine that a lot of the backbone of the internet is fiber because that's what all the SRE memes say about wandering backhoes and sharks being the primary predator of fiber optic cables.

Matthias247 3 years ago | | |

Nothing is faster than speed of light. In fact signals transmitted via copper wires are traveling at 2/3 the speed of light. Don't know the details about fiber ocean links, but it certainly won't be faster than speed of light.

pkhuong 3 years ago | | |

These cables are fiber optics, but either way, the speed of light is still a bound.

Groxx 3 years ago | |

As a contrasting point: I'm consistently getting 150ms from their main domain, and 25-35ms from their cdn subdomain. I suspect most of your latency is from "the internet".

vinay_ys 3 years ago |

After going to the end of a long post, I'm disappointed to not find any latency or throughput efficiency metrics. Author seems to claim he has a very popular high-traffic blog and it is super fast, faster than all the popular web servers serving static pages. Where's the performance data to prove this?

edit: web.dev measure gave this blog post url a performance score of 30/100 which is quite poor.

xena 3 years ago | |

Ripping out cloudflare made the metrics slower. I wrote this post before I ripped out cloudflare and it was accurate at the time of writing. It will be better once I can re-engineer things to be anycasted.

kixiQu 3 years ago | |

Author isn't a man (https://github.com/Xe)

lionkor 3 years ago | | |

> Nephelemancer, Kastermakfa, Hacker, Ordained Minister [...] Please call me (order of preference): Xe/xer, They/them or She/her please.

deathanatos 3 years ago | |

web.dev seems to give it a poor score primarily because of the YouTube embed … so perhaps Google should heed its own advice?

Jabbles 3 years ago |

It would be good if the post contained some data to justify its points, like a graph of loading times. Otherwise assertions like "So fast that it's faster than a static website." don't seem supportable.

I would have liked to see the actual results from this comparison: "I compared my site to Nginx, openresty, tengine, Apache, Go's standard library, Warp in Rust, Axum in Rust, and finally a Go standard library HTTP server that had the site data compiled into ram."

xena 3 years ago | |

I'm sorry but I have lost that data after some machines got reinstalled. I can attempt to recreate it, but that will have to wait for a future blogpost.

trh0awayman 3 years ago |

I want to see this taken to the logical extreme. A real OS with actual drivers (no unikernel, no virtio) for a small set of hardware that only serves static pages. No need for virtual memory. Just hardcode the blog posts right into the OS and use the most minimal TCP stack you can make.

Thaxll 3 years ago |

How can it be faster than a static page that is already in memory, the bytes are there you just send them over a socket? Transforming some template to rust code back to string buffer is somehow faster?

greenhearth 3 years ago |

The tech is cool, but some of the language is so cringy. For example, the statement "websites are social constructs" makes zero sense. You could say that websites are material objects of a symbolic network of computer languages, like physical paper money is a material, fetishized object of the social construct of money. Websites themselves are not constructed socially. Maybe the author means how websites are perceived, or conventions of web tech itself, is constructed socially?

19h 3 years ago |

You don't need Rust for this -- you can do the same in Go, Node, etc. In 2012 my cheap VPS had a crappy HDD share but fairly acceptable memory, so I rendered the Markdown files and stored them in a little structure, returning them directly from memory.

Everyone thought it was amazing even though it was just a dumb http server returning pages[req.path] :-) Latency was under 10ms which was pretty amazing for a 2012 KVM VPS.

NoraCodes 3 years ago | |

I don't think OP was implying that Rust was a requirement, just what was actually used in this case. And, indeed, OP gives some reasons that Rust might be preferable:

> And when I say fast, I mean that I have tried so hard to find some static file server that could beat what my site does. I tried really hard. I compared my site to Nginx, openresty, tengine, Apache, Go's standard library, Warp in Rust, Axum in Rust, and finally a Go standard library HTTP server that had the site data compiled into ram. None of them were faster, save the precompiled Go binary (which was like 200 MB and not viable for my needs). It was hilarious. I have accidentally created something so efficient that it's hard to really express how fast it is.

amelius 3 years ago | | |

Rust is preferable in this case since there is no manager shoving more requirements on the project every week.

In the real world, use Go, Node, etc.

NoraCodes 3 years ago | | |

At the last place I worked, mean time to feature on the Rust codebase was like a week.

xani_ 3 years ago | |

I did that in Go although it was "only" caching the markdown rendering - the page templates were written in Go (via some lib that gave tools to make that mangeable) and compiled with the app so the whole template building was blazingly fast.

spullara 3 years ago |

Measuring the performance of a CDN isn't that interesting. This is about the fastest blog I have seen and it doesn't have a CDN in front of it:

https://www.lukew.com

manuelmoreale 3 years ago |

I get the fun for a developer to set up something like this to experiment and learn new things. But I'm left with a question: why? Like, is there really a point aside for the aforementioned intrinsic dev fun?

There has to be a point of diminishing return. And again, I'm not discarding the dev side of things but it seems a lot of extra tooling and complexity cor not much gain.

whalesalad 3 years ago |

I admire the OP's ability to use their blog as a rapid prototyping platform that is constantly growing and changing. Over engineering on a personal project like this is the whole point! Very cool.

I am too much of an OCD perfectionist and don't have the guts to ship this often.

xena 3 years ago | |

The trick is to do lots of little changes that are easy to do in isolation. Then do bigger changes later after you learn what you messed up.

I have CDO too but I work around it by sheer trolling with infrastructure, like my hacked up to hell CDN: https://xeiaso.net/blog/xedn

hinkley 3 years ago | | |

Relentless Refactoring is a great tool, but one that is often stymied by faddish behaviors like micro-services/modules. Small projects tend not to have that problem and so make a better petri dish. Of course then you have to take your knowledge out of the 'lab' and apply it in vivo...

A lot of our (and in particular, my) best features come from of relocating the boundaries between things, to make space for features that weren't considered in the original design. With monolithic systems we see this late in the lifecycle in the form of Conway's Law. If you stick this problem in front of the CI/CD mirror, it's painful to face. CI/CD argues that if something is difficult we should do it all the time so that it's routine (or stop doing it entirely).

However there's a conspicuous lack of tools and techniques to make that practical. The only one I really know of is service retirement (replace 2-3 services with 2 new, refactored services), and we don't have static analysis tools that can tell us deterministically when we can remove an API. We have to do it empirically, which is fundamentally on par with println debugging.

whalesalad 3 years ago | | |

I love the idea of "relentless refactoring"

treffer 3 years ago |

Website title: My Blog is Hilariously Overengineered to the Point People Think it's a Static Site

Seeing the initial comments here I think it would be better to go with the original title.

mkl95 3 years ago |

You can build and deploy a blazingly fast blog within minutes with Django, Gunicorn and Nginx. This is cooler though.

pradn 3 years ago |

Question for the author: do you have numbers to share about performance relative to other static site servers?

Great blog by the way :)

xena 3 years ago | |

I had the numbers at one point, but I have lost them. I can try to recreate them, but I'd probably have to use my old mac pro again to be sure the results are consistent.

AJRF 3 years ago |

Is this the same author that made the talk about PAM recently? I really like his articles.

xena 3 years ago | |

Thanks!

I'm not a guy, I'd prefer if you used they to refer to me, but she works too.

The PAM one was a really fun talk to write. I need to finish that postmortem on how that talk went wrong.

epolanski 3 years ago | | |

I'm confused about the they pronoun, isn't it plural? This pronoun thing doesn't exist in my native language.

xena 3 years ago | | |

See https://en.wikipedia.org/wiki/Singular_they for more information. It's been around since the 1300's but it's only gone into the mainstream as someone to refer to direct people fairly recently.

epolanski 3 years ago | | |

Thank you.

allan_s 3 years ago |

cppcms was (is?) using something similar , you write in a template language, and it get compiled into c++ code

http://cppcms.com/wikipp/en/page/main

https://github.com/Tatoeba/tatowiki the wiki of tatoeba.org ( https://en.wiki.tatoeba.org/articles/show/main# ) is written in it

Existenceblinks 3 years ago |

Ah same as (precompiled + loaded into memory):

https://dashbit.co/blog/welcome-to-our-blog-how-it-was-made

apstats 3 years ago |

This loaded pretty slowly for me (2 seconds) and also has aggressive page layout changes. It’s almost like for 99% of software the most important part is UX not the low level programming language that is chosen

robertlagrant 3 years ago |

Without reading: why do Rust folks think it's better if they memorise a website and serve it, instead of using a computer?

HillRat 3 years ago | |

The borrow checker is much less strict if the data only lives inside your skull. Much harder to mutably borrow.

hit8run 3 years ago |

Can you use more rust to serve the 7 readers of a blog? You know what: use caching or something that compiles to plain html (hugo, jekyll etc.). No need for hardcore memory optimization.

xani_ 3 years ago |

And then all the gains were entirely eaten by first hop to a network device. Speaking from experience as I did similar thing, although speed was not a concern, just perpetual annoyance with available tools for blogging.