Quack: The DuckDB Client-Server Protocol

Quack: The DuckDB Client-Server Protocol(duckdb.org)

387 points by aduffy 51 days ago | 83 comments

I was just wishing something like this existed last week. What timing.

I'm piping sensor readings into duckdb with a deno server, and couldn't use duckdb -ui to look over the data without shutting down the server. I had no interest in using the server to allow me to look at the contents of the db, so I was just going to live with it for now. This perfectly solves that, along with several other similar kinds of problems I've encountered with duckdb.

duckdb is my favourite technology of 2025/26. It has worked its way into so many of my workflows. It's integral to how I work with LLMs, how I store all kinds of data, analytics, data pipelines... I love it.

malnourish 50 days ago | |

Can you expand more on how you use it in your workflows? I'm very interested but I haven't incorporated it into my problem solving mindset yet so I don't even know what use cases I could map to it.

steve_adams_86 50 days ago | | |

I think one of the most common and elevating methods of using it has been combining disparate sources of data into multiple tables of a single instance so I can run queries locally and use DuckDB as a bridge between platforms.

Yesterday I pulled a bunch of data from Sentry, multiple log groups on AWS, and Github to figure out when some incidents occurred and how events correlated or caused each other.

Doing that in other tools is perfectly possible and fine, but the overhead of setting up a docker container or understanding requirements for setup or needing an account or whatever bespoke query language makes me lose interest immediately.

With this I only need to know SQL, optionally duckdb -ui, roughly how to ingest the sources correctly so they can be joined easily (in this case just make sure everything is a UTC time series), and I'm mostly off to the races. It works fine.

There are more sophisticated and cool and whatever ways to do this, but with Claude as an assistant you can do this with like 3.5 brain cells and get absolutely incredible results.

DuckDB is awesome partially because of how effortless it is and how little ceremony there is. Like SQLite, but even less friction. Having duckdb -ui as a little work bench is brilliant.

rglover 51 days ago |

This is rad. I've been eyeballing using DuckDB in my firm's internal app framework and this just solved the "but how do I horizontally scale this" problem. Kudos to the DuckDB folks. Love "Quack" for the protocol name, too.

smithclay 51 days ago |

Been working on open-source projects involving storing and querying observability data (metrics, logs, traces) in parquet[0] and have been frustrated with the usability of Apache Iceberg … despite strongly agreeing and wanting to use an open storage format and catalog.

This makes Ducklake much more interesting for my use case, excited where this is going.

[0] https://github.com/smithclay/duckdb-otlp

esafak 50 days ago | |

Are you using it to replace Mimir?

smithclay 50 days ago | | |

Not yet.

That said… think duckdb/ducklake/quack could potentially be a future replacement for Mimir or Clickstack with way less operational complexity.

simlevesque 51 days ago |

I like DuckDB but I'm not sure what it wants to be. There's always new ways to use it and it's not easy to see what's the right one.

NortySpock 51 days ago |

Sounds useful for small-ball internal analytics datasets you want to place on shared team server.

I can definitely see exploring this for some homelab use.

arpinum 51 days ago | |

With ducklake this scales well to multi-terabyte data sets. The big benefit of this server protocol is sharing a high memory server and taking advantage of a shared cache for recent data.

feverzsj 51 days ago |

They didn't explain what "concurrent writers" is. But seems it's just serialized writes on server side.

geysersam 50 days ago | |

I don't think that's correct. DuckDB already supports concurrent writes within one process. I don't see why this would suddenly serialize all writes.

hermitcrab 51 days ago |

I have a C++ application. Everything is in memory during execution. Saved to disk between session as XML. Works great, except that that it is strictly single user and some of my customers would love me to generalize it for multiple concurrent users reading and writing. Performance requirements are quite low - a few thousand records being updated by 2 or 3 people at a time. Would DuckDb + Quack be a good choice for this? Or are there better choices? I looked at SQLite, but I understand it doesn't operate as client server.

password4321 51 days ago | |

https://firebirdsql.org has been flying under the radar in-between SQLite and full-blown PostgreSQL for decades, but if you're asking which client-server database to use PostgreSQL is the default recommendation.

hermitcrab 50 days ago | | |

Did some reading. Given my modest performance requirements, Firebird might be a good choice due to simpler install and admin. Thanks.

appplication 51 days ago | |

DuckDB is more for analytics. I don’t think you’re going to find good options for a DB that can handle concurrent users without hosting it in some way server side. It’s certainly possible (think how some games create their own client servers for direct multiplayer) but honestly hosting Postgres or SQLite is ridiculously cheap, easy, and more importantly the standard approach to this issue.

hermitcrab 51 days ago | | |

IIRC SQLite is in-process and says in it's documentation that it is not a client-server database.

apitman 51 days ago | |

I think the term you want to search for is local-first.

hermitcrab 50 days ago | | |

My understanding is that Local First means syncs across multiple devices, which is not the same thing as multi-user concurrent access.

WebBurnout 50 days ago | |

Sounds like a good use case for CRDTs, which would also enable offline editing

hermitcrab 50 days ago | | |

In my use case I have 2 or 3 users editing the same database concurrently and they all want to see other's updates in near real time (within a second or two). Would a CRDT support that? It would be great if it did and I could just keep using XML to persist everything with no server. But that sounds unlikely.

microflash 51 days ago |

This is fantastic. I’ve been building an Excel-like but columnar spreadsheet app using DuckDB and had to reinvent the “client” through classic HTTP layer.

philbe77 49 days ago |

Two new Quack client drivers:

ADBC: https://github.com/gizmodata/adbc-driver-quack JDBC: https://github.com/gizmodata/quack-jdbc

mritchie712 51 days ago |

> Can I use DuckDB with Quack as the catalog database for DuckLake?

> Not yet, but we are working on it!

Seems like a niche use case, but it's the one I'm most interested in.

Our lakehouse uses ducklake with postgres as the catalog. Seems like a DuckDB / Quack catalog would be an excellent alternative.

szarnyasg 51 days ago | |

Well, we are really working on it: https://github.com/duckdb/ducklake/pull/1151

So you'll be able to test it in a few days.

IceWreck 51 days ago | | |

Does this mean I can finally connect to a ducklake instnace hosted remotely? i.e. DuckLake is writing to disk on the remote server and my client is just a client.

Because rn even with Postgres as a catalog my client needs access to the underlying storage to use Ducklake.

mritchie712 50 days ago | | |

already works now! just tried it out

pdet 51 days ago | |

I think that Quack will become the primary option for a DuckLake catalog in the future, for several reasons. To list a few:

1. No type mismatches for inlining. If you use a non-DuckDB catalog, many types do not have a 1:1 mapping, which introduces additional overhead when operating on those data types.

2. You get the raw performance of DuckDB analytics (and now transactions) over the catalog. DuckDB reading DuckDB is simply faster than any of our Postgres/SQLite scanners.

3. No round-trip for retries. We can easily(tm) run the full retry logic on the DuckDB server side. Right now, these retries trigger multiple round trips for Postgres, making it a performance bottleneck for high-contention workloads.

Disclaimer: I'm a duckdb/ducklake developer.

dangoodmanUT 51 days ago | | |

This. Type casting is an insidious problem (both correctness, and perf)

bourse_lee 49 days ago |

What drives such a high throughput difference between Quack and Arrow on high-volume operations ?

I'll try to search from source/Github, reply appreciated though, for example:

- when DuckDb bulk exports a table, does Quack benefit from pre-existing compression/encodings/0-copy where Arrow requires decode+re-encode ?

- the post mentions parallel reads, is the level of parallelism the same on Arrow vs Quack here ? Running the high throughput benchmark at resource saturation with increasing number of concurrent bulk-read clients would be more transparent

ashkankiani 51 days ago |

My first thought: setting up a self replicating duckdb wrapper over ssh so that I can execute queries on any computer. Can’t wait to play with this!

bfeynman 48 days ago |

Very hyped for this and updates. Have been using my own workarounds for a while with own WAL things and then sort of generating snapshots which with duckdb is so cheap was simpler than really implementing concurrent writes and mutations but this will make it so much easier.

philbe77 49 days ago |

TypeScript: https://www.npmjs.com/package/@quack-protocol/sdk

ozgrakkurt 51 days ago |

> It would be rather misguided not to build a database protocol on top of HTTP in 2026

This is wrong, HTTP is bad for transferring large amount of data and it is also bad for doing streaming.

It is bad for large amount of data because you have timeout issues on some clients, you hit request/response size limits etc.

It is obviously bad for streaming as there is no concept of streaming in it.

It is comical to go the path of least resistance so lazy people can put a reverse proxy on top of it. And then say HTTP is the only relevant way to do it in 2026.

The benchmark doesn't seem to mean much as TCP can max out 50GB/s on a single thread. Pretty sure it can do more than that even. So you could be using anything that isn't terrible and you should get max performance out of this.

Also the protocol is something else from the format. For example if you are transferring mp4 over ftp and http you can compare that.

If you are transferring different things over different protocols then the comparison means nothing.

The benchmark graph for bulk transfer should show more granularity so it is possible to understand how much of the % of the hardware limit it is reaching. Similar to how BLAS GEMM routines are benchmarked based on the % of theoretical max flops of the hardware.

> 60 million rows (76 GB in CSV format!)

This reads a bit disingenuous.

It is dissappointing to see this instead of something like PostgreSQL protocol with support for a columnar format.

timsuchanek 51 days ago |

This is very exciting. Now we just need this for Postgres as well.

znite 51 days ago |

Does this work with duckdb-wasm?

PhilippGille 51 days ago | |

It's in the article:

> HTTP also allows the DuckDB-Wasm distribution to speak Quack natively! So DuckDB running in a browser can e.g., directly connect to a DuckDB instance running in an EC2 server using Quack.

anentropic 50 days ago | | |

I missed that and it seems like one of the more compelling features...!

znite 51 days ago | | |

Thanks, thought I searched for it & didn't come up. Great stuff

philipallstar 50 days ago | | |

That is a pretty amazing feature.

neomantra 50 days ago | |

Although a maintainer answered you, watch the video from the blog. There's a WASM demo at the end, which is great. It also has a good explainer for those confused about the HTTP decision.

And I appreciate that the Hannes still appreciates the magic of the WASM. [And I keep hearing quark which makes me hungry for tangy creamy German yogurt]

hfmuehleisen 51 days ago | |

Maintainer here. Yes!

znite 51 days ago | | |

Thanks, thought I searched for it & didn't come up. Great stuff

66yatman 50 days ago |

If it quacks it’s a duck.

Arcaveli 51 days ago |

cool