Solving Sudoku in Python Packaging

Solving Sudoku in Python Packaging(github.com)

305 points by Yenrabbit 1 year ago | 53 comments

simonw 1 year ago |

I love this so much. I dug around a bit and figured out how it works - I have an explanation (with an illustrative diagram) here: https://simonwillison.net/2024/Oct/21/sudoku-in-python-packa...

Figuring out how it works is a great way to learn a bit more about how Python packaging works under the hood. I learned that .whl files contain a METADATA file listing dependency constraints as "Requires-Dist" rules.

I ran a speed comparison too. Using the uv pip resolver it took 0.24s - with the older pip-compile tool it took 17s.

TeMPOraL 1 year ago | |

Tangent, but I wondered what libuv had to do with speeding up Python packaging, and it turns out nothing. I wonder why someone choose to name a pip replacement in a way that effectively collides with several tools and libraries across many languages...

giancarlostoro 1 year ago | | |

I agree.. While I think it looks amazing, it's a poor naming choice.

seanw444 1 year ago | |

Wow, uv really is fast.

jebebeebehhe 1 year ago | | |

As is simonw writing that post in under 60m assuming he first saw the concept here on HN.

zahlman 1 year ago | |

People keep trying to sell the speed of such solutions as a killer feature for uv, but I think I must not be anywhere near the target audience. The constraint-solving required for the sorts of projects I would typically work on is not even remotely as complex, while I'm bottlenecked by a slow, unreliable Internet connection (and the lack of a good way to tell Pip not to check PyPI for new versions and only consider what's currently in the wheel cache).

the_mitsuhiko 1 year ago | | |

> while I'm bottlenecked by a slow, unreliable Internet connection (and the lack of a good way to tell Pip not to check PyPI for new versions and only consider what's currently in the wheel cache).

Which is one of the reasons why uv is so fast. It reduces the total times it needs to go to PyPI! Not only does it cache really well, it also hits PyPI more efficiently and highly parallel. Once you resolved once, future resolutions will likely bypass PyPI for the most part entirely.

kvdveer 1 year ago | | |

Our CI took 2 minutes to install the requirements. Adding UV dropped that to seconds. Now most time is spent on running tests, instead of installing requirements.

Of course we could've cached the venv, but cache invalidation is hard, and this is a very cheap way to avoid it.

simonw 1 year ago | | |

More significant than the speed improvement in my opinion is the space saving.

The reason uv is fast is that it creates hard links from each of your virtual environments to a single shared cached copy of the dependencies (using copy-on-write in case you want to edit them).

This means that if you have 100 projects on your machine that all use PyTorch you still only have one copy of PyTorch!

itsbjoern 1 year ago | | |

Personally I’m just a fan of people improving dev tooling, regardless of it ultimately making a huge difference to my workflow. I haven’t used uv yet, but I’m still tangentially following it because despite pip and poetry being great tools I have had my fair share of grievances with them.

ilyagr 1 year ago | |

How does it encode the idea of having all the numbers on each line/square?

visarga 1 year ago |

That's why it feels like installing a ML repo is like sudoku. You install everything and at the last step you realize your neural net uses FlashAttention2 which only works on NVIDIA compute version that is not deployed in your cloud VM and you need to start over from scratch.

hskalin 1 year ago | |

Sometimes I just change the version of the package in requirements to fit with others and pray that it works out (a few times it does)

pjc50 1 year ago | |

See the discussion on why sqlite insists on vendoring its build dependencies as far as possible and not using, say, CMake.

austinjp 1 year ago | |

This describes the day I wasted on Monday before I gave up and wrote some damn deterministic code instead of using some damn AI.

nicman23 1 year ago | |

honestly if the ml does not have a docker image - not compose no build an image- i do not even bother any more

anthk 1 year ago | |

Guix fixes that in the spot.

chatmasta 1 year ago |

Here’s the same thing in Poetry (2022): https://www.splitgraph.com/blog/poetry-dependency-resolver-s...

teschmitt 1 year ago | |

Was just about to say: I've seen this before but building it with a universally usable requirements.txt is even cooler.

echoangle 1 year ago |

> Solving the versions of python package from your requirements is NP-complete, in the worst case it runs exponentially slow. Sudokus are also NP-complete, which means we can solve sudokus with python packaging.

Is that actually sufficient? Can every system that’s solving something that’s NP-complete solve every other NP-complete problem?

yochem 1 year ago |

No way pip actually is a really inefficient SAT solver!

stabbles 1 year ago | |

For a long time it was not because there was no backtracking.

Now it is just an exhaustive, recursive search: for the current package try using versions from newest to oldest, enqueue its dependencies, if satisfied return, if conflict continue.

taeric 1 year ago | | |

If there was no backtracking, that implies it couldn't solve every sudoku? That is rather amusing with the implication that it couldn't solve every dependency, as well?

fernandotakai 1 year ago | |

uv actually talks about this in their resolver docs https://docs.astral.sh/uv/reference/resolver-internals/

alentred 1 year ago |

This is BRILLIANT ! I knew of a trend to implement lots of different things at compile-time (in Scala and Haskell communities at least) - definitely fun and quirky, but it never seemed that "special". This one, it has an air of old-school computer magic around it, probably because it is so elegant and simple.

mi_lk 1 year ago |

See also this 2008 post using Debian package system to solve Sudoku:

https://web.archive.org/web/20160326062818/http://algebraict...

ziofill 1 year ago |

but how does it know the constraints?

thangngoc89 1 year ago | |

This is the content of sudoku_0_0-1-py3-none-any.whl. So when the (0,0) cell is 1, none of the cells in the same row, column and subgrid should be 1.

    Requires-Dist: sudoku_0_1 != 1
    Requires-Dist: sudoku_0_2 != 1
    Requires-Dist: sudoku_0_3 != 1
    Requires-Dist: sudoku_0_4 != 1
    Requires-Dist: sudoku_0_5 != 1
    Requires-Dist: sudoku_0_6 != 1
    Requires-Dist: sudoku_0_7 != 1
    Requires-Dist: sudoku_0_8 != 1
    Requires-Dist: sudoku_1_0 != 1
    Requires-Dist: sudoku_2_0 != 1
    Requires-Dist: sudoku_3_0 != 1
    Requires-Dist: sudoku_4_0 != 1
    Requires-Dist: sudoku_5_0 != 1
    Requires-Dist: sudoku_6_0 != 1
    Requires-Dist: sudoku_7_0 != 1
    Requires-Dist: sudoku_8_0 != 1
    Requires-Dist: sudoku_0_1 != 1
    Requires-Dist: sudoku_0_2 != 1
    Requires-Dist: sudoku_1_0 != 1
    Requires-Dist: sudoku_1_1 != 1
    Requires-Dist: sudoku_1_2 != 1
    Requires-Dist: sudoku_2_0 != 1
    Requires-Dist: sudoku_2_1 != 1
    Requires-Dist: sudoku_2_2 != 1

jsnell 1 year ago | |

The constraints are going to be static and independent of the puzzle. So I expect they're encoded in the package dependencies. So for example version 1 of the package sudoku_0_0 will conflict with all of: version 1 of sudoku_[0-8]_0; version 1 of sudoku_0_[0-8]; version 1 of [012]_ [012].

roywiggins 1 year ago | | |

generate_packages makes it moderately clear:

https://github.com/konstin/sudoku-in-python-packaging/blob/m...

IshKebab 1 year ago | |

Yeah they missed out the actual interesting bit from the readme...

worewood 1 year ago |

This is type of cool hacking I like to see. Kudos! (Or better, Sukodus :) )

niyonx 1 year ago |

How did you even think of that? Nice!

revskill 1 year ago |

This is a hack.

jessekv 1 year ago | |

And why I come here for... er, news.

anthk 1 year ago |

Now, in MicroLisp, Common Lisp and maybe Emacs' Elisp too:

http://www.ulisp.com/show?33J9