How I shrunk a Docker image by 98.8% – featuring fanotify

pmlamotte 11 years ago |

Reminds me of the method used for the demoscene FPS game .kkrieger which was stored in less than 100kb. They basically played through the game several times and trimmed out any code paths that weren't used in order to get it small enough, using a rudimentary c++ "pretty printer" they wrote that tracked executions of code paths. They had the advantage of being able to alter their code to only use constructs supported by their custom tool. Utlimately, this led to some bugs/features being stripped. An example being they hadn't pressed up in the menu during their runthrough, so you could only navigate downward.

https://fgiesen.wordpress.com/2012/04/08/metaprogramming-for...

neonfreon 11 years ago |

I don't think there is any way to prove that this found all the required files. The more paths through the code, each with its own potential file accesses that can't be predicted with out run time information, the more likely one will be missed in this optimization stage.

dxhdr 11 years ago | |

Variation on the halting problem? Given infinite running time and arbitrary input, can you prove that a program will never access file X.

michaelmior 11 years ago | | |

Agreed that this seems equivalent to me. Assume instead of the instruction "access file X", the instruction is HALT.

birdsbolt 11 years ago | | |

There's no general algorithm, but you could probably prove it, if you tried really hard for your given example. :D

seanp2k2 11 years ago | | |

I'm also interested if it's possible to say so with certainty.

TheDong 11 years ago | |

You have runtime information though. It's true that this method will not find all things (dlopened files in strange codepaths), but just like we have tools that can verify 100% code coverage in tests, you could fuzz inputs until you find that you've hit every single branch of the executable's instructions and record all dependencies as you go.

You could argue that that can still be fooled by, e.g., making the software dlopen the argument given to it at which point that codepath would have different dependencies each time it was hit, but that argument quickly devolves. That same argument says that when I run `ls /tmp/file` that makes `/tmp/file` a dependency of ls and thus I must include every file in the image else it will have different behavior.

I think intelligent fuzzing + high branch coverage can prove that you have found all required files.

michaelmior 11 years ago | | |

I don't think you can ever prove that you've found the required files for an arbitrary binary. (I especially have a hard time believing that such a proof would involve fuzzing, which is random.) However, it seems reasonable that you would be able to achieve a high enough level of confidence that this technique would be viable.

TheLoneWolfling 11 years ago | | |

You cannot. It reduces to the halting problem, relatively trivially:

    <arbitrary code that cannot open foo.txt>
    do something with foo.txt

This will use foo.txt iff said code halts.

You can, however, prove that you've found a superset of the required files for an arbitrary binary. Or prove that you've found the required files for some, but not all, arbitrary binaries.

TheDong 11 years ago | | |

You're wrong.

You cannot say you haven't found all the dependencies, but you can say you have found all the dependencies (given the constraints I placed above).

The halting problem only says that you cannot prove that a given program will halt.

However, you can prove a specific program halts if, in fact, that program halts.

The original question was not "can prove that I can find the dependencies for an arbitrary binary", but "can you prove that all the dependencies were found for a single specific binary".

For some program that has an infinite loop you can say "I don't know if I've found everything", but if you have shown that you have hit every code branch, as I said above, then clearly this program both halts and has had all dependencies found, excepting different behavior for user input within those already explored branches.

vacri 11 years ago | |

I was thinking the same thing during the article, and the author says as much in the 'Last Thought', and doesn't recommend using this for production purposes. The footnotes say that this was more of an exercise in the syscall than in docker.

kentonv 11 years ago |

Sandstorm.io has baked something like this into its basic packaging tool for about a year now, except based on FUSE rather than fanotify. Really helps cut down package sizes - many are 10-20MB despite containing all userspace dependencies of the app. https://blog.sandstorm.io/news/2014-05-12-easy-port.html

derefr 11 years ago |

The 80/20 solution here is to just find the few files that take up the largest amount of space and are clearly pointless to your app, and remove them. The USB hwdb, for example. Also, trimming down the timezone and locale DBs to just the ones your app runs on (hopefully UCT and UTF-8) should help—unless your app has to deal with data containing user-defined datetimes/charsets.

The other interesting thing to try, if your app's problem isn't so much library-dependencies but instead Unix shell dependencies, is to use a Busybox base image. Apps whose runtimes are already sandboxed VMs, especially, usually work great under Busybox: the JVM, Erlang's BEAM VM, etc.

rwmj 11 years ago |

A better idea for chroots or VM images is supermin, where you copy the files from the host filesystem. (http://libguestfs.org/supermin.1.html)

xorcist 11 years ago |

Isn't the point of running an application in a container, or any chrooted environment, to only isolate the application from the rest of the operating system?

Then why would you start out with a complete extra operating system in there? Why not just put the application and its dependencies in there?

To strip non-dependencies from an complete operating system sounds like a very failure prone way to accomplish almost the same thing. You really need to execute all code paths, which is difficult to guarantee (did you really run your application in all locales for example?).

errordeveloper 11 years ago |

The exact approach described here is very extreme. It's a top-down method with a tool. I find the tool may be of some interest, but I think bottom-up method would be more practical . I have done some experiments with Yocto/OpenEmbedded and about to put that out one day, once I have time to document it ...

social42 11 years ago |

Why not just use a micro kernel container like OSV from cloud outs? Same result with less effort

jtlebigot 11 years ago | |

The truth is hidden in a comment: The goal was to learn fanotify syscall using a real world use-case. This said, when Dockerizing an application from scratch, using an optimized base image may be a suitable option. But that's not always the case. For instance, I often start a project from the Python base image which contains loads of generic libraries that I will not use in a given project but will be important for others. This is when a profiling based approach is interesting. You get the ease of a known environment and the efficiency of an optimized image.

errordeveloper 11 years ago | |

OSv is a unikernel and isn't a container... but anyway, good point!

zokier 11 years ago |

ptrace probably would have been better solution, at least it would have avoided the problems with links

lsllc 11 years ago |

Here you 'go':

http://blog.xebia.com/2014/07/04/create-the-smallest-possibl...

nathwill 11 years ago |

Why not ldd?

daakus 11 years ago | |

It doesn't find dependencies like /etc/ssl/certs/ca-certificates.crt or /usr/share/zoneinfo.

nathwill 11 years ago | | |

ah, thanks.

sophacles 11 years ago | |

I've been playing with a project to do this. The first major obvious problem is anything that uses dlopen won't necessarily get all that it needs.

errordeveloper 11 years ago | | |

Yes. Grepping the code or fighting runtime errors are two complementary approaches I can think of... Not sure if there are other methods.

pure_x01 11 years ago |

another way of achieving this https://github.com/PerArneng/fortune

guidob 11 years ago |

While a standard base might be bigger, it does make it easier to cache when you use it in most images. A lot of smaller specific images will mostly be unique.

krakensden 11 years ago |

Does CAP_SYS_ADMIN still leak out of containers? I know at some point running with that meant you were root on the host...

justincormack 11 years ago | |

Thats only to find the files, not afterwards.

Immortalin 11 years ago |

Could this method be used as a reversed way of creating Unikernels?

SandB0x 11 years ago |

If only there were some way to describe in a few lines of text what an image should contain?

Animats 11 years ago | |

Do Docker images really have to contain an entire bloated Linux distro? Even for Xen, which, as a hypervisor, provides fewer services than Docker, it's possible to write applications which run directly under Xen.

errordeveloper 11 years ago | | |

They don't have to, one can run static binary without any problems. It's just that most people keep throwing in a whole distro...

iso8859-1 11 years ago | | |

You can't make a truly static binary with glibc, so almost no one has a toolchain that is able to do it.

michaelmior 11 years ago | | |

What do you mean by "truly static"?

iso8859-1 11 years ago | | |

One that doesn't try to load dynamic objects on runtime like glibc does, if you use certain functions.

zobzu 11 years ago | | |

im not sure why one wouldnt want a whole distro tho - it makes debugging, testing, etc far easier.