Docker Misconceptions

76 points by lsm 10 years ago | 20 comments

mkozlows 10 years ago |

It seems to me that a lot of the "Docker isn't good for production" stuff boils down to "Docker is a base layer that's not sufficient for production, and you need other tooling around it."

Like, if you're using Docker in conjunction with AWS's suite of tools (Elastic Beanstalk, CloudWatch, etc.), a lot of these concerns are taken care of, you know?

So Docker doesn't solve everything, but it can be part of the solution.

nzoschke 10 years ago | |

I totally agree.

Docker on ECS, with VPC for network and instance isolation, an ELB for load balancing and a Kinesis for log streaming is working extremely well.

Docker is feeling really great here as common tooling between a development system and production system.

Disclaimer: I'm working on a project, Convox, that automates setting up this type of system. https://docs.convox.com/

numbsafari 10 years ago |

I have this sinking feeling that a lot of what is happening with Docker as a specific tool is going to be replaced in 4 or 5 years with unikernels.

My hope is that the orchestration/scheduling tools (mesos, kubernetes, etc.) mature in such a way that the switch from Docker to unikernels is largely transparent to most people.

ebiester 10 years ago | |

For all we know, that unikernel might be Docker-branded.

nogox 10 years ago | |

Why? unikernel is quite opinioned imo.

numbsafari 10 years ago | | |

If you look at it from the perspective of the typical deployed application, say, with 4 or 5 VMs working together, it probably doesn't amount to much.

But if you are Google or Amazon, who have to build massive data centers to host thousands and thousands of those apps, along side much larger-scale applications, you could achieve much more significant density (and therefore reduced costs) if you were running unikernels as opposed to VMs. Perhaps passing some of that cost difference on to the customer for both competitive reasons and as an incentive for them to upgrade.

That said, even for a small-time app, consider the weight of trying to run a complicated micro-service-based system on a developer laptop. Having to orchestrate a bunch of VMs is an unmitigated disaster. Having to orchestrate a bunch of containers in one or more VMs is an improvement, but not much.

If you could instead run unikernels, there's considerably less overhead. Especially since the unikernels are typically able to run hosted inside a standard host-OS process.

Don't get me wrong, the world isn't really there. But when you consider a kubernetes cluster of docker containers that you never SSH into ... why bother with all those added layers of OS and runtime cruft?

angersock 10 years ago |

Given the amount of money Docker, Inc. has raised (>50M, three series rounds, etc.), I somewhat cynically think that this buzz about Docker may just be the result of a lot of marketing money.

I'm not really comfortable with such widespread adoption of a tool that is primarily a VC baby--NPM is setting itself up to fail (I think) in a similar fashion.

I do hope I'm wrong.

KaiserPro 10 years ago |

"Misconception: You should have only one process per Docker container!"

as soon as you start treating docker images as anything other that isolated statically compiled executables, you're not going to get the best out of docker.

if you are bundling inits, crons and companion apps into a single container then you need to stop, go back and either re-factor your code, or go to Full on VMs,

why?

because the networking is terrible. There are three great advantages to using real VMs over containers:

o Networking

o Isolation

o hot migration and resource allocation

Networking:

every instance of a service can have its own IP, and can be trivially tied to DNS automatically. scoped service discovery that's only sortof just possible now. however it uses immature tools with limited professional experience to back them up. DNS, DHCP with subdomains means images can be dropped in without any hard work

Isolation:

Its far harder to break out of a VM than it is a container. Especially if you are dealing with persistent storage and need to allow a container to write outside of its own chroot.

Hot migration:

This is killer. Hardware fails. having a cluster that automatically migrates around contention and hardware failure, without the app having to worry is worth many thousands of man hours. Yes making your own clustering system is fun, but its really quite hard to do well. Why bother when the hypervisor can do it for you?

There are three things going for docker:

Configuration library:

There is a rich library of prebuilt images

Baked in fudges:

You can bake in your dirty hack into the container, so long as you script it into your build job, its repeatable.

Speed:

yes there is less overhead. but lets be honest, how often have you hit up against VM speed issues that were down to your machine using too much CPU/memory? (if you're on AWS, no, you've not. AWS is dogshit slow, and expensive.)

Everything else, like immutable builds, easy dev environments et al, can be achieved already, and without much work.

theduro 10 years ago |

This post is a year old. Many of it's points are still valid, but others are not. For example, orchestration has been simplified with hosted services like Tutum and Cloud66.

I do however agree that not everything is ready to be containerized, but we are starting to get close.

davexunit 10 years ago | |

>orchestration has been simplified with hosted services like Tutum and Cloud66.

Ah, so you need to use proprietary SaaS in order to have decent orchestration? Not good news.

numbsafari 10 years ago | | |

Also not true. Consider kubernetes and mesos. Both are open source.

joshstrange 10 years ago |

I didn't even notice this was posted a year ago until I got to the bottom (though I did feel some tools/ideas were left out which was explained by the date). That said by and large this is a really good resource and as someone who is going all-in with docker on a side project it was a very useful read!

jrochkind1 10 years ago |

Interesting, while the OP says they like Docker, they pretty much recommend against using Docker for the things/purposes that most Docker hype recommends it for.

bradhe 10 years ago | |

I think it's more the case that he recommends against it...unless you know what you're doing!

exelius 10 years ago |

A lot of these articles are correct. I would agree that Docker probably isn't ready for production. But containers provide a TON of benefits, and you should absolutely be thinking about how to containerize your applications now. Just because it's not currently ready for production doesn't mean you shouldn't start getting ready to move to a container solution. The ecosystem will mature, companies will offer solutions for these problems, and it will eventually be ready for production. When it is ready, you should be too.

The big problem that Docker solves is the dependency problem. Specifically, it ties multiple levels of dependencies together with application code in a way that makes no assumptions about your environment and how well-maintained it is. It means that your CI system can test on the exact same versions of binaries -- and every dependency down to the kernel level -- that you will run on your production systems.

Many bigger companies will have multiple Yum/Apt/Maven/Git repositories, and with Docker, it doesn't matter. Whatever is built into the container is what gets run. Most importantly it puts control of those things into the hands of the development team, not the system administration team. It allows you to more cleanly separate your infrastructure ops from your application engineering/devops, which is the prime benefit IMO because those two groups have never worked together well.