Take a look at Traefik, even if you don't use containers (j6b72.de)

388 points by q2loyp 2 years ago | 260 comments

sph 2 years ago |

Traefik is pretty cool, but suffers from the same terrible problem as Ansible: there is a lot of documentation, and a lot of words written, yet you can never find anything you need.

I have used it since v1 and I routinely get lost in their docs, and get immensely frustrated. I have been using Caddy for smaller projects simply because its documentation is not as terrible (though not great by any stretch).

Technical writers: documentation by example is good only for newbies skimming through. People familiar with your product need a reference and exhaustive lists, not explanation for different fields spread over 10 tutorial pages. Focus on those that use the product day in and day out, not solely on the "onboarding" procedure.

This is my pet peeve and the reason why I hate using Ansible so damn much, and Traefik to a lesser extent.

johanbcn 2 years ago | |

> Technical writers: documentation by example is good only for newbies skimming through. People familiar with your product need a reference and exhaustive lists, not explanation for different fields spread over 10 tutorial pages. Focus on those that use the product day in and day out, not solely on the "onboarding" procedure.

I agree. We all would benefit by giving more exposure to documentation frameworks such as https://diataxis.fr

adolph 2 years ago | | |

I'm glad to have clicked through for curiosity's sake. Diátaxis is tremendously interesting.

For folks who might recognize the author's name:

Daniele Procida: Director of Engineering at Canonical. Creator of Diátaxis and BrachioGraph. Django core developer. Fellow of the Python Software Foundation.

siamese_puff 2 years ago | | |

Also https://docs.divio.com/documentation-system/

yoyojojofosho 2 years ago | | |

Discussed on HN: https://news.ycombinator.com/item?id=33721314

yread 2 years ago | | |

MSDN also follows these principles

molszanski 2 years ago | | |

Thank you for the link <3

plantain 2 years ago | |

My latest gripe in this category - opentelemetry. Thousands of pages. Very little about actually achieving basic common workflows.

jackthejacky 2 years ago | | |

Oh man, I FEEL this comment. That was one absurdly awful set of documentation, because not only do they have a lot of confusingly placed repeated content, they also follow the philosophy of giving only a top-level conceptual primer for everything, and explaining the actual main use case of a component 3 navigation pages deep.

So a beginner has to jump a BUNCH of pages to get a primer, and an expert has to bookmark the couple actually-useful pages and later give up and just look at github for specific operators/processors when they already know the basic config inside out.

silisili 2 years ago | | |

Same experience. Otel is one of the wordiest docs I've ever come across that says very little.

Further, I found a lot of little bugs that are hard to Google, or that, when Googled, turn up open issues that are either known and being worked on or have no response at all.

I ended up just throwing it in the garbage and using direct connectors. I like what Otel is trying to achieve, but it feels extremely opaque and half baked at the moment.

jpgrassi 2 years ago | | |

I know the feeling, so I built something that hopefully addresses some of this: https://otelrecipes.com. Just launched it last week!

It offers sample applications and a website that shows in a step-by-step manner what you have to do to get OpenTelemetry configured in your apps. My goal is to keep the sample apps to the minimum and focused on a single goal: E.g., I want to add tracing to my app; I want to record metrics; I want to correlate logs with traces etc.

I have lots of ideas and things in the backlog, such as collector recipes.

It's all OSS as well, so anyone can contribute with more samples :)

https://github.com/joaopgrassi/otel-recipes

arkh 2 years ago | | |

I feel like it is endemic to anything OPS / DevOPS. Lots of uselessly verbose "documentation" but no list of whatever you really need.

All in the name of selling products which abstract those parts, consulting or courses.

tnolet 2 years ago | | |

Oh boy that hits home. Been deep in the OTEL world the last months and the official docs are very, very undercooked.

phillipcarter 2 years ago | | |

Have you given the Getting Started docs pages a go? Indeed it's mostly still reference/conceptual content, and not oriented towards specific workflows yet. We've been wanting to get that kind of content written for quite some time, but the reality is we're a small group (often volunteering our time) and there's still an immense amount of reference and conceptual gaps that need addressing.

jdub 2 years ago | | |

When I started using Honeycomb, I had such a wonderful integration experience with their Beeline SDKs.

Then they transitioned to OpenTelemetry – for very good, justifiable, "good community member" reasons – and yikes, everything got so much more complicated. We ended up writing our own moral equivalent to the Beeline SDK. (And Honeycomb have followed up since with their own wrappers.)

There's so much I love about Open Source, but piles and piles of wildly generic, unopinionated code... ooft. :-)

jethro_tell 2 years ago | |

One of the problems that the yaml interpreter class of languages, or whatever you'd call them, suffer from is the fact that yaml itself is a language and tends to be more or less undocumented in the interpreter docs.

It's sort of assumed that you are going to do extremely simple tasks on very flat data structures. That doesn't tend to be the reality that most of us live in. And to really get the most out of these languages you have to understand an entire unspoken set of rules on how to use yaml. That's never really pointed out in the docs.

Additionally, there are docs for the unique settings of each module, but when it comes to using the standard settings, it's rarely clear how to operate on the data that might be returned or how to combine it with anything mildly complex. You are given a dozen one-stanza examples for each item, like a stack of ingredients, and then told to bake a cake.

I've had this experience with basically every one of the various yaml interpreter systems I've used.

After a few 100k lines of yaml I can get things done but the docs are useless other than a listing of settings.

ornornor 2 years ago | | |

To illustrate this point, here is how to have a multi line value in yaml: just kidding, it’s so confusing that there is a whole website to help you figure it out: https://yaml-multiline.info/

cromka 2 years ago | | |

Isn't that why TOML is seemingly increasingly used to replace YAML in projects?

tootie 2 years ago | | |

I honestly wonder why not just write your web server in node or something. It would be traceable and testable and probably performant enough. There's just so much arcana inside platforms like traefik or nginx where they do all this miraculous stuff if you just add the right flags, but also when it doesn't work it's a total black box and there's no way to discover what it thinks it's doing.

samuell 2 years ago | |

Ansible definitely requires constant lookups in the documentation.

I've found a pretty good workflow using ansible-doc though, with two or three aliases that I use constantly:

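    alias adl='ansible-doc --list'            # list available plugins with short descriptions
    alias adls='ansible-doc --list | less -S' # same list, paged without line wrapping
    alias ad='ansible-doc'                    # show the full docs for one plugin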

Then I'll:

1. Use adls to quickly search (in less with vim bindings) for relevant commands,

2. Check up the docs with `ad <command>`.

3. Almost always immediately jump to the end (`G`) to see the examples, which typically provide a direct answer to what I need.

Since authoring Ansible scripts is so dependent on the docs, I think they really should make this process work better out of the box though, providing some interface to do these lookups quicker without a lot of custom aliases.

bithaze 2 years ago | | |

I keep the module index[0] in my bookmarks bar and that's also been pretty easy to search and read.

[0] https://docs.ansible.com/ansible/latest/collections/index_mo...

ollien 2 years ago | |

Pydantic falls into this box for me. The maintainer refuses to build API reference documentation, as they feel that there should only be one source of information. It's their project, of course, but every time I need to find a method on an object, I am scouring pages of prose for it. Sometimes it's just easier to read the source.

aeyes 2 years ago | | |

What's missing from the existing API documentation?

https://docs.pydantic.dev/latest/api/base_model/

angra_mainyu 2 years ago | | |

Haproxy does the whole documentation side of things very well.

The docs are very straightforward and thorough.

mholt 2 years ago | |

Funny you say that because we have hardly any examples in the Caddy docs. We're working on improving them later this year.

sph 2 years ago | | |

Examples are good in docs. But documentation that's only made of examples and tutorials... not so much.

Thanks for Caddy btw. Neat little tool.

molszanski 2 years ago | |

This is strange. I also don't like the docs but for a different reason.

I would rather have more examples, and kinda _advanced_ and complex ones rather than the trivial ones we see in the docs.

Even though I had working v1 configs and knew the lingo and architecture (routes, services), I still struggled for a day or two to properly configure some pretty simple workflows in v2, like:

* add TLS with LetsEncrypt

* configure multiple domains

* configure multiple services

* add Basic Auth for some domains

That said, more detailed and extensive docs would be much better.

I also remember finding things in GitHub issue comments that worked as a bugfix/workaround for something from the docs.

PS. For now I've moved to Caddy for its simplicity and the better Caddy DSL compared to the verbose YAML/label config.
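For readers trying to picture the four v2 workflows listed above, here is a minimal sketch of how they map onto Traefik v2 Docker labels. It is only an illustration: the helper `traefik_labels`, the entry point name `websecure`, the resolver name `le`, and the example hostnames are assumptions, not anything from the Traefik docs or from the comment.

```python
def traefik_labels(name, hosts, port, cert_resolver=None, basic_auth_users=None):
    """Build Traefik v2 Docker labels for one service (hypothetical helper)."""
    # Several domains on one router: OR the Host matchers together.
    rule = " || ".join(f"Host(`{host}`)" for host in hosts)
    labels = {
        "traefik.enable": "true",
        f"traefik.http.routers.{name}.rule": rule,
        # "websecure" is an assumed entry point name defined in the static config.
        f"traefik.http.routers.{name}.entrypoints": "websecure",
        f"traefik.http.services.{name}.loadbalancer.server.port": str(port),
    }
    if cert_resolver:
        # TLS via an ACME (e.g. Let's Encrypt) resolver declared in the static config.
        labels[f"traefik.http.routers.{name}.tls.certresolver"] = cert_resolver
    if basic_auth_users:
        # Basic Auth is a middleware that the router must reference by name.
        middleware = f"{name}-auth"
        labels[f"traefik.http.middlewares.{middleware}.basicauth.users"] = ",".join(basic_auth_users)
        labels[f"traefik.http.routers.{name}.middlewares"] = middleware
    return labels


if __name__ == "__main__":
    example = traefik_labels(
        "blog", ["blog.example.com", "www.example.com"], 8080,
        cert_resolver="le", basic_auth_users=["admin:<htpasswd-hash>"],
    )
    for key, value in example.items():
        print(f"{key}={value}")
```

Calling the helper once per service covers the "multiple services" case; each service gets its own router, service, and optional middleware names.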

linsomniac 2 years ago | |

Do not agree WRT ansible, been using it for well over 5 years and usually a google search points me right at the correct part of the documentation to answer my question. Ansible, the tool itself, can be a bit obtuse, largely IMHO because of the YAML source language, so some concepts are hard to translate into the tool, but the documentation has never bothered me.

As far as "a lot of words written, can't find what you need", Fortinet is my poster child there (based on trying to use it a decade ago). Everything I looked up there had 10,20,30 pages of introductory material with the Fortinet stuff spread throughout it.

sph 2 years ago | | |

Alright, please link me to an exhaustive list of Jinja filters supported by Ansible out of the box. I'll wait.

What you are given is https://docs.ansible.com/ansible/latest/playbook_guide/playb... and you need basically to read/scan each example until you find what you need [1]. Do you call that good, especially when these are basically the only way of doing anything a little complex? That's a sure way of killing my flow and productivity in its tracks. I have been through this page in anger a dozen times, and I still have no idea what Ansible filters can or cannot do.

Also, using Google to find stuff is "cheating". The goal of documentation is to be able to use it as reference; if you need an external tool to find anything in it, that defeats its purpose a bit. When people wrote documentation books, they had to make sure they were usable, legible and efficient. These days apparently that's become a lost art.

1: these examples are not even exhaustive, because they don't list all the builtin Jinja filters; chances are that what you need isn't listed on that page, but you should instead refer to https://tedboy.github.io/jinja2/templ14.html

pragma_x 2 years ago | |

Possibly unpopular opinions to follow. This is made even worse by:

- Having documentation split between v1 and v2, that is similar yet different enough to yield half-baked configurations before you realize what you did wrong. The website itself provides the barest of subtle changes to distinguish the two. Edit: I learned all this prior to v3.

- Supporting multiple config formats (TOML and YAML) which makes it that much harder to hunt down relevant forum posts for support. That wouldn't be a huge problem if it weren't for things that you need that aren't in the documentation (above)

- Multiple configuration _modes_. You can set options on the CLI, or in config files, and they are not 100% reflected between the two; some things must be in config files no matter what. Config files themselves are split between "dynamic" and "static" configs, and you must place the correct options in the right file.

- The one thing that Traefik does well is routing traffic to/from containers. Container labels are used to drive this. How to map those label namespaces to various internal and user provided aspects of Traefik is unclear in the docs, at best.

- Traefik exposes a dashboard web GUI by default. Yet much of the useful diagnostic and troubleshooting information is only found in the container logs.

Retiring v1 completely, picking a single configuration language/mode, and providing a rich library of example docker-compose configs, would go a very long way to securing this project's future.

renk 2 years ago | | |

The documentation split is unfortunate and the GUI is really just a status page. The other points are a strength. A pattern that works well: Put all Traefik config in your Docker container definitions, as command line flags and labels, plus the dynamic config provided as a volume. That gives you all the flexibility and only one or two places to look for the config (e.g. a Docker compose file and the dynamic config file).

lamontcg 2 years ago | |

You really need at least three documentation targets:

- onboarding the newbies workflows/tutorials

- intermediate "focus on the important bits" workflows/tutorials

- exhaustive references

There might be other useful ones as well, but I never see those three hit at the same time adequately.

bshacklett 2 years ago | |

This was exactly my experience. It’s incredibly frustrating to search documentation only to be stuck with examples that are related, but don’t fit one’s exact situation, and don’t explain the underlying behavior.

jq-r 2 years ago | |

Those are great points. Even the page layout of the documentation is terrible. Why do we have huge monitors and millions of pixels if I have to read content from a very narrow column that is a mile long?

Eg: https://doc.traefik.io/traefik/routing/services/

If you visit that page in your desktop browser you'll get fewer words per line than when viewing it on an iPad (works even in dev tools window). Mind blowing.

rezonant 2 years ago | |

Well said, this extends far beyond Traefik. Far too much documentation these days is tailored for people who have never used software of its type. This was a workable strategy during the Great Developer Boom, but that's more or less over now.

As a developer who didn't come from this Boom, I have been constantly frustrated by this trope, and I hope the changes in the industry will tip the scales back toward solid reference documentation, so that I can feel confident in deploying more of these technologies.

Putting that more general note aside, I have been a Traefik user for years and I do recommend it. But a lot of what it does is difficult to cite using solid docs.

sharperguy 2 years ago | | |

I've often resorted to just looking through the project on github and finding whatever source file is responsible for parsing the configuration files to figure out what each option does.

igor_varga 2 years ago | |

I'm using Traefik and have the same experience with the documentation. It can be time consuming to configure it properly if you are not a power user.

I'm happy with it though, it's a great piece of software. I wonder whether there is any other product out there with a similar feature set?

throwfaraway398 2 years ago | |

It's funny because one thing I like about ansible is how easy it is to get the reference doc for any module with `ansible-doc -t module`.

I do sometimes struggle to find the right doc when I'm searching for something about ansible core itself, but that doesn't happen too often.

mubu 2 years ago | |

I share the same sentiments. I dread having to go through Ansible docs because it's so densely packed. Meanwhile Caddy's docs feel too sparse, and too many spread out tutorials. The reference isn't well thought out either imo.

mholt 2 years ago | | |

We're revamping the Caddy docs this year.

Fire-Dragon-DoL 2 years ago | |

I want both, on the same page if possible, for every possible permutation of input arguments. In theory Ansible does this, but then it doesn't link to "you might use it in combination with..."; essentially, it lacks integration of multiple things in the reference docs. But I didn't find the Ansible docs that bad? Most of the time I search for the module name and find the reference doc.

DanielHB 2 years ago | |

Same reason why the Terraform AWS Provider documentation is better than the AWS documentation:

https://registry.terraform.io/providers/hashicorp/aws/latest...

If I can't find the answer to what I need there I usually resort to LLMs; they are surprisingly good at fetching the info you need out of these massive documentation sets. The failure rate is quite high though, so a lot of trial and error is required, but the LLM at least gives you some hints about where to look.

danielvaughn 2 years ago | | |

My primary use case for LLMs so far has indeed been to avoid terrible technical documentation.

remoquete 2 years ago | |

You're assuming that Traefik has a team of technical writers taking care of the docs. From what I know, that's not the case.

Propelloni 2 years ago | |

From this point of view the Oracle RDBMS handbooks ca. 1998 were pretty good. Come to think of it, they were pretty good all around.

SoftTalker 2 years ago | |

I don't think ansible docs are that bad.

I use duckduckgo and adding !ansible to my search usually gets me what I need pretty directly.

hinkley 2 years ago | |

Some projects need documentation, some need cookbooks. Sounds like traefik is the latter.

Hopefully as an aside (I know very little about traefik so maybe I am talking about them too and don’t know it), it seems like in the time since I abandoned Java they have weaponized that architectural strategy and I have no patience for it. I look at that sort of documentation and my eyes glaze over. Or if they don’t I feel disgust or anger and all three result in my stomping off.

Opentelemetry, particularly the stats code (vs the span code) triggered a lot of this in me. It has several sets of documentation that say different things. It took me a long time to figure out how to connect the amorphous dots, and then I didn’t entirely agree with their solution anyway.

remoquete 2 years ago | | |

OpenTelemetry docs maintainer here. We need more quality feedback like this. Please consider contributing issues to the docs repo.

arendtio 2 years ago | |

I agree that the documentation could be better, but it isn't that bad. I enjoyed all the gophers, and these images really helped me understand the structure.

However, I find it amusing that you wish there was a better reference. I think getting to the initial setup is quite hard. Once you have that, extending it is straightforward.

scrubs 2 years ago | |

Oh man are you on to something!!! One huge, bad side effect of web is the atomization of an overall body of work into 62.9 million links.

One pdf please. The book concept works!

You know whose docs blow too? Mellanox. I hate their stuff.

And to give credit where due: Intel does a damn good job.

crabbone 2 years ago | |

Just another one for collection: conda. Especially the parts about conda-build, meta.yaml etc. There are only examples w/o any way to tell what's available. And the source code is frustratingly twisted, undocumented and all over the place. Something that makes creating conda packages an extremely frustrating experience, to the point that it's significantly easier to create archives and write the metadata by hand than to rely on conda tooling.

cdelsolar 2 years ago | |

If only there were a program that had crawled bazillions of documents, including all of the traefik documentation, examples, and thousands of code files using it, and if only said program were especially designed to answer natural-language queries about said documents.

lopkeny12ko 2 years ago | |

This take is, at best, disingenuous, and at worst, dangerous. The Traefik maintainers and community contributors (including myself) have collectively invested hundreds of man-hours writing and improving documentation, specifically in response to feedback from users that things are hard, unintuitive, or complex.

You are discounting massive amounts of unpaid labor done specifically for people like you. At this point, if you can't find what you're looking for, it's on you. Maybe do a little bit of your own homework instead of throwing your hands up after 2 minutes and crying to the maintainers.

arp242 2 years ago | | |

I never used Traefik and have no opinion on it one way or the other as such. But if this is the response to some criticism of the documentation (which you can agree or disagree with), then you've done more to turn me off Traefik than anything anyone here can write.

alex_lav 2 years ago | | |

Investing a lot of time and trying really hard is not the same as adding a lot of value. If your users don't find value in your documentation, saying "But we spent a lot of time on it!" doesn't really change anything.

And, to be clear, I have no idea if the person you're responding to's criticism is valid. But I also know that your response does not negate their criticism at all.

halJordan 2 years ago | | |

Disagree that this isn't a generic problem. And I'll take the same amount of umbrage at you calling it disingenuous. There are dual needs here. Having to read a story and take in a wholly unrelated workflow just to discover only half of the switches available to the feature I'm looking up is a problem.

And when there isn't just straight documenting of what's been implemented then it is an unreasonable gate to usage which limits customers to only the flows imagined by the technical writer.

Which itself breeds this sort of refusal to participate. Either the end user is ungrateful and needs to express that gratitude through silence or there's a smug moderator who's read everything and knows which paragraph of which tutorial has the answer and harangues anyone asking with a link and a "why didn't you read sentence 5 of paragraph 2 of a tutorial written 2 years and 3 major versions ago?"

engine_y 2 years ago |

We've been using Traefik in prod for 2 years. While I used NGINX in the past, I decided to migrate to Traefik mainly because of the automatic let's encrypt integration. I am sorry for that decision. Traefik's documentation does not make sense to me or my team. It is finicky and misbehaves without proper logging. As an example - when I want to recreate the certificates - it fails sporadically leaving prod down for an indefinite amount of time.

We're moving back to NGINX.

arush15june 2 years ago |

I use Caddy rather than Traefik. It's much easier to manage the Caddyfile compared to the Traefik YAML config IMO, and we just keep three separate Caddyfiles for local, production and on-prem deployments. There are a plethora of great plugins; we use the Coraza WAF plugin for Caddy and it works well.

pricci 2 years ago | |

I moved from Traefik to Caddy with caddy-docker-proxy for my self-hosting setup.

All the features I need but *much* simpler.

https://github.com/lucaslorentz/caddy-docker-proxy

sureglymop 2 years ago | | |

Looks interesting but I don't see the benefits really. Still looks like a lot of labels exactly like with traefik. Why should one switch?

preya2k 2 years ago | | |

Same here. I enjoyed Traefik for being able to use Docker labels for my reverse proxy configuration. The mechanism is great, however I did not like Traefik's internal config structure. Caddy is much easier for me to understand and matches my (small scale) use cases much better. Using Caddy via Docker labels through caddy-docker-proxy is about as perfect as it gets (for me).

overstay8930 2 years ago | |

I love Caddy, but I wish the docs were better on production deployments; there are too many unanswered questions about best practices, especially RE: storage and config management. Like, how is local storage supposed to be handled when you're using external storage? Allegedly it can be treated as stateless but maybe not?

You basically just have to pray the guy who made the module you need knows what he is doing, because there are no standards for documentation there either. Maintainers really need to put their foot down on random ass modules with 0 documentation providing critical functionality (i.e. S3 storage backend).

renk 2 years ago | |

Yes. If you don't need all of the service discovery and auto-scaling shenanigans (or are willing to script it yourself), you can gleefully skip Traefik, Docker Swarm, Kubernetes etc. and just use Caddy! It can really do most things and it does them well.

beestripes 2 years ago |

Why Traefik over nginx for my modest needs: a couple of Docker hosts and a few dozen containers? I use https://github.com/NginxProxyManager/nginx-proxy-manager; would Traefik provide a benefit on such a small scale?

aedocw 2 years ago | |

I think https://github.com/caddyserver is the best option here. Automatic handling of SSL certs, it's incredibly lightweight, and has super clear config syntax.

mubu 2 years ago |

A couple weeks ago I was deciding between reverse proxies and eventually settled with Caddy because of its simplicity. However, Traefik's auto discovery of containers and referencing by labels is quite nice, but Caddy has a plugin to do the same.

I read the article but I'm still not convinced Traefik has anything over Caddy for me. Maybe someone else does and can chime in.

jasoneckert 2 years ago |

Another thing worthy of note is that Traefik is configured by default in K3s. This has allowed K3s to be the quickest way to spin up a K8s cluster for testing, essentially allowing you to treat your cluster like cattle too. Simply add your deployment and associated service using NodePort, and you can access your app without worrying about the ingress controller.

I use a shell script to spin up K3s clusters and test apps I specify as a positional parameter on demand (leveraging the ttl.sh ephemeral container registry). The same script tears down the cluster when finished.

psYchotic 2 years ago |

I'm considering moving reverse proxying to Traefik for my self-hosted stuff. Unlike the article's author, I'm running containerized workloads with Docker Compose, and currently using Caddy with the excellent caddy-docker-proxy plugin. What that gets me, currently:

- Reverse proxying, with Docker labels for configuration. New workloads are picked up automatically (but I do need to attach workloads to Caddy's network bridge).

- TLS certificates

- Automatic DNS configuration (using yet another plugin, caddy-dynamicdns), so I don't have to worry too much about losing access to my stuff if my ISP decides to hand me a different IP address (which hasn't happened yet)

There are a few things I'm currently not entirely happy about my setup:

- Any new/restarting workload makes Caddy restart entirely, resulting in loss of access to my stuff (temporarily). Caddy doesn't hand off existing connections to a new instance, unfortunately.

- Using wildcard certs isn't as simple as it could/should be. As I don't want every workload to be advertised to the world through certificate transparency logs, I use wildcard certs, and that means I currently can't use simple Caddy file syntax I otherwise would with a cert per hostname. This is something I know is being worked on in Caddy, but still.

Anyway, I've used Traefik in k8s environments before, and it's been fairly pleasant, so I think I'll give it a go for my personal stuff too!

PS: Don't let this comment discourage you trying Caddy, it's actually really good!

teekert 2 years ago |

I have used traefik a lot. But I mostly got frustrated with all the docker-compose labels and layers and so many lines just to have a rev proxy. Then I found Caddy. Never looked back.

I guess I was never the audience for Traefik. I just need an https enabled rev proxy. Or a basic-auth layer. In Caddy both are just 1 line, very concise, no layers (which I still don’t understand…)

djhworld 2 years ago |

I've been using traefik for a few years for all my self hosted things.

I abandoned the dynamic/discovery/docker labelling functionality though; it was just too finicky and annoying to debug.

Instead I generate a static config file using a template engine, pretty much all my things are just a combination of host/target/port so very easy to generate the relevant sections - I don't really have any complicated middlewares other than handling TLS. It sounds like the author of the linked post has taken the same route.

The config gets generated through an ansible script and then gets copied to the machine where traefik is running - traefik watches the directory where that file is and auto-reloads on changes.

It's been working great!
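The template-driven approach described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the commenter's actual setup: it uses Python's `string.Template` in place of whatever template engine Ansible would use, the service entries and the function name `render_dynamic_config` are made up, and the output follows the shape of Traefik's file-provider dynamic configuration (`http.routers` and `http.services`).

```python
from string import Template

# One router and one service stanza per entry; indentation matters in YAML.
ROUTER = Template(
    "    $name:\n"
    "      rule: \"Host(`$host`)\"\n"
    "      service: $name\n"
    "      tls: {}\n"
)
SERVICE = Template(
    "    $name:\n"
    "      loadBalancer:\n"
    "        servers:\n"
    "          - url: \"http://$target:$port\"\n"
)


def render_dynamic_config(services):
    """Render a dynamic config from dicts with name, host, target and port keys."""
    routers = "".join(ROUTER.substitute(svc) for svc in services)
    backends = "".join(SERVICE.substitute(svc) for svc in services)
    return "http:\n  routers:\n" + routers + "  services:\n" + backends


if __name__ == "__main__":
    # Hypothetical inventory: every entry is just host/target/port plus a name.
    inventory = [
        {"name": "wiki", "host": "wiki.example.com", "target": "10.0.0.5", "port": 8080},
        {"name": "git", "host": "git.example.com", "target": "10.0.0.6", "port": 3000},
    ]
    print(render_dynamic_config(inventory))
```

Writing the result into a directory that Traefik's file provider watches would then trigger the reload the comment mentions.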

silverquiet 2 years ago |

I use Traefik in production (with containers), and my favorite aspect of it is that the configuration is carried via the labels on containers which means I rarely if ever need to make any modifications to the Traefik config itself. I'd say the biggest con is trying to figure out how to pronounce the name - I think it's just regular traffic, but I can't help wanting to call it "trey-feek" or something like that.

ofrzeta 2 years ago |

Is it any better than HAProxy? HAProxy has served me well for at least a decade and has also been modernized for the cloud age with the runtime API that allows dynamic configuration.

ljhtlajdfqasd 2 years ago | |

All of these proxies seem to have achieved feature parity within the last couple of years.

Where they seem to differ is the licensing, enterprise model, source language, and data plane model (sidecar vs no sidecar).

juangacovas 2 years ago | |

Same here, we've been using HAProxy for years now and it only keeps improving.

dizhn 2 years ago |

I use Caddy wherever I can. That it can already handle automatic certificates is a big plus. Plus it's very easy to configure.

amne 2 years ago | |

I tried to get caddy to listen to both ports 80 and 443 in a cluster. I failed miserably. The documentation simply dismisses this as a possible scenario.

mholt 2 years ago | | |

How do you mean? Many of our users do this with no issues.

jspdown 2 years ago | |

If you like Caddy for its ACME capabilities, then you might enjoy Traefik as well. It supports HTTP, TLS ALPN and DNS challenges and can be configured in one line as well.

dizhn 2 years ago | | |

I already use it as a web server and reverse proxy so it's a better match. I've tried traefik in the past and it wasn't as simple as caddy to configure. Caddy has some well thought out magic (like creating a sane modern php config with just one line).

evtothedev 2 years ago |

The 37Signals/Basecamp team has been working on a small, opinionated replacement for Traefik called Thruster: https://github.com/basecamp/thruster

Would be worth checking out, if you're currently considering options.

wg0 2 years ago |

Side question - what do people use to hide (and make accessible) the internal services such as Grafana, Prometheus, RabbitMQ (the web interface) and such?

Should they be public behind such a proxy? (seems odd) Or should they be totally internal, with a WireGuard VPN set up to reach them?

barbazoo 2 years ago |

I’d stay away from it. The magical way to set it up via docker compose labels is nice but didn’t allow for zero-downtime deployments, at least until recently.

Getting true zero downtime deployments only worked with their file provider but that’s a bit archaic these days.

Sincere6066 2 years ago |

I'll stick with caddy. It's worked for me for years.

riedel 2 years ago |

Funnily enough, I spent my weekend writing a traefik config file to serve GitLab Pages on a self-hosted instance without Pages enabled, using the artifact API instead. No code involved. I had to configure quite a bit of rewriting logic and use three different plug-ins, which are mostly unmaintained. In the end, something like nginx, Apache, or caddy, or a bit of code, probably would have worked better because of all the layering of different middleware. But it worked somehow. I guess it still shines for easy SSL termination of docker and great observability. That is why I have been using it for the past years.

jakubsuchy 2 years ago |

Article spends a lot of time comparing Traefik to HAProxy. Might just as well use HAProxy then :-)

notoall 2 years ago |

For simple deployments, consider whether you need a reverse proxy at all.

I have IPv6 everywhere, with each service getting its own IPv6 address. Each service is managed in inetd-style (via systemd-socket-proxyd), and so essentially listens directly.

For services that need to serve IPv4, I have a reverse proxy on my network edge that demuxes on TLS SNI to the corresponding IPv6 address.

The advantage here is never having to deal with complex applications, with their complex and changing configuration.
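
As a rough illustration of the SNI demux described above, the Python sketch below pulls the server name out of a TLS ClientHello without terminating TLS and looks up a backend address. The `BACKENDS` table, its hostnames, and the `2001:db8::` addresses are made up for the example. It also assumes the whole ClientHello arrives in a single TLS record, which a real edge proxy cannot rely on.

```python
import struct
from typing import Optional

# Hypothetical routing table: SNI hostname -> backend IPv6 address.
BACKENDS = {
    "git.example.com": "2001:db8::10",
    "wiki.example.com": "2001:db8::11",
}


def extract_sni(record: bytes) -> Optional[str]:
    """Return the SNI hostname from a raw TLS ClientHello record, or None."""
    try:
        # Record header: type 0x16 (handshake), 2-byte version, 2-byte length.
        # Handshake header: type 0x01 (ClientHello), 3-byte length.
        if record[0] != 0x16 or record[5] != 0x01:
            return None
        pos = 5 + 4 + 2 + 32           # skip headers, client_version, random
        pos += 1 + record[pos]         # session_id
        (suites_len,) = struct.unpack_from("!H", record, pos)
        pos += 2 + suites_len          # cipher_suites
        pos += 1 + record[pos]         # compression_methods
        (ext_total,) = struct.unpack_from("!H", record, pos)
        pos += 2
        end = min(pos + ext_total, len(record))
        while pos + 4 <= end:
            ext_type, ext_len = struct.unpack_from("!HH", record, pos)
            pos += 4
            if ext_type == 0:          # server_name extension
                # Layout: 2-byte list length, 1-byte name type, 2-byte name length.
                name_type, name_len = struct.unpack_from("!BH", record, pos + 2)
                if name_type != 0:     # 0 means host_name
                    return None
                return record[pos + 5 : pos + 5 + name_len].decode("ascii")
            pos += ext_len
    except (IndexError, struct.error, UnicodeDecodeError):
        return None
    return None


def pick_backend(record: bytes) -> Optional[str]:
    """Map the first bytes of a connection to a backend address, if known."""
    return BACKENDS.get(extract_sni(record))
```

The edge box only has to peek at the first bytes of each connection and then copy bytes in both directions, so certificates and application config stay with the services themselves.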

notpushkin 2 years ago | |

I'm using a reverse proxy just to terminate TLS. Pretty sure it is possible to do that at a service level, but don't think it's worth the trouble.

MrOxiMoron 2 years ago |

I love Traefik, we use it with nomad/consul and docker to set up our whole infrastructure. The plugin system is also simple yet powerful and the dynamic configs are great for our customers' custom domains: we can quickly see if a domain points to the right IP and put it in to get everything working. And if a domain no longer points to us, we get a Slack notification and it is removed from Traefik so it no longer tries to get SSL certificates for it.
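
The "does this domain still point at us" check could look something like the sketch below. This is not the poster's actual tooling: `resolve_ips`, `stale_domains`, and the example addresses are invented for illustration, and the notification and Traefik update steps are left out.

```python
import socket
from typing import Callable, Iterable, List, Set


def resolve_ips(domain: str) -> Set[str]:
    """Resolve a hostname to its addresses; return an empty set on failure."""
    try:
        infos = socket.getaddrinfo(domain, None, proto=socket.IPPROTO_TCP)
    except socket.gaierror:
        return set()
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return {info[4][0] for info in infos}


def stale_domains(
    domains: Iterable[str],
    our_ips: Iterable[str],
    resolve: Callable[[str], Set[str]] = resolve_ips,
) -> List[str]:
    """Return the domains that no longer resolve to any of our addresses."""
    ours = set(our_ips)
    return [d for d in domains if not (resolve(d) & ours)]


# Example with a fake resolver, so no network access is needed:
fake_dns = {"shop.example": {"203.0.113.7"}, "old.example": {"198.51.100.1"}}
print(stale_domains(["shop.example", "old.example"], ["203.0.113.7"],
                    resolve=lambda d: fake_dns.get(d, set())))
```

Domains returned by `stale_domains` are the ones you would drop from the dynamic config before the proxy tries to renew certificates for them.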

cagenut 2 years ago |

In a mirror/reverse of the OP's premise - I always wondered why so many of these open source http reverse proxies sprang up in the container era, like what did they offer that varnish or a vmod to varnish wasn't already doing or capable of? Somehow varnish almost completely missed the container era, despite seemingly being the exact type of tool a bunch of teams would go on to create.

Starlevel004 2 years ago | |

Devops guys are mostly incapable of using any service that isn't a) written in Go and b) configured using a YAML-based DSL.

TNorthover 2 years ago | | |

Traefik's YAML does a particularly bad job at keeping syntax (such as it is) separate from user-defined labels, I feel.

Very difficult to just look at a file and see which bits are labels for the sake of it, and which bits are direct instructions to builtin features.

demi56 2 years ago | | |

> and b) configured using a YAML-based DSL.

Go devops HATE YAML-based DSLs; we just put them there because there are no alternatives. JSON? Don’t wanna go there. Fortunately there’s CUE lang, but moving all these projects to accept CUE isn’t that easy either.

> Devops guys are mostly incapable of using any service that isn't a) written in Go

Lol we basically rewrite it in Go if we’re using it frequently. Most Go projects are just things the founder really wanted for himself

lmeyerov 2 years ago | |

For Caddy, LetsEncrypt: Free TLS in one line without talking to anyone

For Traefik, afaict, something about k8s

rglullis 2 years ago |

For authentication, I had good luck with authentik as forward proxy.

The one thing that bothers me with traefik is that their implementation of ACME does not work if you have some sort of DNS load balancing. I had one setup with three servers responding to the same domain. It seems the first request (to start the ACME dance) would go to one server, and if the second one (with the .well-known address) is sent to a different one, it will just return a 404 and fail the whole thing. Now I either have * to delegate the certificate management to the service itself or add Caddy as a secondary proxy just to get certificates from it.

* Of course, someone smarter than me will point me to a better solution and I will be forever grateful.
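
To make the failure mode concrete, here is a toy Python model of the HTTP-01 lookup. It is not how Traefik stores challenges; `ProxyNode` and the token values are hypothetical. The only real detail is the `/.well-known/acme-challenge/<token>` path that the CA fetches.

```python
from typing import Dict, Tuple

CHALLENGE_PREFIX = "/.well-known/acme-challenge/"


class ProxyNode:
    """One proxy instance behind DNS load balancing (hypothetical model)."""

    def __init__(self, store: Dict[str, str]):
        # Where this node keeps token -> key authorization pairs.
        self.store = store

    def publish(self, token: str, key_authorization: str) -> None:
        """Called on the node that started the ACME order."""
        self.store[token] = key_authorization

    def handle(self, path: str) -> Tuple[int, str]:
        """Answer the CA's validation request."""
        if path.startswith(CHALLENGE_PREFIX):
            token = path[len(CHALLENGE_PREFIX):]
            if token in self.store:
                return 200, self.store[token]
        return 404, ""


# Separate stores: the node that did not start the order answers 404.
a, b = ProxyNode({}), ProxyNode({})
a.publish("tok123", "tok123.thumbprint")
print(b.handle(CHALLENGE_PREFIX + "tok123"))   # (404, '')

# One shared store (standing in for any shared backend): either node can answer.
shared: Dict[str, str] = {}
a, b = ProxyNode(shared), ProxyNode(shared)
a.publish("tok123", "tok123.thumbprint")
print(b.handle(CHALLENGE_PREFIX + "tok123"))   # (200, 'tok123.thumbprint')
```

The validation request succeeds only if it lands on a node that can see the token, which is why per-node storage breaks as soon as DNS spreads requests across servers.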

jackweirdy 2 years ago | |

If I am not misunderstanding (sorry if I am) it sounds like you use the http challenge where your cert provider tries to GET your challenge file — if so, could the DNS challenge be better suited? There, you put the challenge in a TXT record value
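
For reference, the TXT value used by the DNS challenge is derived from the key authorization as sketched below (following the ACME spec, RFC 8555). The function names and the sample key authorization string are just for illustration; a real ACME client computes the key authorization from the token and the account key.

```python
import base64
import hashlib


def dns01_txt_value(key_authorization: str) -> str:
    """Base64url-encoded SHA-256 digest of the key authorization, unpadded."""
    digest = hashlib.sha256(key_authorization.encode("ascii")).digest()
    return base64.urlsafe_b64encode(digest).rstrip(b"=").decode("ascii")


def challenge_record_name(domain: str) -> str:
    """Name of the TXT record the CA queries; wildcards validate the base name."""
    base = domain[2:] if domain.startswith("*.") else domain
    return "_acme-challenge." + base


# Placeholder key authorization, not a real one.
print(challenge_record_name("*.example.com"), dns01_txt_value("token.thumbprint"))
```

Because the record lives in DNS rather than on a web server, it does not matter which server behind the load balancer the CA would have reached.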

rglullis 2 years ago | | |

You got it, but your solution won't work because of one detail: I can not use the DNS challenge because I am running a managed service provider, and my customers are the ones who own the domain. All I can do is ask them "please add a CNAME to my gateway", and I need to figure out everything else on my side.

chadsix 2 years ago |

You can also use Cloud Seeder [1] which might be easier since it gives each container a dedicated IP. </shamelessplug>

[1] https://github.com/ipv6rslimited/cloudseeder

dmeijboom 2 years ago |

I don’t get the appeal of Traefik. If you want an easy to use reverse proxy that works well, pick nginx. Want something simple for self-hosting? Take a look at caddy. For Kubernetes, try out Envoy Gateway.

cab404 2 years ago |

Somehow, I find myself using Caddy everywhere I would use Træfik in the past.

vedmed 2 years ago |

I needed a reverse proxy the other week. OPNSense is my firewall. I tried traefik, but it was too complicated. So I installed caddy, and it was easy as pie. My .02

brainzap 2 years ago |

It would be nice if proxies were opinionated about typical URL use cases and offered an easy way to redirect www to non-www or handle paths with a missing trailing slash.
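
The two rules mentioned above are small enough to sketch in a few lines of Python. This is an illustration of the logic only, not any proxy's built-in behavior; `canonical_redirect` and the `directories` set (the paths that should end in a slash) are assumptions made for the example.

```python
from typing import Optional, Set


def canonical_redirect(host: str, path: str, directories: Set[str]) -> Optional[str]:
    """Return the URL to redirect to, or None if the request is already canonical."""
    # Rule 1: strip a leading "www." from the host.
    new_host = host[4:] if host.lower().startswith("www.") else host
    # Rule 2: add the trailing slash for known directory-style paths.
    new_path = path + "/" if path in directories else path
    if (new_host, new_path) == (host, path):
        return None
    return f"https://{new_host}{new_path}"


print(canonical_redirect("www.example.com", "/docs", {"/docs"}))
# https://example.com/docs/
```

Doing both rewrites in one step avoids chaining two redirects for a request that is wrong on both counts.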

firesteelrain 2 years ago |

We just started running Traefik in production since looking at self managed K8s was just too hard and complicated for what we were trying to do. We have an Ansible Docker compose service (that’s what we call it), that starts up the containers and auto registers the containers with Traefik. It works really well.

We are airgapped so can’t use Let’s Encrypt. We inject the certs into our containers via Ansible or Docker Compose.

methou 2 years ago |

The only problem I'm having with it is that it doesn't support unix domain sockets[0]. In a "cloud native" environment you rarely need them, but if you are running a single node this can be sweet.

-- [0]: https://github.com/traefik/traefik/issues/4881

meonkeys 2 years ago | |

Could you say more about how a non-network socket would be beneficial? I'm guessing simpler code and lower resource usage, but I'm curious what you're interested in. And by "single node", do you mean one server / one user (even if the user is, say, a single API consumer or whatever), or something else?

kubanczyk 2 years ago | | |

Any euid can connect to a localhost tcp socket. But a unix socket is protected with filesystem permissions (rwxrwxrwx, etc.).
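
A small Python sketch of that difference, assuming a Linux-like system: the socket file gets mode `0600`, so only its owner can connect, whereas a listener on `127.0.0.1` has no equivalent per-user gate. The helper names and the socket path are made up for the example.

```python
import os
import socket
import stat


def listen_private(path: str, mode: int = 0o600) -> socket.socket:
    """Bind a Unix stream socket whose file permissions are restricted to `mode`."""
    srv = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    # Tighten the umask so the socket file is never created more open than `mode`.
    old_umask = os.umask(0o777 & ~mode)
    try:
        srv.bind(path)
    finally:
        os.umask(old_umask)
    os.chmod(path, mode)  # set the mode explicitly as well
    srv.listen()
    return srv


def socket_mode(path: str) -> int:
    """Return the permission bits of the socket file."""
    return stat.S_IMODE(os.stat(path).st_mode)
```

Calling `listen_private("/run/myapp/app.sock")` (a hypothetical path) and then `socket_mode` on it should report `0o600`. Granting a proxy access then becomes a matter of file ownership or group membership rather than firewall rules.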

btbuilder 2 years ago |

When looking for a reverse proxy that is performant on Windows and Linux around 5 or 6 years ago the options were very limited. Traefik is what we ended up using.

I haven’t checked recently but at the time nginx on Windows used select() and envoy was either beta or needed a recent version of the Windows kernel that not all customers were running.

We still use it today.

xorcist 2 years ago |

After running into traefik a couple of times I have yet to see a deployment where it does not consume more cpu cycles than the microservices it is fronting.

From a casual glance it does nothing haproxy doesn't already do, at a fraction of the cpu cost.

lakomen 2 years ago |

Traefik is considerably slower and more resource hungry than nginx. There is nothing more to say.

woopwoop24 2 years ago |

I had such a hard time learning traefik and transitioning to v2. I don't fall into the standard case: I wanted to use traefik for containers that are not running on the same host (you cannot annotate containers with labels as the docs suggest if the container is on another host). The docs were sparse, and I also didn't want to use env vars for the traefik config, so it took a bit of fumbling and reading. Eventually I figured it out, but I was almost on the verge of going back to haproxy.

muhehe 2 years ago |

In the future our company will migrate to k8s. It looks like it will be openshift, specifically. Do we need this in openshift or is there some "native" mechanism baked in?

verdverm 2 years ago | |

You'll likely have an ingress controller provided with openshift, which tends to be more batteries included. There are quite a few options: https://kubernetes.io/docs/concepts/services-networking/ingr...

siva7 2 years ago |

It’s nice if you’re running a bare metal server on hetzner or DO but in the age of cloud platforms like aws or azure there is hardly a need for traefik.

PennRobotics 2 years ago | |

Even on Hetzner, it's not amazing and not a one-click workflow.

Load their Photoprism image on a standard server with only IPv6 (as a v4 address costs extra) and certificates will not get generated; logs point to Traefik although the solution is modifying Dockerfiles; thanks Dockerphiles, for insisting your software is the answer to everything server...

kopadudl 2 years ago |

When my company looked at different proxies for k8s, we ended up on traefik because we had experience with it from docker swarm and it has a dashboard.

cvalka 2 years ago |

The holy three: Caddy, Envoy, Traefik. Do not use nginx and haproxy.

iansinnott 2 years ago |

> Traefik is more comparable to HAProxy than to nginx/caddy/apache2

Aren't caddy and traefik fairly comparable? I've only used them both lightly so I may be missing the core point of each, but I thought of them as very similar.

candiddevmike 2 years ago | |

Traefik can't serve static files, or interact with CGI providers like PHP.

justusthane 2 years ago | |

The rest of the sentence you quoted explains that nginx, Caddy, and Apache are all webservers (which can also reverse proxy). Traefik and HAproxy are only reverse proxies and not webservers.

IggleSniggle 2 years ago | | |

HAProxy can be a web server though, albeit it is not designed to operate this way and thus requires some goofy configuration to make happen. I only know this because it was useful for me while working on a HAProxy extension.

thinkmassive 2 years ago | |

Caddy is primarily a web server like nginx and apache httpd. Traefik and HAproxy are primarily reverse proxies.

mholt 2 years ago | | |

Caddy is actually used as a reverse proxy more than a static file server. It's equally excellent and proficient as both! Caddy's functionality is comparable to nginx, apache httpd, and haproxy.

mkesper 2 years ago | |

Caddy is at the same level as nginx/apache. It is able to do everything a web server is expected to (serving web sites, files and proxying services) plus handling LetsEncrypt automatically. It does not, afaik, do dynamic service discovery like traefik nor load balancing of TCP at the protocol layer, like e.g. haproxy. https://caddyserver.com/features

mholt 2 years ago | | |

Caddy can absolutely do both of those things.

- https://caddyserver.com/docs/modules/http.reverse_proxy.upst...

- https://github.com/mholt/caddy-l4

baobun 2 years ago | | |

Just to add on, haproxy does service discovery too.

https://www.haproxy.com/blog/consul-service-discovery-for-ha...

znpy 2 years ago |

> you mount the docker socket into the traefik container and gain the ability to auto-detect other containers that you might want to expose using traefik.

Totally not a security issue. Source: trust me bro.

xorax 2 years ago | |

https://github.com/traefik/traefik/issues/4174

meonkeys 2 years ago | | |

https://doc.traefik.io/traefik/providers/docker/#docker-api-...

https://www.reddit.com/r/Traefik/comments/g46lhh/does_bindin...

https://github.com/wollomatic/traefik-hardened

1oooqooq 2 years ago |

> “Server Name Indication” (SNI)

into the trash it goes. anyone who supports https everywhere and even slightly tolerates SNI is a fool.

ajnin 2 years ago | |

I don't see why you're opposing HTTPS everywhere and SNI, HTTP already had the Host header so it is not a new information leak.

It's pretty much mandatory if you intend to serve multiple domains with different certificates from the same host/proxy, which seems like a very very common use case, and there is no alternative to this right now.

1oooqooq 2 years ago | | |

I don't see how you think SNI doesn't nullify https everywhere.

"we need MitM for performance". listen to yourself. if some optimization breaks security, you do not optimize.

d-z-m 2 years ago | |

can you elaborate?

1oooqooq 2 years ago | | |

SNI = nsa backdoor into https everywhere.

basically it moves private info in the plain text header "for edge performance"

nderjung 2 years ago |

If you're looking for an alternative way to run traefik, we support this out-of-the-box on https://kraft.cloud -- A platform dedicated to running ultra-lightweight VMs based on Dockerfiles, with millisecond cold start times (96ms for Traefik), scale-to-zero, autoscale.

Check it out in our docs: https://docs.kraft.cloud/guides/traefik/

It's also possible to start traefik and other services together using Compose files: https://docs.kraft.cloud/guides/features/compose/

    # traefik.yaml
    # Enable API and Dashboard
    api:
      dashboard: true

    # Define entry points
    entryPoints:
      web:
        address: ":80"
      app1:
        address: ":8081"
      app2:
        address: ":8082"

    security.acme = {
      acceptTerms = true;
      defaults.email = "admin-email@provider.net";
      certs."mydomain.example.com" = {
        domain = "*.mydomain.example.com";
        dnsProvider = "cloudflare";
        environmentFile = "/path/to/cloudflare/password";
      };
    };

    services.caddy.enable = true;
    services.caddy.virtualHosts."subdomain1.mydomain.example.com" = {
      extraConfig = ''
        reverse_proxy 127.0.0.1:1234
      '';
      useACMEHost = "mydomain.example.com";
    };