Pgo: The Postgres operator from crunchy data(github.com) |
Pgo: The Postgres operator from crunchy data(github.com) |
Before using Crunchy Data, I'd read their terms of use.
"without an active Crunchy Data Support Subscription or other signed written agreement with Crunchy Data are not intended for... using the services provided under the Program (or any part of the services) for a production environment, production applications or with production data"
https://hub.docker.com/r/crunchydata/crunchy-postgres
If you look at their Docker Hub images, you'll see that they're provided under the terms of use of the Crunchy Data developer program which means you can't use them in production without an active subscription.
Maybe I'm reading it wrong, but if that's the case Crunchy Data should definitely change their terms of service.
https://www.percona.com/blog/2021/05/26/percona-distribution...
Percona certainly seems to think you can't use the Crunchy Data images in production saying, "CrunchyData container images are provided under Crunchy Data Developer Program, which means that without an active contract they could not be used for production."
It is more than "Percona Thinks" We have number of customers who started using Crunchy Kubernetes Operator based solution thinking it is Open Source and were contacted by Crunchy Sales team to indicate they need subscription to use it.
This was one of the reasons for them to move to Percona Operator for PostgreSQL which does not require any commercial relationship with Percona to use in practice and completely Open Source
https://www.percona.com/doc/kubernetes-operator-for-postgres...
We decided to go even further with the CloudNativePG operator.
EDB as original creator has decided to donate the intellectual property of the source code to the community, open sourced the existing operator under Apache License 2.0 to apply for the CNCF Sandbox. The project not only includes the operator, but also the PostgreSQL operand images - which can be customized (we provide details on how images should be).
We genuinely welcome other vendors to participate in the community and contribute to the project, including by offering professional services around it. Our multi-year commitment is to become a graduated CNCF project.
For more information: https://cloudnative-pg.io/
Basically, the license seems to be (and IANAL): we provide the software for development purposes...without a subscription, it's only intended for development purposes
They didn't say "we provide this software for your use...without a subscription, it's only intended for development purposes". They basically said: we provide this software for development purposes, it's only intended for development purposes.
If it's only a disclaimer, where's the grant of rights to use the software beyond development purposes?
Yes, if I were a lawyer defending a user, I'd definitely be arguing your point. However, I think Crunchy Data's lawyer would simply point out that there's literally no grant in the license for non-development purposes. Maybe a judge would take pity on you given that it seems hidden, has some ambiguity (though maybe it's not ambiguous to a lawyer), and because they allow you to spin up the operator basically without ever knowing these terms exist.
Given that there are many other PostgreSQL operators from companies like Percona (which I think has a great and long track record of supporting open source databases), EnterpriseDB, and Zalando, I don't see why I'd want to choose Crunchy Data.
You might be right and I think you would be right if only looking at the piece I quoted, but given that there's no general grant in the license that the "intended for" is merely a disclaimer for, it seems like the license grant is that they provide access for development purposes. IANAL and I'd rather work with software and companies where I don't have to be a lawyer. Crunchy Data could have said "We provide access to this software free of charge for any purpose. Without a subscription, it is unsupported and not intended for production use. We are not responsible for anything that happens if you use it in production." That's not what they said. They said that they provide it "free of charge for development purposes" with no grant for non-development purposes.
New player from Enterprise DB: https://github.com/cloudnative-pg/cloudnative-pg
[2] https://stackgres.io/extensions/
Disclaimer: founder of the project.
EDB is the original creator. The software is now entirely owned by a vendor neutral community, openly governed. We have applied for the CNCF sandbox and waiting for the approval at this stage.
Percona also has a PG operator.
I a am part of the community team. We have weekly live streams/blog posts about the topic and a Slack channel.
But honestly, the project always felt like a one-man-show, some (realy great) dev had a working set of scripts and kubernetized / operatorized it, but the whole thing feels hacky as hell.
I'd still give a try if I ever needed postgresql again, but I would also know that I need to implement (again) my set of scripts and hacks on top of it.
Back then, we evaluated Crunchy Operator's source code. Being primarily imperative and using an external tool for failover, where the two main reasons we decided to start a new project in 2019 which was entirely declarative and purely based on the Kubernetes API server for cluster status. Such project was released open source last April under the name CloudNativePG and hopefully it will enter the CNCF Sandbox soon (fingers crossed).
Couldn’t agree with you more, especially if you’re used to the quality of strimzi.
I think this is a super interesting space. What I mean by that is the fact we have Postgres, which is arguably one of the most successful publicly governed open source project out there, for many decades! That meats on of the most vibrant and transformative publicly governed projects as well, called Kubernetes (CNCF).
Why then consider a (proprietary) vendor governed (open source) project to bring these two technologies together? With CloudNativePG, you bring these two super strong communities together using these exact same governance principles, enabling everyone to benefit and contribute.
It is my conviction that this is going to be one of those elements that is going to contribute to the ongoing transformation of data management today.
It looks super solid. Unfortunately I can’t vouch for it in production yet since I still write my own resources but for me Zalando is #1 and Kubegres is either 2nd or 3rd.
https://www.reddit.com/r/PostgreSQL/comments/mqrsbn/kubegres...
We abandonded Zalando after failing to set it up properly. Don’t remember the details, but do remember that the setup is leaning towards the internal infrastructure at Zalando.
Regarding being opinionated I believe that it is what we expect from an operator. An operator simulates what human DBAs in this case would do. I am a maintainer of CloudNativePG, and I have been running and supporting PostgreSQL in production for 15+ years, creating also another open source software for backups (Barman). In CloudNativePG we have basically translated our recipes into Go code and tests.
Many people believe that databases should not run in Kubernetes. I not only believe the opposite, I believe that running Postgres in Kubernetes represents the best way, potentially, to run Postgres out there.
I've run into way too many exotic edge cases with kubernetes to trust an operator to do the right thing with data I care about. Most especially when the operator is also managing the replication and replicas and their underlying storage.
I am pro running datastores and other stateful workloads on kubernetes. I've been running databases on kubernetes since petSets.
(1) https://dok.community/wp-content/uploads/2021/10/DoK_Report_...
Feels like all the Postgres enterprises are making kubernetes operators these days -- probably seeing it as a way to stay in the game/relevant.
Just like before when I benefited from EDBs contributions to pg as a whole (and advancing the cause of pg), I look forward to benefiting again!
[EDIT: I made an error, I originally said that it was explicit about no license being granted for the software, but that was about things like no license for service marks and trade names.]
On the other hand, there is a license for postgres-operator and it looks like Apache 2.0:
https://github.com/CrunchyData/postgres-operator/blob/master...
For what's being distributed in their Docker container image, I imagine it depends on what they're actually distributing for that to matter. I expect that it's mostly other people's software (like PostgreSQL) and that the Docker page listing the container just too much of a summary to say anything about licensing. I'd investigate that further to clarify license status prior to use, but expect it to not be legally constrained to non-production use only.
- Focus on CRDs versus outside tools (crunchy used to insist on pgo and while it's not required, I don't really want another CLI to manage/use)
- Amount of open conversation, conference talks etc around Zalando's solution and why they've built their solution/how they've scaled it
- No specific need for pgadmin/badger/extra logging features (I might feel differently these days!)
- Customizable pods
These days both of these projects are very similar overall, and I haven't compared them recently.
Looking back at my notes I had this written down:
> Looks like Zalando is probably the winner (https://www.youtube.com/watch?v=WBdNVrffOSo)
The video is old and in Japanese but basically:
- scale in/out are similar
- RBAC control is similar
- Automatic version upgrades (only Zalando supported rolling update at the time)
- Zalando had PVC-driven volume resize vs Crunchy required cluster change
It's been a year so maybe they're probably even more similar now but I'd love to hear what I'm missing if I'm wrong.
Some more resources:
https://blog.flant.com/comparing-kubernetes-operators-for-po...
https://www.youtube.com/watch?v=3TFXztwat_s
These days I might lean Crunchy -- but a lot of the things that would be gaps (like no built in postgres support) I would actually opt to solve myself with a different operator (ex. Prometheus Operator) or just adding a deployment myself (when needed). Taking another quick look today, the differences I see are:
- customizable tablespaces with Crunchy
- local backups with pgbackrest + crunchy (and in general wal-g is preferred to wal-e so zalando is a little behind there)
Neither of them have citus though, which I think will be a huge game changer as a default-include for their pg images.
Yes. We have development version available already, check it out and give us some feedback - what else need to be done so it is Production Ready
That is why we took the approach to reduce the number of components and integrate everything in Kubernetes, especially with logging (we directly log in JSON to standard output) and the usage of application containers, which enables us to cover the case of troubleshooting via the fencing mechanism (your pods are up, you can access storage, but Postgres is down, giving you the possibility to check even possible data corruption issues).
Also, the status is directly available in Kubernetes, so in our view easier for Kubernetes administrators.
Finally, the source code is open source and directly available for inspection - if you want to understand what is happening.