AWS S3 open source alternative written in Go

AWS S3 open source alternative written in Go(minio.io)

405 points by krishnasrinivas 9 years ago | 131 comments

Ixiaus 9 years ago |

Or, run Riak with their S3 compatibility layer. Riak is extremely stable and the work Basho has done to make a truly robust distributed database is significant.

http://docs.basho.com/riak/cs/2.1.1/

viraptor 9 years ago | |

Other alternatives:

ceph - http://docs.ceph.com/docs/master/radosgw/s3/

swift - https://wiki.openstack.org/wiki/Swift/APIFeatureComparison#A...

theanalyst 9 years ago | | |

Also ceph (& swift) are known to scale well in prod. clusters with over 30+ PB of data (at least looking at CERN's cluster) and the latest version of RGW does support geographic redundancy for S3 like apis

dekobon 9 years ago | | |

Don't forget:

manta - https://www.joyent.com/manta

dc2447 9 years ago | | |

CEPH is a volume service not an object storage service.

SWIFT is indeed analogous to S3.

Natales 9 years ago | |

+1 on RiakCS. They now call it RiakS2 for kicks. The scalability and reliability of their server is insane. You just can't beat Erlang software in that regard.

Unfortunately, Basho has been so successful with their TSDB and KV products that they have basically put S2 on maintenance mode. They are still "supporting" it, but no new features. I was hoping this Minio tool could do something similar, but with a single daemon is a single point of failure. Unacceptable for serious deployments.

bramd 9 years ago | | |

Another interesting project written in Erlang is LeoFS: http://leo-project.net/leofs/

jdboyd 9 years ago | | |

Considering how many serious deployments still use non-clustered NASs, a single node object store seems equally reasonable.

owyn 9 years ago | | |

That sounds pretty nice. If it works does it need new features? :)

corobo 9 years ago | |

There's also Skylable's SX Cluster if you use the libres3 daemon with it. Been using it for over a year with no problems. Set, forget, add more nodes when I need more disk.

Everyone's got their s3 of choice, always good to have more options on the table.

https://www.skylable.com/products/sx/

https://www.skylable.com/products/libres3/

ranman 9 years ago | |

Ran Riak CS in production and had constant issues. It's not terrible but it's also not ideal. I would caution against anyone depending on it for mission critical systems. Many of the failure modes are undocumented.

0xmohit 9 years ago | | |

Could you elaborate on some of the specific issues you ran into?

mprev 9 years ago | |

There's also Pithos from Exoscale. Runs on top of Cassandra. Code is Clojure and open source. Http://Pithos.io

merb 9 years ago | |

guess it's possible, but riak is not designed to run on a single node. Guess even basho suggests using at least a 5 node cluster.

unlocksmith 9 years ago | | |

Minio is deliberately designed this way. Cloud native applications require strict multitenancy. Minio's approach is to build just enough to meet a single tenant's requirement. Deploy one minio server per tenant or user or customer .. whichever fits you the best. This will allow you to upgrade, customize or bug fix in isolation. To replicate for HA, use "mc mirror -watch SOURCE TARGET" command to pair them up. If you have multiple drives (JBOD), you can eliminate RAID or ZFS and use Minio's erasure code to pool them up. Distributed version is also in testing at the moment. It should be out in a month.

dc2447 9 years ago | |

Dont you have to pay for an enterprise licence if you want multi region/datacentre/AZ?

davidu 9 years ago |

Theory here is that people will build apps that talk to S3. But sometimes those apps might need to run inside the perimeter and can't talk to the cloud. So rather than rewrite an app to talk to a new internal datastore, you just point it at a locally hosted Minio and you're up and running.

Smart.

tomjakubowski 9 years ago | |

Versions of this (S3-compatible service for development use) have existed for years. One I used was https://github.com/jubos/fake-s3

notyourwork 9 years ago | |

What kind of situations do you see this becoming a factor in? 5 or 10 years ago this was an issue with early cloud adopters. Now a days cloud providers are ramping up their DCs to be compliant and allow companies/government entities with strict policies to still onboard.

Its a good strategy but not one that I see being exercised frequently enough.

extrapickles 9 years ago | | |

The software I work on is targeted towards customers who generally have really spotty internet connections (eg: they all are in the less forgiving parts of the ocean or middle of nowhere if on land). This pretty much mandates using software like this to build out your app as you can't rely on internet connectivity.

There pretty much isn't anything you can do to improve their internet connections as cables to remote places are always getting dug up with week+ times to repair so you need something that can run locally for long periods. Ships have a different problem with very slow speeds that effectively means you can only transmit the absolute minimum off the ship when its out as sea (when they are at port they typically have normal internet connections to bulk dump data off on).

hhandoko 9 years ago |

I switched from Fake S3 [1] to Minio for local development. Fast and lightweight, good experience so far :)

Easy to setup with Vagrant, and linking / sharing the Minio shared folder to the host makes it quite convenient to quickly check the files without going to the UI [2].

[1] - https://github.com/jubos/fake-s3

[2] - It stores the files as-is in the local filesystem (files in folders, unchanged), as opposed to having it 'wrapped' like Fake S3 does.

krishnasrinivas 9 years ago |

Minio will always be 100% free software / open source. We have no plans to add any proprietary extensions or hold back on features for paying customers only. -- Minio Team

cyphar 9 years ago | |

Then why not make the license AGPLv3-or-later, to avoid other people creating proprietary forks? I get that it's not a common occurence within the Golang world, but nothing will change unless more Golang projects start making their code copylefted.

y4m4b4 9 years ago | | |

GNU AGPL is an ideal license for free software projects. We are a strong supporter of the GNU project. We chose Apache License for Minio purely for adoption reasons. Most of our users build proprietary software around Minio and their legal council has a default NO policy towards GNU licenses. Besides, FSF has also approved Apache License v2 as a free software license.

Proprietary forks are OK with us. It will be too expensive to maintain branches of their own and catch up with the upstream.

bjoerns 9 years ago |

After evaluating a couple of options mentioned in the other comments here, we recently replaced our in-house built s3 clone with minio for our on-prem version of our app. Very robust and stable.

matt_wulfeck 9 years ago | |

Keep in mind that there are plenty of object stores that are robust and stable until you put 1 billion keys in them.

bjoerns 9 years ago | | |

That's a very good point - but for what we do (on-premise version control for Excel where each workbook version represents one object) we won't be getting even close to that number. But yes, agreed, it entirely depends on your use case.

fizzbatter 9 years ago |

Does this have the ability to mirror to an encrypted remote? I'm looking for something like this for a simple home storage server, but emphasis on being able to replicate to something like B2 Storage for cheap backup.

Currently Infinit.sh has my attention the most, but it's quite young still.

edit: https://news.ycombinator.com/item?id=12125344 this thread seems to be talking about what i want. With that said, i'm not yet sure if `mc mirror` supports Backblaze, as that (per price point) is my prime need

rsync 9 years ago | |

Current opinion is that "borg" is the holy grail of backup schemes ... it takes attic, which fixed all of the duplicity shortcomings, and improved on that ... [1]

We[2][3] tend to agree with that.

One reason it might not work for you is that we are an order of magnitude more expensive than B2, so perhaps that's a better bet for you. On the other hand, $7.20 per year for our smallest borg account is almost as close to zero as your B2 minimum order would be, so ... who knows.

One upside of choosing our service is that you can choose your location (US, Zurich, HK, etc.)

[1] https://www.stavros.io/posts/holy-grail-backups/

[2] rsync.net

[3] http://www.rsync.net/products/attic.html

RubyPinch 9 years ago | | |

from [3]

> If you're not sure what this means, our product is Not For You.

Please don't do that, its childish and unimpressive.

krishnasrinivas 9 years ago | |

Minio is object-storage server. You can use https://github.com/restic/restic to encrypt and mirror to remote minio server. For more help https://docs.minio.io/docs/restic-with-minio

frugalmail 9 years ago |

The canonical open source alternative to S3 https://wiki.openstack.org/wiki/Swift

hansjorg 9 years ago | |

Riak CS is another one:

https://github.com/basho/riak_cs

ranman 9 years ago | | |

Ran this in production and dealt with a lot of issues. I would caution people against it's use in anything critical or customer facing.

spudfkc 9 years ago | |

I use Swift at work, and while it is a great tool, it is a bitch to set up. I would be curious to learn how Minio works more technically on a distributed level: how is object replication handled? are downloads automatically routed to the closest server? can I make downloads temporarily available (think Swift tempURLs)?

y4m4b4 9 years ago | | |

We are currently working on the distributed version and will be making a beta release soon.

Currently minio supports

- pure FS backend with single disk - pure Erasure coded backend with multiple disks on single node (like ZFS)

For more information you can read here - https://docs.minio.io/docs/minio-erasure-code-quickstart-gui...

We do not do any sort of replication erasure code handles disk failures and we also implement transparent bit-rot protection as well.

To replicate one setup to many you can use 'mc mirror -w' which would watch on events and do continuous replication.

Relevant docs can be found here

https://docs.minio.io/docs/minio-client-complete-guide#mirro...

llambiel 9 years ago | |

Also http://pithos.io/ backed by Cassandra

kjetijor 9 years ago | |

And another good example is ceph+radosgw.

majewsky 9 years ago | |

I run OpenStack Swift at work, currently working on deploying it on Kubernetes. Swift is a very fine piece of software, with a pleasant operations experience, but it will take a lot of time to set up initially. Plan at least half a man-year until you have it all up and running.

cdnsteve 9 years ago |

Practical use case:

- Spin up a bunch of droplets on DigitalOcean, because I want reliability, etc.

- What's the best way to share drive space across these to create a single Minio storage volume, so if one DO node goes away I don't lose my stuff?

krishnasrinivas 9 years ago | |

We are working on distributed minio https://github.com/minio/minio/tree/distributed

The minio available today for production use can export single disk or aggregate multiple disks on the same machine using erasure coding.

For this, if you want backup you can use github.com/minio/mc tool to mirror, more help here https://docs.minio.io/docs/minio-client-complete-guide#mirro...

killbrad 9 years ago | | |

I think this should be made clear on your site. I spent a good amount of time trying to figure out how to actually get this to be distributed, but the answer is - you don't. So it's only like S3 in interface, not in durability or availability.

SteveNuts 9 years ago | |

So far the best option I've found has been GlusterFS

krishnasrinivas 9 years ago | | |

Minio is by ex-GlusterFS developers!

squiguy7 9 years ago | |

I was going to suggest using their new block storage but I read the docs some more:

> A volume may only be attached to one Droplet at a time. However, up to five volumes can be attached to a single Droplet.

Looks like you would have to roll your own solution.

bryanlarsen 9 years ago |

minio works awesome for dev & test deployments. It's dead simple to set up, just a single executable. Hopefully it doesn't lose that simplicity as it grows up and gains features.

tbrock 9 years ago | |

It's a go binary, that's just how they work.

Keyframe 9 years ago |

Sorry for two posts (the other one was unrelated). If anyone has experience with this I have a few questions regarding a particular use case.

How does something like this behave with really large files. Video files in 100s of gigabytes, for example. I'm asking because if one could set up a resilient online (online as in available) storage with fat pipes like this it could be used as a platform to build a centralized video hub for editing. It's another question how much sense would it make over a filesystem though.

zx2c4 9 years ago |

Their CLI client is called `mc`. This is an unfortunate conflict with the venerable Midnight Commander.

andrewchambers 9 years ago |

I love the website. I'm a lone developer who doesn't know any HTML, how would I go about getting such a nice design for my own projects? (Or how much would it cost)

zbuttram 9 years ago | |

Wappalyzer (https://wappalyzer.com/) tells me they're using Bootstrap (http://getbootstrap.com/) (probably customized a bit). HTML isn't very difficult (just another markup language) and if you're not inclined toward design (I am also not) there are a plethora of CSS frameworks to choose from (like Bootstrap) that will get you up and running with something not completely ugly. Personally I like Bulma (http://bulma.io/) right now which showed up (I think as a Show HN) on here a while back. Currently using it for a project and I'm enjoying it.

andrewchambers 9 years ago | | |

Really my design sense isn't great, given time I can hack together something with bootstrap, but I do think I lack the designer training and probably instincts

jedisct1 9 years ago |

Or run LeoFS http://leo-project.net/leofs/

Keyframe 9 years ago |

Unrelated question. What's the point of fullscreen button on those term session players (or whatever they are) if it doesn't stretch the playback to fullscreen? You only get a same-sized screen with black around it. It's not even centered to the screen.

eknkc 9 years ago | |

I guess it is https://asciinema.org but their samples have centered full screen. Maybe a CSS issue here.

I'm not sure about the point either. Maybe if you embedded a small player it would be zoomed out and fullscreen would show the native style.

jdc0589 9 years ago | | |

all my brain sees in the domain name is "ascii enema"

nulagrithom 9 years ago |

Is this just meant to emulate S3 for the sake of dev/test environments? Without clustering/HA I don't really see the point of using this over the plain old file system. Or am I missing something?

krishnasrinivas 9 years ago | |

Absolutely, our focus currently is on multi-server minio which is being actively developed on the "distributed" branch https://github.com/minio/minio/tree/distributed

Our current stable version can export single disk or multiple disks (using erasure coding providing protection against disk failures) As it is very easy to get started with (single binary, thanks to Go) people find it attractive for dev/test environments.

To replicate for HA (even for the single server version), use "mc mirror -watch SOURCE TARGET" command to pair them up. If you have multiple drives (JBOD), you can eliminate RAID or ZFS and use Minio's erasure code to pool them up. Distributed version is also in dev/testing at the moment. It should be out in a month.

olalonde 9 years ago |

Previous discussion: https://news.ycombinator.com/item?id=12122998

helper 9 years ago |

How easy is it to embed this into go tests? Right now I use goamz/s3test for that, but it has a lot of limitations.

khc 9 years ago | |

goofys and s3fs both use s3proxy for this, which works fine as long as you are ok with having Java as a test dependency: https://github.com/kahing/goofys/blob/master/test/run-tests....

y4m4b4 9 years ago | |

Quite easy actually you can look at

https://github.com/restic/restic/blob/master/run_integration...

helper 9 years ago | | |

I don't want to run it in an external process, I want to run it in a goroutine.

scoopr 9 years ago |

So, I can use midnight commander as the client? ;) (half joking, half serious)

unboxed_type 9 years ago |

Why is it so important what language it is written in? :-)

LoSboccacc 9 years ago |

couldn't find at a glance wheter it has the same read after write issue of s3, or in general what the consistency is.

also, failure and backup modes.

kparthas 9 years ago | |

Minio server provides read-after-write consistency. For fault-tolerance, * protection against failed disks, you could deploy Minio erasure code setup. ref: https://docs.minio.io/docs/minio-erasure-code-quickstart-gui...

* Minio erasure code setup also provides protection against "bit-rot".

muminoff 9 years ago |

Do you guys have plans with multi-tenancy feature?

koolhead17 9 years ago | |

Absolutely, we are working on it. Please visit our "distributed" branch https://github.com/minio/minio/tree/distributed

anonymous7777 9 years ago |

ok tired of people bragging about "Go". It underperforms than many GC based languages that are out there.

RubyPinch 9 years ago | |

Generally, if you comment less about Go, then you end up in less discussions about it

beastman82 9 years ago |

written in Go - Does this matter?

mrweasel 9 years ago | |

Yes and no, if you're in the market for an S3 clone, but want to be able to add features, fix bug or hack on it in some way, it nice to know which language it's being developed in.

As you can tell from the other comments, there's plenty of alternatives to pick from, and if you're going to dive in to the code yourself the language may be a deciding factors.

unboxed_type 9 years ago | | |

It is important, because you will not find any Go-developers on the market, so if you are serious about using it then think twice ;)