I just want simple S3

blog.feld.me

118 points by g0xA52A2A 3 days ago


keyle - 4 hours ago

   I just need something that can do S3 and is reliable and not slow.
Oh, simply that.

I'm a simple man, I just need edge delivered cdn content that never fails and responds within 20ms.

tptacek - 5 hours ago

Since this has come up 4-5 times on the thread already, the clear subtext of this post is that this developer wants to build to the S3 API, but run their storage locally --- maybe for testing reasons, maybe for data hygiene reasons, maybe for performance reasons. So things like "what about Hugging Face's object storage product" don't really answer their question.

PunchyHamster - 5 hours ago

S3 isn't "simple" tho.

It doesn't need to care about POSIX mess but there is whole swathes of features many implementations miss or are incomplete, both on frontend side (serving files with right headers, or with right authentication) and backend (user/policy management, legal hold, versioning etc.)

It gets even more messy when migrating, for example migrating your backups to garagefs will lose you versioning, which means that if your S3 secret used to write backups gets compromised, your backups are gone vs on implementation that supports versioning you can just rollback.

Similarly with password, some will give you secret and login but won't allow setting your own so you'd have to re-key every device using it, some will allow import, but only in certain format so you can restore from backup, bot not migrate from other software.

jerf - 6 hours ago

I think we get a "S3 clone" about once every week or two on the Golang reddit.

It strikes me as a classic case of "we need all the interested people to pull in one project, not each start their own". AI may have made this worse then ever.

estebarb - 3 hours ago

Personally I would suggest that the "easiest S3" would be simply using NFS. You can get replication with RAID.

S3 is simple for the users, not the operators. For replicating something like S3 you need to manage a lot of parts and take a lot of decisions. The design space is huge:

Replication: RAID, distributed copies, distributed erasure codes...

Coordination: centralized, centralized with backup, decentralized, logic in client...

How to handle huge files: nope, client concats them, a coordinator node concats them...

How will be the network: local networking, wan, a mix. Slow or fast?

Nature of storage: 24/7 or sporadically connected.

How to handle network partitions, pick CAP sides...

Just for instance: network topology. In your own DC you may say each connection has the same cost. In AWS you may want connections to stay in the same AZ, use certain IPs for certain source-destination to leverage cheaper prices and so on...

CobrastanJorji - 6 hours ago

This is an interesting write up, but I'm curious about the use case. If you don't need to scale, and you don't need to replicate anything, why do you want S3 specifically? Are you using a tool that wants to write to something S3-like? Do you just like reading and writing objects via HTTP POST and GET? Are you moving an app to or from the cloud?

jdbohrman - 42 minutes ago

Wouldn't Blossom fit this? https://github.com/hzrd149/blossom

pveierland - an hour ago

Garage has worked well for me and gives a good sense of stability. They provide helm charts for deployment and a CLI. There's also very few concepts to learn to start to use it, while e.g. for SeaweedFS I feel like you need to parse a lot of docs and understand more specific terminology.

siliconc0w - 34 minutes ago

I use rustfs (for local development, not scaled usage) and it seems solid.

panarky - 7 hours ago

Sounds like you want S4. Super simple storage service.

K0IN - 2 hours ago

Not that long ago someone on hn poster this [0] a zig based s3 server in 1k lines, (warning not production ready) but if you really look for something simple, it might fit your case.

[0] https://news.ycombinator.com/item?id=46421196

rtpg - 2 hours ago

Is the problem here that everyone wants a different like 45% of the S3 API? Or is it that minio sucked all the oxygen out of the air in this space by being good at this, and now we need something else to show up?

didgetmaster - 3 hours ago

Better title: I just want local storage with a simple S3 interface.

0xbadcafebee - 3 hours ago

Call me crazy, but wouldn't 15 minutes on GLM 5.1 produce a working implementation? I haven't looked at the code, but a non-production-grade Go implementation can't be that complicated.

Edit: Minio is written in Go, and is AGPL3... fork it (publicly), strip out the parts you don't want, run it locally.

ChromaticPanic - 2 days ago

Garage "unnecessarily complex" . If anything it's the simplest solution in the list especially compared to Ceph or Apache Ozone

therealmarv - 3 hours ago

Settled with SeaweedFS for replacing minio and getting a good chunk of S3 feature parity. I wonder about the problems OP is posting about. Never seen that behaviour but usually only having a bunch of smaller files.

uroni - 6 hours ago

I made https://github.com/uroni/hs5 -- focus is on single node and high performance. So plenty of alternatives available.

singhrac - 4 hours ago

I wanted to try NVIDIA’s aistore for our datasets, but I couldn’t figure out how to get a small version up and running so I gave up (a few years ago, today I’d get an LLM to show me how k8s works).

lewtun - 6 hours ago

Hugging Face Buckets are pretty simple: https://huggingface.co/docs/huggingface_hub/en/guides/bucket...

Disclaimer: I work at HF

grizzletooth - 3 hours ago

Check out Floci. It is a self hosted AWS clone with multiple services functional, including S3 and Dynamodb.

https://github.com/floci-io/floci

moondev - 7 hours ago

microceph is pretty nice and straightforward for throwaway s3 endpoints

https://canonical-microceph.readthedocs-hosted.com/stable/tu...

amarsahinovic - 6 hours ago

S3-compatible storage solution: https://www.hetzner.com/storage/object-storage/

scottfits - 6 hours ago

100% - i really wanted Render to add this, feels like there is potential for a startup here

nate - 6 hours ago

I only recently realized how much I like using Cloudflare more than AWS :) R2 (their version of S3) is no exception. Much more pleasant figuring out how to use and configure it in Cloudflare than the craziness inside AWS.

pkghost - 6 hours ago

Based on the list of contenders feels like you might be missing rsync.net?

nhumrich - 4 hours ago

Well, OP, your requirements section is seriously lacking. You need "s3", but only local, non horizontally scalable?

You failed to answer why you even need s3... Why not a filesystem? Full stop. The entire point of s3 is distributed.

phibz - 5 hours ago

Why do rust compile times matter for a production deployment?

otterley - 2 days ago

So use S3.

hybirdss - 4 hours ago

someone is 100% going to write the 'i just want simple S4' post next month

- 7 hours ago
[deleted]
deepjoy - 3 hours ago

[dead]