AWS S3: Sometimes you should press the $100k button

AWS S3: Sometimes you should press the $100k button (cyclic.sh)

408 points by korostelevm 4 years ago | 233 comments

lenkite 4 years ago |

sigh. My team is facing all these issues. Drowning in data. Crazy S3 bill spikes. And not just S3 - Azure, GCP, Alibaba, etc since we are a multi-cloud product.

Earlier, we couldn't even figure out lifecycle policies to expire objects since naturally every PM had a different opinion on the data lifecycle. So it was old-fashioned cleanup jobs that were scheduled and triggered when a byzantine set of conditions were met. Sometimes they were never met - cue bill spike.

Thankfully, all the new data privacy & protection regulations are a life-saver. Now, we can blindly delete all associated data when a customer off-boards or trial expires or when data is no longer used for original purpose. Just tell the intransigent PM's that we are strictly following govt regulations.

CydeWeys 4 years ago | |

The data protection regulations really are so freeing, huh. It's amazing to be able to delete all this stuff without worrying about having to keep it forever.

jeff_vader 4 years ago | | |

In the case of my previous employer it led to an incredibly complicated encryption system. It took a couple of years to implement in maybe 10% of the system. Deleting any old data was rejected.

theshrike79 4 years ago | | |

Yep, having everything disappear at 2 months max is a life-saver.

That "absolutely essential thing" isn't essential any more when there is a possible GDPR/CCPA violation with a significant fine just around the corner.

whimsicalism 4 years ago | | |

now this is a spin i havent heard before.

StratusBen 4 years ago | |

Disclosure: I'm Co-Founder and CEO of a cloud cost company named https://www.vantage.sh/ - I also used to be on the product management team at AWS and DigitalOcean.

I'm not intentionally trying to shill but this is exactly why people choose to use Vantage. We give them a set of features for automating and understanding what they can do to manage and save on costs. We're also adding multi-cloud support (GCP is in early access, Azure is coming) to be a single pane of glass into cloud costs.

If anyone needs help on this stuff, I really love it. We have a generous free tier and free trial. We also have a Slack community of ~400 people nerding out on cloud costs.

vdm 4 years ago | | |

https://www.google.com/search?q=site%3Ahttps%3A%2F%2Fdocs.va...

I gave vantage.sh 5 minutes and did not see anything for S3 that is not already available from the built-in Cost Explorer, Storage Lens, Cost and Usage Reports, and taking 1 hour to study the docs https://docs.aws.amazon.com/AmazonS3/latest/userguide/Bucket...

Most "cloud optimisation" products want to tell you which EC2 instance type to use, but can't actually give actionable advice for S3. Happy to be corrected on this.

samlambert 4 years ago | | |

Vantage is a seriously awesome product. We love it at PlanetScale. Obviously being a cloud product things can get pricy and so Vantage is essential.

cookiesboxcar 4 years ago | | |

I love vantage. Thank you for making it.

imwillofficial 4 years ago | | |

I work on a team that computes bills, shoot me a Slack invite and perhaps I can offer insight.

candiddevmike 4 years ago | |

Are you multi-cloud because your customers need you to be multi-cloud?

lenkite 4 years ago | | |

Yes, geographically diverse customers who prefer different cloud platforms.

raxxorrax 4 years ago | |

I host stuff on AWS, but I am pretty sure that hosting on my own server or a server an IT service provider maintains is much cheaper.

tekknik 4 years ago | | |

Did you include maintenance, patching and machine upgrades? Cause likely it’s not.

liveoneggs 4 years ago |

I have caused billing spikes like this before those little warnings were invented and it was always a dark day. They are really a life saver.

Lifecycle rules are also welcome. Writing them yourself was always a pain and tended to be expensive with list operations eating up that api calls bill.

----

Once I supported an app that dumped small objects into s3 and begged the dev team to store the small objects in oracle as BLOBS to be concatenated into normal-sized s3 objects after a reasonable timeout where no new small objects would reasonably be created. They refused (of course) and the bills for managing a bucket with millions and millions of tiny objects were just what you expect.

I then went for a compromise solution asking if we could stitch the small objects together after a period of time so they would be eligible for things like infrequent access or glacier but, alas, "dev time is expensive you know" so N figure s3 bills continue as far as I know.

darkwater 4 years ago | |

> I then went for a compromise solution asking if we could stitch the small objects together after a period of time so they would be eligible for things like infrequent access or glacier but, alas, "dev time is expensive you know" so N figure s3 bills continue as far as I know.

This hits home so hard that it hurts. In my case it is not S3 but compute bills, but the core concept is the same.

WrtCdEvrydy 4 years ago | | |

Because the bill isn't a "dev problem". Once you move those bills to "devops", it becomes an infrastructure problem.

ac2022 4 years ago | | |

It is also because devops is shoved down devs' throats while claiming that it is easier and better. So now many developers don’t want to spend time rewriting their code for something that is supposed to reduce their workload, not increase it.

vdm 4 years ago | |

The warning should say "you have N million objects technically eligible for an archive storage class and hitting the button to transition them will cost $M".

Also S3 should no-op transitions for objects smaller than the break-even size for each storage class, even if you ask it to.
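A rough sketch of the break-even idea, assuming the only costs that matter are a one-time per-object transition fee and a per-GiB monthly saving in the cheaper class. The function names and the numbers in the usage lines are placeholders for illustration, not quoted AWS prices, and minimum storage durations or retrieval fees are ignored.

```python
GIB = 2**30

def break_even_bytes(transition_fee_per_object: float,
                     saving_per_gib_month: float,
                     months_retained: float) -> float:
    """Object size at which the storage saving equals the one-time transition fee."""
    if saving_per_gib_month <= 0 or months_retained <= 0:
        raise ValueError("saving and retention period must be positive")
    # fee = size_in_gib * saving_per_gib_month * months  ->  solve for size
    size_in_gib = transition_fee_per_object / (saving_per_gib_month * months_retained)
    return size_in_gib * GIB

def worth_transitioning(size_bytes: int,
                        transition_fee_per_object: float,
                        saving_per_gib_month: float,
                        months_retained: float) -> bool:
    """True only if the object is larger than the break-even size."""
    return size_bytes > break_even_bytes(
        transition_fee_per_object, saving_per_gib_month, months_retained
    )

# Placeholder inputs: fee of 0.01 per object, saving of 0.01 per GiB-month, kept 1 month.
print(break_even_bytes(0.01, 0.01, 1))          # break-even size in bytes
print(worth_transitioning(1024, 0.01, 0.01, 1))  # a 1 KiB object under those inputs
```

With millions of tiny objects, the per-object fee dominates, which is the case where a no-op would save money.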

sharken 4 years ago | |

I suppose it's not just dev time on the line, but also the risk of doing the change that is thought to be too high.

If I ever get to be a manager I'd go for an idea such as yours. Though I suspect too many managers are too far removed from the technical aspect of things and don't listen nearly enough.

asim 4 years ago |

The AWS horror stories never cease to amaze me. It's like we're banging our heads against the wall expecting a different outcome each time. What's more frustrating, the AWS zealots are quite happy to tell you how you're doing it wrong. It's the user's fault for misusing the service. The reality is, AWS was built for a specific purpose and demographic of user. Its complexity and scale now make it unusable for newer devs. I'd argue we need a completely new experience for the next generation.

rizkeyz 4 years ago |

I did the back-of-the-envelope math once. You get a Petabyte of storage today for $60K/year if you buy the hardware (retail disks, server, energy). It actually fits into the corner of a room. What do you get for $60K in AWS S3? Maybe a PB for 3 months (w/o egress).

If you replace all your hardware every year, the cloud is 4x more expensive. If you manage to use your ghetto-cloud for 5 years, you are 20x cheaper than Amazon.
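The arithmetic behind those two ratios can be written out as a quick sketch. It takes the comment's own figures at face value ($60K for a self-hosted petabyte, $60K buying roughly 3 months of a petabyte in S3) and assumes the hardware is bought once with no further running costs over its lifetime, which is a simplification.

```python
# Figures taken from the comment above; none of them are quoted price-list values.
HARDWARE_COST_PER_PB = 60_000   # USD for a self-hosted petabyte
S3_MONTHS_PER_60K = 3           # months of 1 PB in S3 that the same money buys

s3_cost_per_year = HARDWARE_COST_PER_PB * 12 / S3_MONTHS_PER_60K

def cloud_to_own_ratio(years_hardware_lasts: int) -> float:
    """How many times more S3 costs than hardware that is replaced every N years."""
    s3_total = s3_cost_per_year * years_hardware_lasts
    own_total = HARDWARE_COST_PER_PB  # assumption: one purchase, no extra running costs
    return s3_total / own_total

print(cloud_to_own_ratio(1))  # hardware replaced every year
print(cloud_to_own_ratio(5))  # hardware kept for five years
```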

To store one TB per person on this planet in 2022, it would take a mere $500M to do that. That's short change for a slightly bigger company these days.

I guess by 2030 we should be able to record everything a human says, sees, hears and speaks in an entire life for every human on this planet.

And by 2040 we should be able to have machines learning all about human life, expression and intelligence, and slowly making sense of all of this.

arein3 4 years ago | |

>I guess by 2030 we should be able to record everything a human says, sees, hears and speaks in an entire life for every human on this planet.

That's a very good point.

Are you employed?

Would you like to join Meta?

gmiller123456 4 years ago | |

I don't get what's going on with on-line storage. You can walk into Best Buy and get a few TB hard drive for well under $100. Yet every cloud service wants to charge you several times that per year for just 1TB. I understand drives fail, there's operating cost, and some need extremely low latency. But there seems to be a huge disparity between what a hard drive costs, and what it costs to make it available on the Internet.

tekknik 4 years ago | | |

There’s a difference between a consumer drive and a server drive. Plop that $100 drive in and you may be back in a week or so replacing it.

jwalton 4 years ago |

Your website renders as a big empty blue page in Firefox unless I disable tracking protection (and in my case, since I have noscript, I have to enable javascript for "website-files.com", a domain that sounds totally legit).

Sophira 4 years ago | |

The problem is that the DIV that contains the main text has the attribute 'style="opacity:0"'. Presumably, this is something that the JavaScript turns off.

A lot of sites like to do things like this for some reason. I haven't figured out why. I like to use Stylus to mitigate these if I can, rather than enabling JavaScript.

acdha 4 years ago | | |

This is a common anti-pattern — I believe they're trying to ensure that the web fonts have loaded before the text displays but it's really annoying for mobile users since it can add up to 2.5 seconds (their timeout) to the time before you can start reading unless you're using reader mode at which point it renders almost instantly.

MattRix 4 years ago | | |

The page animates in. I have no idea why it does, but it does, which explains why the opacity starts at 0%.

ectopod 4 years ago | | |

A lot of these sites (including this one) do work in reader view.

test1235 4 years ago | | |

mitigation for a flash of unstyled content (FOUC) maybe?

mst 4 years ago | |

I have tracking protection and ublock origin both enabled and it rendered fine (FF on Win10).

(presented as a data point for any poor soul trying to replicate your problem)

tazjin 4 years ago | |

Chrome with uBlock Origin on default here, and it renders a big blue empty page for me, too. That's despite dragging in an ungodly amount of assets first.

Here's an archive link that works without any tracking, ads, Javascript etc.: https://archive.is/F5KZd

moffkalast 4 years ago | |

Noscript breaking websites? Who woulda thunk.

How do you manage to navigate the web with that on by default? It breaks just about everything since nothing is a static site these days.

cj 4 years ago |

Off topic: for people with a "million billion" objects, does the S3 console just completely freeze up for you? I have some large buckets that I'm unable to even interact with via the GUI. I've always wondered if my account is in some weird state or if performance is that bad for everyone. (This is a bucket with maybe 500 million objects, under a hundred terabytes)

lloesche 4 years ago |

I had a similar issue at my last job. Whenever a user created a PR on our open source project, artifacts of 1GB size consisting of hundreds of small files would be created and uploaded to a bucket. There was just no process that would ever delete anything. This went on for 7 years and resulted in a multi-petabyte bucket.

I wrote some tooling to help me with the cleanup. It's available on Github: https://github.com/someengineering/resoto/tree/main/plugins/... consisting of two scripts, s3.py and delete.py.

It's not exactly meant for end-users, but if you know your way around Python/S3 it might help. I built it for a one-off purge of old data. s3.py takes a `--aws-s3-collect` arg to create the index. It lists one or more buckets and can store the result in a sqlite file. In my case the directory listing of the bucket took almost a week to complete and resulted in an 80GB sqlite.

I also added a very simple CLI interface (calling it virtual filesystem would be a stretch) that allows to load the sqlite file and browse the bucket content, summarise "directory" sizes, order by last modification date, etc. It's what starts when calling s3.py without the collect arg.
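For anyone wondering what summarising "directory" sizes from such an index might look like, here is a minimal sketch using Python's built-in sqlite3. The table layout (`objects` with `key` and `size`) and the helper names are made up for illustration and are not the schema the linked scripts use.

```python
import sqlite3

def build_index(rows):
    """Create an in-memory SQLite index from (key, size_bytes) pairs."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE objects (key TEXT PRIMARY KEY, size INTEGER)")
    db.executemany("INSERT INTO objects VALUES (?, ?)", rows)
    return db

def directory_sizes(db):
    """Sum object sizes per top-level 'directory' (the text up to the first '/')."""
    query = """
        SELECT substr(key, 1, instr(key, '/')) AS top_level,
               COUNT(*),
               SUM(size)
        FROM objects
        GROUP BY top_level
        ORDER BY SUM(size) DESC
    """
    return db.execute(query).fetchall()

# Hypothetical listing: build artifacts mixed with prod files.
db = build_index([
    ("builds/pr-1/a.bin", 100),
    ("builds/pr-2/b.bin", 300),
    ("prod/app.tar", 50),
    ("README", 5),
])
for top_level, count, total in directory_sizes(db):
    print(repr(top_level), count, total)
```

Keys without a "/" end up grouped under the empty string, which is a reasonable stand-in for the bucket root.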

Then there is delete.py which I used to delete objects from the bucket, including all versions (our horrible bucket was versioned which made it extra painful). On a versioned bucket it has to run twice, once to delete the file and once to delete the then created version, if I remember correctly - it's been a year since I built this.
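Some background on the "run twice" part: on a versioned bucket, a delete without a version ID only adds a delete marker, so the data and the marker both remain until they are removed by version ID. The sketch below is not what delete.py does; it shows one way to collect every version and delete marker in a single pass. It works on plain dicts shaped like ListObjectVersions responses, so the function name and the sample data are illustrative.

```python
def version_identifiers(pages):
    """Yield {'Key', 'VersionId'} for every object version and delete marker.

    `pages` is any iterable of ListObjectVersions-style response dicts, for
    example the pages produced by
    boto3.client("s3").get_paginator("list_object_versions").paginate(Bucket=...).
    """
    for page in pages:
        entries = page.get("Versions", []) + page.get("DeleteMarkers", [])
        for entry in entries:
            yield {"Key": entry["Key"], "VersionId": entry["VersionId"]}

# Hypothetical pages: one object with a data version plus a delete marker, and one more.
sample_pages = [
    {
        "Versions": [{"Key": "builds/a.bin", "VersionId": "v1"}],
        "DeleteMarkers": [{"Key": "builds/a.bin", "VersionId": "v2"}],
    },
    {"Versions": [{"Key": "builds/b.bin", "VersionId": "v3"}]},
]
for identifier in version_identifiers(sample_pages):
    print(identifier)
```

Each yielded dict has the shape DeleteObjects expects in its `Objects` list, so the results can be batched and deleted directly.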

Maybe it's useful for someone.

coredog64 4 years ago | |

AWS has an inventory capability for S3: https://docs.aws.amazon.com/AmazonS3/latest/userguide/storag...

k__ 4 years ago | |

What about the lifecycle stuff?

I thought, S3 can move stuff to cheaper storage automatically after some time.

lloesche 4 years ago | | |

Like I wrote for us it was a one-off job to find and remove 6+ year old build artifacts that would never be needed again. I just looked for the cheapest solution of getting rid of them. I couldn't do it by prefix alone (prod files mixed in the same structure as the build artifacts) which is why delete.py supports patterns (the `--aws-s3-pattern` arg takes a regex).

If AWS' own tools work for you it's surely the better solution than my scripts. Esp. if you need something on an ongoing basis.

ebingdom 4 years ago |

I'm confused about prefixes and sharding:

> The files are stored on a physical drive somewhere and indexed someplace else by the entire string app/events/ - called the prefix. The / character is really just a rendered delimiter. You can actually specify whatever you want to be the delimiter for list/scan apis.

> Anyway, under the hood, these prefixes are used to shard and partition data in S3 buckets across whatever wires and metal boxes in physical data centers. This is important because prefix design impacts performance in large scale high volume read and write applications.

If the delimiter is not set at bucket creation time, but rather can be specified whenever you do a list query, how can the prefix be used to influence where objects are physically stored? Doesn't the prefix depend on what delimiter you use? How can the sharding logic know what the prefix is if it doesn't know the delimiter in advance?

For example, if I have a path like `app/events/login-123123.json`, how does S3 know the prefix is `app/events/` without knowing that I'm going to use `/` as the delimiter?
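To make the list-time part of the question concrete, here is a small local simulation of how a prefix and delimiter group a flat set of keys, modelled on the documented ListObjectsV2 behaviour (matching keys versus rolled-up common prefixes). It only illustrates listing; it says nothing about how S3 partitions data internally. The function name and the extra keys are made up.

```python
def list_keys(keys, prefix="", delimiter=None):
    """Mimic a list call: return (keys listed directly, rolled-up common prefixes)."""
    contents, common_prefixes = [], set()
    for key in sorted(keys):
        if not key.startswith(prefix):
            continue
        if delimiter:
            rest = key[len(prefix):]
            cut = rest.find(delimiter)
            if cut != -1:
                # Everything up to and including the first delimiter is rolled up.
                common_prefixes.add(prefix + rest[:cut + len(delimiter)])
                continue
        contents.append(key)
    return contents, sorted(common_prefixes)

keys = [
    "app/events/login-123123.json",
    "app/events/logout-1.json",   # hypothetical extra keys
    "app/users/1.json",
    "readme.txt",
]

print(list_keys(keys, prefix="", delimiter="/"))
print(list_keys(keys, prefix="app/", delimiter="/"))
print(list_keys(keys, prefix="app/events/log", delimiter="-"))
print(list_keys(keys, prefix="app/ev"))
```

The stored keys never change between calls; only the grouping does, and the prefix is free to stop in the middle of what looks like a path segment.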

zmmmmm 4 years ago |

The rationale for using cloud is so often that it saves you from complexity. It really undermines the whole proposition when you find out that the complexity it shields you from is only skin deep, and in fact you still need a "PhD in AWS" anyway.

But as a bonus, now you face huge risks and liabilities from single button pushes and none of those skills you learned are transferrable outside of AWS so you'll have to learn them again for gcloud, again for azure, again for Oracle ....

pontifier 4 years ago |

DON'T PRESS THAT BUTTON.

The egress and early deletion fees on those "cheaper options" killed a company that I had to step in and save.

pphysch 4 years ago | |

On a related note, suppose the Fed raises rates to mitigate inflation and indirectly kills thousands of zombie companies, including many SaaS renting the cloud. What happens to their data? Does the cloud unilaterally evict/delete it, or does it get handled like an asset -- auctioned off, etc?

cmckn 4 years ago | | |

I’m not aware of a cloud provider that is contractually allowed to do such a thing (except maybe alibaba by way of the CCP). Dying companies get purchased and have their assets pilfered every day, the same thing would happen with cloud assets.

Uehreka 4 years ago | | |

> does it get handled like an asset -- auctioned off, etc?

Who would buy that? I guess if this happened enough then people would start "data salvager" companies that specialize in going through data they have no schema for looking for a way to sell something of it to someone else. I have to imagine the margins in a business like that would be abysmal, and all the while you'd be in a pretty dark place ethically going through data that users never wanted you to have in the first place.

Of course, all these questions are moot because if this happened the GDPR would nuke the cloud provider from orbit.

Aeolun 4 years ago | |

If they were already paying 100k per month for their storage, I doubt the additional 100k would severely impact their business.

Proven by the fact that they happily went on to pay the bill for the next 6 months.

Tehchops 4 years ago |

We’ve got data in S3 buckets not nearly at that scale and managing them, god forbid trying a mass delete, is absolute tedium.

pattycake23 4 years ago |

Here's an article about Shopify running into the S3 prefix rate limit too many times, and tackling it: https://shopify.engineering/future-proofing-our-cloud-storag...

sciurus 4 years ago | |

Their solution was to introduce entropy into the beginning of the object names, which used to be AWS's recommendation for how to ensure objects are placed in different partitions. AWS claims this is no longer necessary, although how their new design actually handles partitioning is opaque.

"This S3 request rate performance increase removes any previous guidance to randomize object prefixes to achieve faster performance. That means you can now use logical or sequential naming patterns in S3 object naming without any performance implications."

https://aws.amazon.com/about-aws/whats-new/2018/07/amazon-s3...

pattycake23 4 years ago | | |

Seems like it's a much higher rate limit, but it exists nonetheless, and Shopify's scale has also grown significantly since 2018 (when that article was written) - so it was probably a valid way for them to go.

wackget 4 years ago |

As a web developer who has never used anything except locally-hosted databases, can someone explain what kind of system actually produces billions or trillions of files which each need to be individually stored in a low-latency environment?

And couldn't that data be stored in an actual database?

rgallagher27 4 years ago | |

Things like mobile/website analytics events. User A clicked this menu item, User B viewed this image, etc. All streamed into S3 in chunks of smallish files.

It's cheaper to store them in S3 over a DB and use tools like Athena or Redshift spectrum to query.

wackget 4 years ago | | |

Wow. What makes it cheaper than using a DB? Is it just because the DB will create some additional metadata about each stored row or something?

gmiller123456 4 years ago | |

    And couldn't that data be stored in an actual database?

This is the "it's turtles all the way down" concept. A database is just going to store data in the file system, plus some extra overhead. Putting data in a database saves you nothing unless you actually need the extra functionality a database provides.

That overhead doesn't mean much if you have 10 users and 1gb of data. But it adds up in very large systems.

abhishekjha 4 years ago | |

An image service.

wackget 4 years ago | | |

Yeah that use-case I get. Binary files which would be difficult/impractical to index in a database.

However it feels like something at that scale will only ever realistically be dealt with by enterprise-level software, and I'd hazard a guess that most developers - even those reading HN - are not working on enterprise-level systems.

So I'm wondering what "regular devs" are using cloud buckets for at such a scale over regular DBs.

gnulinux 4 years ago | |

My company gets sensor data from millions of devices and records it. Happens all day, all around the world. It adds up. If you don't delete that data, it becomes petabytes. Thank god GDPR et al exist so we have a good excuse to "need to delete this data boss".

wodenokoto 4 years ago |

I've never been in this situation, but I do wish you could query files with more advanced filters on these blob storage services.

- But why SageMaker?

- Why do some orgs choose to put almost everything in 1 buckets?

tyingq 4 years ago | |

>Why do some orgs choose to put almost everything in 1 buckets?

The article seems to be making the case it's because the delimiter makes it seem like there's a real hierarchy. So the ramifications of /bucket/1 /bucket/2 versus /bucket1/ /bucket2/ aren't well known until it's too late.

charcircuit 4 years ago | | |

>So the ramifications of /bucket/1 /bucket/2 versus /bucket1/ /bucket2/ aren't well known until it's too late.

What's the difference?

korostelevm 4 years ago | |

For many at orgs like this, SageMaker is probably the shortest path to an insane amount of compute with a python terminal.

Why single bucket? Once someone refers to a bucket as "the" bucket - it is how it will forever be.

akdor1154 4 years ago | |

> But why SageMaker?

You could ask the same thing of most times it gets used for ML stuff as well.

> Why do some orgs choose to put almost everything in 1 buckets?

Anecdote: ours does because we paid (Multinational Consulting Co)™ a couple of million to design our infra for us, and that's what the result was.

liveoneggs 4 years ago | |

1 athena?

2 some jobs make a lot of data

charcircuit 4 years ago |

Can someone explain what happened in the end? From my understanding nothing happened (they deprioritized the story for fixing it) and they are still blowing through the cloud budget.

snowwrestler 4 years ago | |

They didn’t resolve the issue.

There’s an important moment in the story, where they realize the fix will incur a one-time fee of $100,000. No one in engineering can sign off on that amount, and no one wants to try to explain it to non-technical execs.

They don’t explain why. But it’s probably because they expect a negative response like “how could you let this happen?!” or “I’m not going to pay that, find another way to fix it.”

In a lot of organizations it’s easier to live with a steadily growing recurring cost than a one-time fee… even if the total of the steady growth ends up much larger than the one-time fee!

It’s not necessarily pathological. Future costs will be paid from future revenue; whereas a big fee has to be paid from cash on-hand now.

But sometimes the calculation is not even attempted because of internal culture. When the decision is “keep your head down” instead of “what’s the best financial strategy,” that could hint at even bigger potential issues down the road.

hogrider 4 years ago | | |

Sounds more like non technical leadership sleeping at the wheel. I mean if they could just afford to lose money like this why bother with all that work to fix it?

seekayel 4 years ago | |

How I read the article, nothing happened. I think it is a cautionary tale of why you should probably bite the bullet and press the button instead of doing the "easier" thing which ends up being harder and more expensive in the end.

vdm 4 years ago |

DeleteObjects takes 1000 keys per call.

Lifecycle rules can filter by min/max object size. (since Nov 2021)
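A minimal sketch of both points, assuming a boto3-style S3 client is passed in. The batching helper respects the 1000-key limit, and the rule dict uses the size filter in the shape boto3's `put_bucket_lifecycle_configuration` accepts. The rule ID, the 30-day delay, and the 128 KiB threshold are illustrative choices, not recommendations.

```python
from itertools import islice

MAX_KEYS_PER_DELETE = 1000  # DeleteObjects limit mentioned above

def batches(keys, size=MAX_KEYS_PER_DELETE):
    """Yield lists of at most `size` keys from any iterable."""
    iterator = iter(keys)
    while chunk := list(islice(iterator, size)):
        yield chunk

def delete_keys(s3_client, bucket, keys):
    """Delete keys in batches and return the per-key errors the service reported."""
    errors = []
    for chunk in batches(keys):
        response = s3_client.delete_objects(
            Bucket=bucket,
            Delete={"Objects": [{"Key": key} for key in chunk], "Quiet": True},
        )
        errors.extend(response.get("Errors", []))
    return errors

# Lifecycle rule that only transitions objects above a size threshold (bytes).
SIZE_FILTERED_RULE = {
    "ID": "transition-large-objects",            # hypothetical rule name
    "Status": "Enabled",
    "Filter": {"ObjectSizeGreaterThan": 128 * 1024},
    "Transitions": [{"Days": 30, "StorageClass": "STANDARD_IA"}],
}
# Would be applied with something like:
# s3_client.put_bucket_lifecycle_configuration(
#     Bucket="my-bucket", LifecycleConfiguration={"Rules": [SIZE_FILTERED_RULE]})
```

With `Quiet` set, the response only lists failures, which keeps the result small when deleting millions of keys.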

electroly 4 years ago | |

Thank you for mentioning that lifecycle rule change. I must have missed the announcement; that is exactly the functionality I needed.

vdm 4 years ago | |

Athena supports regexp_like(). By loading in an S3 inventory this can match what a wildcard would. Then a Batch Operations job can tag the result.

Not easy, but is possible and effective.
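As a rough local analogue of that approach, the sketch below filters inventory-style rows with a regular expression standing in for a wildcard. The row layout (bucket, key, size), the sample keys, and the pattern are assumptions for illustration; in Athena the same test would be a `regexp_like(key, ...)` predicate over the inventory table.

```python
import re

def matching_keys(inventory_rows, pattern):
    """Return keys from (bucket, key, size) rows whose key matches the regex."""
    regex = re.compile(pattern)
    return [key for _bucket, key, _size in inventory_rows if regex.search(key)]

# Hypothetical inventory rows.
rows = [
    ("my-bucket", "builds/pr-101/artifacts/out.zip", 1_000),
    ("my-bucket", "builds/pr-102/artifacts/out.zip", 2_000),
    ("my-bucket", "builds/pr-102/logs/build.log", 300),
    ("my-bucket", "prod/artifacts/out.zip", 5_000),
]

# Regex equivalent of the wildcard builds/*/artifacts/*.zip
pattern = r"^builds/[^/]+/artifacts/[^/]+\.zip$"
print(matching_keys(rows, pattern))
```

The matched keys are what a follow-up job (tagging, deleting) would be fed.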

Mave83 4 years ago |

Just avoid the cloud. You can get Ceph storage with the performance of Amazon S3 at the price point of Amazon S3 Glacier, deployed in any datacenter worldwide if you want. There are companies that help you do this.

Feel free to ask if you need help.

charcircuit 4 years ago | |

You have to properly administrate those servers else you'll lose all your files and everything will be inaccessible.

klysm 4 years ago | | |

Administrating CEPH is unfortunately hard.

red0point 4 years ago | |

I want to know what the absolute cheapest way of doing this is, without having a lot of CapEx. I thought of renting dedicated storage servers (e.g. Hetzner) and slapping Ceph on them.

Do you have another, better, idea?

valar_m 4 years ago |

Though it doesn't address the problem in TFA, I recommend setting up billing alerts in AWS. Doesn't solve their issue, but they would have at least known about it sooner.

0x002A 4 years ago |

Each time a developer does something on a cloud platform, that moment the platform might start to profit for two reasons: vendor lock-in and accrued costs in the long term regardless of the unit cost.

Anything limitless/easiest has a higher hidden cost attached.

StratusBen 4 years ago |

On this topic, it's always surprising to me how few people even seem to know about different storage classes on S3...or even intelligent tiering (which I know carries a cost to it, but allows AWS to manage some of this on your behalf which can be helpful for certain use-cases and teams).

We did an analysis of S3 storage levels by profiling 25,000 random S3 buckets a while back for a comparison of Amazon S3 and R2* and nearly 70% of storage in S3 was StandardStorage which just seems crazy high to me.

* https://www.vantage.sh/blog/the-opportunity-for-cloudflare-r...

blurker 4 years ago | |

I think that it's not just people not knowing about the lifecycle feature, but also that when they start putting data into a bucket they don't know what the lifecycle should be yet. Honestly I think overdoing lifecycle policies is a potentially bigger foot gun than not setting them. If you misuse glacier storage that will really cost you big $$$ quickly! And who wants to be the dev who deleted a bunch of data they shouldn't have?

Lifecycle policies are simple in concept, but it's actually not simple to decide what they should be in many cases.

kondro 4 years ago |

The minimum size of objects in cheaper storage types is 128KiB.

Given the article quotes $100k to run an inventory (and $100k/month in standard storage) it's likely most of your objects are smaller than 128KiB and so probably wouldn't benefit from cheaper storage options (although it's possible this is right on the cusp of the 128KiB limit and could go either way).
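To illustrate why small objects may not benefit, here is a sketch that bills an object at no less than the 128KiB minimum in the cheaper class and compares that with its real size at the standard price. The helper names are made up, and the prices in the usage lines are placeholders, not quoted AWS rates.

```python
KIB = 1024
GIB = 2**30
MIN_BILLABLE_BYTES = 128 * KIB  # minimum billable size from the comment above

def billed_bytes(size_bytes, minimum=MIN_BILLABLE_BYTES):
    """Size the object is charged for when a minimum billable size applies."""
    return max(size_bytes, minimum)

def monthly_cost(size_bytes, price_per_gib_month, minimum=0):
    """Monthly storage cost for one object, honouring an optional minimum size."""
    return billed_bytes(size_bytes, minimum) / GIB * price_per_gib_month

def cheaper_in_minimum_size_class(size_bytes, standard_price, cheaper_class_price):
    """True if the cheaper class still wins once its minimum billable size applies."""
    in_cheaper_class = monthly_cost(size_bytes, cheaper_class_price, MIN_BILLABLE_BYTES)
    in_standard = monthly_cost(size_bytes, standard_price)
    return in_cheaper_class < in_standard

# Placeholder prices per GiB-month: 0.02 for standard, 0.01 for the cheaper class.
print(cheaper_in_minimum_size_class(10 * KIB, 0.02, 0.01))   # tiny object
print(cheaper_in_minimum_size_class(100 * KIB, 0.02, 0.01))  # closer to the minimum
```

Under these placeholder prices the crossover sits at half the minimum billable size; transition fees and minimum storage durations would push it higher.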

Honestly, if you have a $1.2m/year storage bill in S3 this would be the time to contact your account manager and try to work out what could be done to improve this. You probably shouldn't be paying list anyway if just the S3 component of your bill is $1.2m/year.

dekhn 4 years ago |

I had to chuckle at this article because it reminded me of some of the things I've had to do to clean up data.

One time I had to write a special mapreduce that did a multiple-step map to convert my (deeply nested) directory tree into roughly equally sized partitions (a serial directory listing would have taken too long, and the tree was too unbalanced to partition in one step), then did a second mapreduce to map-delete all the files and reduce the errors down to a report file for later cleanup. This meant we could delete a few hundred terabytes across millions of files in 24 hours, which was a victory.

cyanic 4 years ago |

We solved the problem of deleting old files early in our development process, as we wanted to avoid situations such as this one.

While developing GitFront, we were using S3 to store individual files from git repositories as single objects. Each of our users was able to have multiple repositories with thousands of files, and they needed to be able to delete them.

To solve the issue, we implemented a system for storing multiple files inside a single object and a proxy which allows accessing individual files transparently. Deleting a whole repository is now just a single request to S3.
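The general technique can be sketched as follows: concatenate the files into one blob, keep an index of byte offsets, and serve an individual file by reading only its byte range. This is an illustration of the idea under simple assumptions (in-memory data, an index stored separately), not a description of GitFront's actual implementation; all names are made up.

```python
def pack(files):
    """Concatenate files into one blob and record (offset, length) per name."""
    blob = bytearray()
    index = {}
    for name, data in files.items():
        index[name] = (len(blob), len(data))
        blob.extend(data)
    return bytes(blob), index

def read_file(blob, index, name):
    """Return one file's bytes from the packed blob."""
    offset, length = index[name]
    return blob[offset:offset + length]

def range_header(index, name):
    """HTTP Range value a proxy could send to fetch only this file."""
    offset, length = index[name]
    if length == 0:
        raise ValueError("empty files have no byte range to request")
    return f"bytes={offset}-{offset + length - 1}"

# Hypothetical repository contents.
blob, index = pack({"README.md": b"hello", "src/main.py": b"print('hi')"})
print(read_file(blob, index, "src/main.py"))
print(range_header(index, "src/main.py"))
```

Against S3, the proxy would pass that value as the `Range` parameter of a GetObject request, and deleting the repository means deleting the one packed object (plus its index).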

jopsen 4 years ago |

One of the biggest pains is that cloud services rarely mention what they don't do.

I think it's really sad, because when I don't see docs clearly stating the limits, I assume the worst and avoid the service.

gfd 4 years ago |

Does anyone have recommendations on how to compress the data (gzip or parquet)?

zitterbewegung 4 years ago |

I was at a presentation where HERE Technologies told us that they went from being in the top ten (or top five) S3 users (by data stored) to getting off of that list. This was seen as a big deal obviously.

solatic 4 years ago |

TL-DR: Object stores are not databases. Don't treat them like one.

throwaway984393 4 years ago | |

Try telling that to developers; they love using S3 as both a database and a filesystem. It's gotten to the point where we need a training for new devs to tell them what not to do in the cloud.

mst 4 years ago | | |

Honestly a Frequently Delivered Answers training for new developers is probably one of the best things you can include in onboarding.

Every environment has its footguns, after all.

hinkley 4 years ago | | |

Communicating through the filesystem is one of the Classic Blunders.

It doesn't come up as often anymore since we generally have so many options at our fingertips, but when push comes to shove you will still discover this idea rattling around in people's skulls.

solatic 4 years ago | | |

You can either train them with a calm tutorial or you can train them with angry billing alerts and shared-pain ex-post-facto muckraking.

I, for one, prefer the calm way.

Quarrelsome 4 years ago | | |

do you know if such sources exist publicly? I would be most interested in perusing recommended material on the subject.

prima-facie 4 years ago | |

They're also _not_ classic hierarchical filesystems, but k-v stores with extras.

hughrr 4 years ago |

For every $100k bill there’s a hundred of us with 14TB that costs SFA to roll with.

harshaw 4 years ago |

AWS Budgets is a tool for cost containment (among other external services).

gnutrino 4 years ago |

Lol this post hits close to home.

gtirloni 4 years ago |

A "TLDR" that is not.

    sokoloff@ Downloads % aws s3 ls s3://foo-asdf
                               PRE bar-folder/
                               PRE baz-folder/
    2022-02-17 09:25:38          0 bar-file-1.txt
    2022-02-17 09:25:42          0 bar-file-2.txt
    2022-02-17 09:25:57          0 baz-file-1.txt
    2022-02-17 09:25:49          0 baz-file-2.txt
    sokoloff@ Downloads % aws s3 ls s3://foo-asdf/ba
                               PRE bar-folder/
                               PRE baz-folder/
    2022-02-17 09:25:38          0 bar-file-1.txt
    2022-02-17 09:25:42          0 bar-file-2.txt
    2022-02-17 09:25:57          0 baz-file-1.txt
    2022-02-17 09:25:49          0 baz-file-2.txt
    sokoloff@ Downloads % aws s3 ls s3://foo-asdf/bar
                               PRE bar-folder/
    2022-02-17 09:25:38          0 bar-file-1.txt
    2022-02-17 09:25:42          0 bar-file-2.txt
    sokoloff@ Downloads % aws s3 ls s3://foo-asdf/bar-folder
                               PRE bar-folder/