Name: Big Storage, Small Budget: A Survey of Methods for Beating the Costs of File Storage
Start: 2024-05-22T14:40:00-0600
End: 2024-05-22T15:10:00-0600

May 21-23, Wolf Law School, Boulder CO

Big Storage, Small Budget: A Survey of Methods for Beating the Costs of File Storage

This session is a crash course on all of the different methods and architectures for beating the cost of file-based storage with some commentary on the advantages and pitfalls of each approach. The obvious solutions involve moving hot and cold files between storage tiers and/or deploying novel technologies for compression and/or deduplication, but, as with everything, the devil is in the details.

How do you know what data to move and where to move it to?
How do you actually move the data?
What are different approaches for applications and users to reconcile path names when files move?
How do you separate junk from data of lasting value?
Where are the trade-offs between costs, complexity, and vendor lock-in?
The lecturer is the CTO of Cambridge Computer and the founder of Starfish Storage, but he will only touch on Starfish's role in orchestrating storage tiering and data life cycle management. Instead, the focus will be on a whole bunch of technologies (some obvious and some not so obvious) that purport to beat the cost of storage.

Topics Covered:
Dedupe and compression
Hard drives: PMR, HMMR, SMR
Flash drives: SLC, MLC, TLC, QLC
Tape (yes, it still exists)
File stubs and symlinks
Tiering to object storage
Virtual name spaces
Inline (symmetric) v. Out of band (asymmetric) name spaces
Leveraging the users with charge-back, show-back, shame-back, and quotas
Data catalogs with integrated data movers

Wednesday May 22, 2024 2:40pm - 3:10pm MDT
Room 205

Technical Presentation

Feedback form isn't open yet.

RMACC HPC Symposium 2024

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!