Experience level

Learner

Session Track

DevOps

Enhancing storage efficiency in GlusterFS with YADL

Data deduplication is an effective way to improve storage efficiency in heterogeneous data centers, so a single, truly software-defined solution that serves different data streams (file, object, and block) is highly desirable. YADL ("Yet Another Dedupe Library") attempts to provide this as a Linux user-space library for file, block, and object storage systems. In this presentation we will discuss and demonstrate the current design, approaches, and algorithms YADL uses for deduplication with optimal performance and storage efficiency. We will also discuss use cases with GlusterFS, Ceph, and OpenStack Swift, as well as the future of YADL: Distributed Dedupe and YADL Shells.