

Instead, you will end up wasting memory that ZFS could otherwise utilize for caching and other important tasks.įor deduplication to work, ZFS must keep track of the data blocks stored on your ZFS pool/filesystem. If the data you’re storing on your ZFS pool/filesystem does not have much redundancy (almost unique), then deduplication won’t do you any good. Problems of ZFS Deduplication:ĭeduplication is a very nice feature and it saves a lot of disk space of your ZFS pool/filesystem if the data you’re storing on your ZFS pool/filesystem is redundant (similar file is stored multiple times) in nature. The higher the deduplication ratio (DEDUP) of the data you’ve stored on your ZFS pool/filesystem, the more disk space you’re saving with deduplication. So, deduplication saved about 80% (1-1/DEDUP) of the available disk space of the ZFS pool pool1. The deduplication ratio ( DEDUP) is about 5 ( 5.01x). You can skip this section if you already have a ZFS pool for testing deduplication.īut 5 copies of the same file only use about 739 MB of disk space from the physical storage device. To experiment with ZFS deduplication, I will create a new ZFS pool using the vdb and vdc storage devices in a mirror configuration. Disabling Deduplication on ZFS Pools/Filesystems.Enabling Deduplication on ZFS Filesystems.
#Openzfs deduplication how to
In this article, I am going to show you how to enable deduplication on your ZFS pools/filesystems.

So, even if parts of the file are matched, the deduplication feature can save up disk spaces of your ZFS pool/filesystem. Technically, when you copy/move/create new files on your ZFS pool/filesystem, ZFS will divide them into chunks and compare these chunks with existing chunks (of the files) stored on the ZFS pool/filesystem to see if it found any matches. This will save a lot of disk space on your ZFS pool/filesystem. The rest of them will be a reference to that copy of the file. Simply put, if you store a lot of files on your ZFS pool/filesystem, and some of these files are the same, only one copy of these files would be kept on the ZFS pool/filesystem. The deduplication feature of the ZFS filesystem is a way of removing redundant data from ZFS pools/filesystems.
