Global Data Deduplication

Backup deduplication is a method for reducing backup size by excluding duplicate data blocks from the backup. In any given organization, VMs contain duplicates of data, such as VMs deployed from the same template, VMs with the same OS, and VMs that have some (semi) identical files, such as database entries. Block-level data deduplication enables you to reduce backup size by saving only unique data blocks to the Backup Repository while replacing duplicated blocks with references to existing ones.

NAKIVO Backup & Replication automatically deduplicates all backups in a given forever-incremental Backup Repository if this feature is enabled. This means that all data blocks are taken into account by backup deduplication, even if you back up your VMware VMs, Hyper-V VMs, and Amazon EC2 instances to the same Backup Repository. Global deduplication can be enabled during Backup Repository creation process. You can also use hardware-based data deduplication device such as an EMC Data Domain instead of enabling it for the repository.


The backup deduplication method can be enabled/disabled during the Backup Repository creation process. For details, refer to one of the Creating Backup Repositories sections.

VM backup deduplication can provide a 10X to 30X reduction in storage capacity requirements. For example, you have 10 VMs running Windows Server 2016, which occupies 10 GB each. While the total amount of data is 100 GB, only one copy of OS data (10 GB) will be written to a backup repository with data deduplication, which provides 10 to 1 storage space savings.

More efficient disk space utilization allows for storing more recovery points per VM backup. In addition, lower storage space requirements save money on direct storage costs (as fewer disks are needed to store the same amount of information) and on related costs (such as cooling, electricity, and maintenance).