Known issues after you enable data deduplication on CSV


Data deduplication in Windows Server 2012 R2 supports optimization of storage for Virtual Desktop Infrastructure (VDI) deployments and optimization of Cluster Shared Volumes (CSV). Data deduplication is supported on NTFS-formatted CSV and is not supported on Resilient File System (ReFS)-formatted CSV. For more information, see Extending Data Deduplication to new workloads in Windows Server 2012 R2.

This article describes some known issues that may occur after you enable data deduplication on CSV. 

Known Issues

Issue 1

The LastWriteTime property of a file is changed to the time when the file is processed by a data deduplication optimization job. Additionally, the archive bit of the file is reset when the data deduplication optimization job is finished.

This behavior does not affect production performance or limit access to the files that are stored on the CSV. However, this behavior may affect some backup applications that use the archive bit or the LastWriteTime property to detect incremental changes of files. For example, when the file properties are changed by a data deduplication optimization job, the backup application may be triggered to back up the files again.

Issue 2

When you use the Update-DedupStatus cmdlet to query a data deduplication job status on a CSV volume from a passive (non-coordinator) cluster node, you receive an error that resembles the following:
Update-DedupStatus : MSFT_DedupVolumeStatus.Volume='<CSV volume path>' - HRESULT 0x80565364, 0x80565304, 0x8056536B
Additionally, you receive one of the following error messages:
Data deduplication cannot run this job on this CSV volume on this node. Try running the job on the CSV volume resource owner node.

Data deduplication cannot run this cmdlet on this CSV volume on this node. Try running the cmdlet on the CSV volume resource owner node.

This behavior is expected because the job status can be queried only from the coordinator node. To obtain the status of the data deduplication job, log on to the coordinator node, and then run the Update-DedupStatus cmdlet.

More Information

Data deduplication was introduced in Windows Server 2012. Enabling data deduplication reduces the number of duplicate blocks of data in the storage so that more data can be stored. Data deduplication is highly scalable, resource efficient, and nonintrusive and can run on dozens of large volumes of primary data at the same time and yet have only a minimal effect on the server workload. For more information, see Data Deduplication Overview and About Data Deduplication.

Article ID: 2906888 - Last Review: Nov 19, 2013 - Revision: 1