Symptoms
Consider the following scenario:
-
You install the Failover Clustering feature on a computer that is running Windows Server 2008 R2.
-
On this computer, you run some backup applications that use Volume Shadow Copy Service (VSS) in parallel.
In this scenario, the Cluster service stops responding.
Cause
This issue occurs because of a race condition between the calls of the VSS writer on the cluster. If an OnBackupShutdown method call occurs between an OnFreeze method call and an OnThaw method call, the lock on the cluster hive is not released. Therefore, a deadlock occurs, and the Cluster service stops responding.
Resolution
Hotfix information
A supported hotfix is available from Microsoft. However, this hotfix is intended to correct only the problem that is described in this article. Apply this hotfix only to systems that are experiencing the problem described in this article. This hotfix might receive additional testing. Therefore, if you are not severely affected by this problem, we recommend that you wait for the next software update that contains this hotfix.
If the hotfix is available for download, there is a "Hotfix download available" section at the top of this Knowledge Base article. If this section does not appear, contact Microsoft Customer Service and Support to obtain the hotfix. Note If additional issues occur or if any troubleshooting is required, you might have to create a separate service request. The usual support costs will apply to additional support questions and issues that do not qualify for this specific hotfix. For a complete list of Microsoft Customer Service and Support telephone numbers or to create a separate service request, visit the following Microsoft website:http://support.microsoft.com/contactus/?ws=supportNote The "Hotfix download available" form displays the languages for which the hotfix is available. If you do not see your language, it is because a hotfix is not available for that language.
Prerequisites
To apply this hotfix, you must be running Windows Server 2008 R2. Additionally, you must have the Failover Clustering feature installed.
Registry information
To use the hotfix in this package, you do not have to make any changes to the registry.
Restart requirement
You may have to restart the computer after you apply this hotfix.
Hotfix replacement information
This hotfix does not replace a previously released hotfix.
File information
The global version of this hotfix installs files that have the attributes that are listed in the following tables. The dates and the times for these files are listed in Coordinated Universal Time (UTC). The dates and the times for these files on your local computer are displayed in your local time together with your current daylight saving time (DST) bias. Additionally, the dates and the times may change when you perform certain operations on the files.
Windows Server 2008 R2 file information notes
Important Windows 7 hotfixes and Windows Server 2008 R2 hotfixes are included in the same packages. However, hotfixes on the Hotfix Request page are listed under both operating systems. To request the hotfix package that applies to one or both operating systems, select the hotfix that is listed under "Windows 7/Windows Server 2008 R2" on the page. Always refer to the "Applies To" section in articles to determine the actual operating system that each hotfix applies to.
-
The MANIFEST files (.manifest) and the MUM files (.mum) that are installed for each environment are listed separately in the "Additional file information for Windows Server 2008 R2" section. MUM and MANIFEST files, and the associated security catalog (.cat) files, are extremely important to maintaining the state of the updated component. The security catalog files, for which the attributes are not listed, are signed with a Microsoft digital signature.
For all supported x64-based versions of Windows Server 2008 R2
File name |
File version |
File size |
Date |
Time |
Platform |
---|---|---|---|---|---|
Clussvc.exe |
6.1.7600.20760 |
4,583,936 |
20-Jul-2010 |
05:28 |
x64 |
For all supported IA-64-based versions of Windows Server 2008 R2
File name |
File version |
File size |
Date |
Time |
Platform |
---|---|---|---|---|---|
Clussvc.exe |
6.1.7600.20760 |
7,706,112 |
20-Jul-2010 |
04:23 |
IA-64 |
Status
Microsoft has confirmed that this is a problem in the Microsoft products that are listed in the "Applies to" section.
More Information
For more information, view the cluster log in the call stack information section.
For more information about VSS, visit the following Microsoft website:General information about VSSFor more information about the CVssWriter class, visit the following Microsoft website:
General information about the "CVssWriter"classFor more information, click the following article number to view the article in the Microsoft Knowledge Base:
980253 The Cluster service stops responding in Windows Server 2008 if you run some backup applications in parallelFor more information about software update terminology, click the following article number to view the article in the Microsoft Knowledge Base:
824684 Description of the standard terminology that is used to describe Microsoft software updates
Additional file information
Additional file information for Windows Server 2008 R2
Additional files for all supported x64-based versions of Windows Server 2008 R2
File name |
Amd64_a5459988b1efc49eb308562558659abd_31bf3856ad364e35_6.1.7600.20760_none_b29c91cd59243e57.manifest |
File version |
Not applicable |
File size |
715 |
Date (UTC) |
20-Jul-2010 |
Time (UTC) |
07:42 |
Platform |
Not applicable |
File name |
Amd64_dc9233eda6ab1ef2ecbef13d0015f3f4_31bf3856ad364e35_6.1.7600.20760_none_dd0844c70a5990a7.manifest |
File version |
Not applicable |
File size |
715 |
Date (UTC) |
20-Jul-2010 |
Time (UTC) |
07:42 |
Platform |
Not applicable |
File name |
Amd64_microsoft-windows-f..overcluster-clussvc_31bf3856ad364e35_6.1.7600.20760_none_1650dbd7d44eb4f1.manifest |
File version |
Not applicable |
File size |
7,438 |
Date (UTC) |
20-Jul-2010 |
Time (UTC) |
06:42 |
Platform |
Not applicable |
File name |
Update.mum |
File version |
Not applicable |
File size |
1,893 |
Date (UTC) |
20-Jul-2010 |
Time (UTC) |
07:42 |
Platform |
Not applicable |
File name |
Wow64_microsoft-windows-f..overcluster-clussvc_31bf3856ad364e35_6.1.7600.20760_none_20a5862a08af76ec.manifest |
File version |
Not applicable |
File size |
4,604 |
Date (UTC) |
20-Jul-2010 |
Time (UTC) |
05:24 |
Platform |
Not applicable |
Additional files for all supported IA-64-based versions of Windows Server 2008 R2
File name |
Ia64_9d61a1d07584e650bf13f850ded8bb29_31bf3856ad364e35_6.1.7600.20760_none_3a85c5c3a952785b.manifest |
File version |
Not applicable |
File size |
1,070 |
Date (UTC) |
20-Jul-2010 |
Time (UTC) |
07:41 |
Platform |
Not applicable |
File name |
Ia64_microsoft-windows-f..overcluster-clussvc_31bf3856ad364e35_6.1.7600.20760_none_ba33e44a1bef4cb7.manifest |
File version |
Not applicable |
File size |
7,436 |
Date (UTC) |
20-Jul-2010 |
Time (UTC) |
06:30 |
Platform |
Not applicable |
File name |
Update.mum |
File version |
Not applicable |
File size |
1,463 |
Date (UTC) |
20-Jul-2010 |
Time (UTC) |
07:41 |
Platform |
Not applicable |
File name |
Wow64_microsoft-windows-f..overcluster-clussvc_31bf3856ad364e35_6.1.7600.20760_none_20a5862a08af76ec.manifest |
File version |
Not applicable |
File size |
4,604 |
Date (UTC) |
20-Jul-2010 |
Time (UTC) |
05:24 |
Platform |
Not applicable |
Call stack information
The following Cluster log entries indicate that the lock is obtained in the OnFreeze method path, and that the OnBackupShutdown and OnThaw methods are called: 00000e04.00002714::2010/01/22-12:20:50.808 INFO [VSS] OnPrepareBackup returning - true
00000e04.00002714::2010/01/22-12:20:56.122 INFO [VSS] HandleBackupGum - Initiating the backup 00000e04.00002714::2010/01/22-12:20:56.122 INFO [VSS] HandleOnFreezeGum - Stopping the Death Timer 00000e04.00002714::2010/01/22-12:20:56.122 INFO [VSS] HandleBackupGum - Acquiring lock and flushing DB 00000e04.00002714::2010/01/22-12:20:56.122 INFO [VSS] HandleBackupGum - Completed the backup Request 00000e04.00002714::2010/01/22-12:20:56.124 INFO [VSS] OnFreeze returning true 00000e04.00002714::2010/01/22-12:20:57.236 INFO [VSS] OnBackupShutdown - Snap Shot Id = 350a310a-ef17-45a2-89bf-5d678b9a32ff [VSS] OnThaw returning false 00000e04.00002714::2010/01/22-12:20:57.236 INFO Current request is Aborted - Context4194304[VSS] OnThaw returning false