This article discusses hotfixes and registry keys that affect Microsoft System Center Operations Manager 2007 (OpsMgr) Agent Health. Be aware that the problems these hotfixes and registry keys address may not be directly noticeable in your environment, but they might be causing issues of which you are not aware.
This article is for Operations Manager 2007 SP1 and Operations Manager 2007 R2. For Operations Manager 2012, Operations Manager 2012 SP1, and Operations Manager 2012 R2 see the following article:
Make sure that your agents, management servers, and gateways are running the latest Cumulative Update (CU). At the time that this article was written, the latest CU version was CU6. To download Cumulative Update 6 for System Center Operations Manager 2007 R2 (KB 2626076), go to the following Microsoft website:
You do not receive event notifications after you back up and then clear event logs in Windows Server 2008 or in Windows Vista. Therefore, OpsMgr is prevented from providing a warning for events after the event logs are cleared.
For Operations Manager, this causes a high number of handles.
The CPU usage of an application or a service that uses MSXML 6.0 to handle XML requests reaches 100% in Windows Server 2008, Windows Vista, Windows XP Service Pack 3, or other systems that have MSXML 6.0 installed:
FIX: You receive a "Provider Load Failure" error message or the Wmiprvse.exe process stops responding when you use a SQL Server WMI provider to obtain information about SQL Server 2005, SQL Server 2008, or SQL Server 2008 R2 services:
Extended Protection for Authentication: This feature enhances the protection and handling of credentials when authenticating network connections by using Integrated Windows Authentication (IWA). In Operations Manager, errors may occur during a client push installation. In this case, a network trace would contain the following:
Server Error, (91) Invalid user identifier
For more information, view the following article in the Microsoft Knowledge Base:
A memory leak in IIS was found while stress-testing the OpsMgr IIS discovery. This QFE is important for anyone who uses OpsMgr with the IIS management pack (MP), and even more important when the APM feature is used in Operations Manager 2012 (OM12). For more information, see "FIX: Memory leak in Rhs.exe after you configure the IIS 7.5 W3SVC service in a Windows Server 2008 R2 SP1 failover cluster":
If you experience a problem with the configuration of this registry value, you see the following event:
Event Type: Error
Event Source: ESE
Event Category: Transaction Manager
Event ID: 623
Description: HealthService (<PID>) The version store for instance <instance> ("<name>") has reached its maximum size of <value>Mb. It is likely that a long-running transaction is preventing cleanup of the version store and causing it to build up in size. Updates will be rejected until the long-running transaction has been completely committed or rolled back.
Possible long-running transaction:
SessionId: <value>
Session-context: <value>
Session-context ThreadId: <value>.
Cleanup:<value>
Note This event may report the issue with other Operations Manager processes, depending on the affected role.
We recommend that you set the version store size to double its default size. For example, if you set the version store size on a computer that hosts a Management Server role, set the registry value to 10240 (decimal).
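The doubling recommendation above is simple arithmetic, sketched below in Python. The default of 5120 for a Management Server is an assumption inferred from the example (10240 is stated to be double the default). The function name is hypothetical, and the sketch does not read or write the registry.

```python
def recommended_version_store_size(default_value: int) -> int:
    """Return double the default version store size, per the recommendation above."""
    if default_value <= 0:
        raise ValueError("default_value must be a positive integer")
    return default_value * 2


# Assumption: inferred from the article's example, where 10240 (decimal)
# is double the default on a computer that hosts a Management Server role.
ASSUMED_MANAGEMENT_SERVER_DEFAULT = 5120

if __name__ == "__main__":
    # Prints the decimal value to enter for the registry value.
    print(recommended_version_store_size(ASSUMED_MANAGEMENT_SERVER_DEFAULT))
```

For other roles, confirm the actual default before doubling it, because the default may differ from the Management Server value assumed here.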
If you experience an issue with this registry value, you see the following event:
Event Source: HealthService
Event Category: Health Service
Event ID: 2015
User: N/A
Description: A workflow in the Health Service has generated a message which exceeds the size limit, and has been discarded.
Note An increase in the limit can adversely affect Management group performance, because it causes a large amount of discovery data to be collected by the Management server and by the Root Management server.
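When you review exported event log entries, the two events quoted in this article can be recognized by their source and event ID. The following Python sketch shows one possible way to do this. The function name, the summary strings, and the idea of passing the source and ID as separate values are illustrative assumptions and are not part of Operations Manager.

```python
from typing import Optional

# Event source and event ID pairs taken from the events quoted in this article.
# The summary strings are paraphrases written for this sketch.
KNOWN_EVENTS = {
    ("ESE", 623): "Version store has reached its maximum size",
    ("HealthService", 2015): "Workflow message exceeded the size limit and was discarded",
}


def classify_event(source: str, event_id: int) -> Optional[str]:
    """Return a short summary if the event matches one described here, else None."""
    return KNOWN_EVENTS.get((source, event_id))


if __name__ == "__main__":
    # Hypothetical sample entries, for illustration only.
    for source, event_id in [("ESE", 623), ("HealthService", 2015), ("ESE", 100)]:
        print(source, event_id, classify_event(source, event_id))
```

A lookup keyed on both source and ID avoids matching unrelated events that happen to share the same event ID.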