Article ID: 892422 - Last Review: November 27, 2007 - Revision: 6.6 Overview of event ID 1123 and event ID 1122 logging in Windows 2000-based and Windows Server 2003-based server clustersOn This PageSUMMARYThis article discusses communications behavior that occurs between nodes that are in a cluster. Additionally, this article discusses why event IDs 1123 and 1122 may be logged in the System log of Microsoft Windows Server 2003-based and Microsoft Windows 2000-based server clusters. INTRODUCTIONCluster nodes periodically communicate with each other on a
network that is designated for cluster use. Each node has access to the
following communication types:
For intra-node communications, cluster nodes communicate over User Datagram Protocol (UDP) port 3343. Each node in the cluster periodically exchanges sequenced, unicast UDP datagrams with every other node in the cluster. The purpose of this exchange is to determine whether all nodes are running correctly and to monitor the health of network links. The cluster network driver (Clusnet.sys) manages cluster communications. The cluster network driver performs the following functions:
Event messages that are similar to the following are logged: Message 1 Event ID:
1123 Event ID:
1123 Event ID:
1122 Event ID:
1122 MORE INFORMATIONThe heartbeat processThe exchange of UDP datagrams between nodes in a cluster is known as the "heartbeat process". By default, heartbeats are sent every 1.2 seconds from each network interface for each node to each network interface for every other node that is in the cluster. In Windows Server 2003, multicast datagrams can be used to reduce the amount of heartbeat traffic that occurs between cluster nodes. By default, Windows Server 2003 uses multicast datagrams when three or more nodes are configured in a cluster.Event IDs 1123 and 1122Event ID 1123 indicates that node A in the cluster did not receive a heartbeat from node B in the cluster for two heartbeat intervals over a specified network interface. That means that node A did not receive a heartbeat from node B for 2.4 seconds.Event ID 1122 indicates that node A received a heartbeat from node B. This communication update is received after 2.4 seconds but before 4.8 seconds. Event ID 1122 is logged if communications are re-established over a network interface that was previously shut down. For example, event ID 1122 occurs when a node that was shut down rejoins the cluster. The regroup processAssume that node A does not receive an update from node B after six consecutive heartbeats over all network interfaces that are enabled for internal cluster communications. In this case, node B is assumed to be inactive. The cluster may perform a "regroup" process. During a regroup process, the cluster network driver on node A notifies the Membership Manager and the Node Manager that a failure has occurred. The Membership Manager and the Node Manager initiate a regroup operation that takes node B offline and removes it from active membership in the cluster. When this regroup process occurs, event ID 1126 is logged in the System log. Event ID 1135 may be subsequently logged in the System log. Event ID 1135 indicates that a node has been removed from active cluster membership. Messages that are similar to the following are logged:Message 1 Event ID:
1126 Event ID:
1135 Troubleshooting event IDs 1123 and 1122When event ID 1123 is followed by event ID 1122, you can generally ignore the events if the following conditions are true:
Message 1 Event ID:
1126 Event Id:
1127 Network adaptor teamingNetwork adaptor teaming can involve multi-port card or separate single-port PCI network adaptors.Note Network adaptor teaming is not supported on the cluster heartbeat network adaptor. The following articles discuss network adaptor teaming with Windows Clustering: 254101
(http://support.microsoft.com/kb/254101/
)
Network adaptor teaming and server clustering
276457
(http://support.microsoft.com/kb/276457/
)
Event success messages 4201 and 1122 using Windows Clustering
Network adaptor driver issuesNetwork adaptor drivers may be outdated or incorrect. Additionally, some drivers may not match the drivers on other nodes in the cluster.Network device failuresNetwork devices, such as switch ports or network adaptors, may not be working correctly. However, if all cluster networks log the same error message, a network device is unlikely to be the cause. If only one of the cluster networks logs event IDs 1123 and 1122, you may have one of the following problems:
Node resource issuesA node resource problem occurs because the Server service cannot keep up with incoming or outgoing network connections. The Server service cannot meet the demand for the network items that are queued by the network layer of the I/O stream. In this case, a Server service event, such as event ID 2022, may be logged in the System log. A message that is similar to the following is logged:Event ID:
2022 317249
(http://support.microsoft.com/kb/317249/
)
How to troubleshoot event ID 2021 and event ID 2022
DPC requests that occur in a cluster are typically
caused by the following sources:
814607
(http://support.microsoft.com/kb/814607/
)
Microsoft support for server clusters with 3rd party system components
Incorrect software on Windows-2000 based cluster nodesIn a Windows 2000-based cluster, all cluster nodes must be running Windows 2000 Service Pack 3 or a later version. If your Windows 2000-based computer logged event ID 2022, view the following articles in the Microsoft Knowledge Base to resolve this issue:317249
(http://support.microsoft.com/kb/317249/
)
How to troubleshoot event ID 2021 and event ID 2022
245080
(http://support.microsoft.com/kb/245080/
)
Receiving multiple instances of event ID 2022
If you still experience this issue, apply the hotfix
that is described in the following article in the Microsoft Knowledge Base:
830901
(http://support.microsoft.com/kb/830901/
)
Event ID 2022 is logged and your Windows 2000-based computer may stop responding
Incorrect registry settingsImportant This section, method, or task contains steps that tell you how to modify the registry. However, serious problems might occur if you modify the registry incorrectly. Therefore, make sure that you follow these steps carefully. For added protection, back up the registry before you modify it. Then, you can restore the registry if a problem occurs. For more information about how to back up and restore the registry, click the following article number to view the article in the Microsoft Knowledge Base:322756
(http://support.microsoft.com/kb/322756/
)
How to back up and restore the registry in Windows To resolve event messages in Windows 2000-based and Windows Server 2003-based clusters, you may have to make changes to the following registry subkey on each node: HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\lanmanserver\parameters Important In Windows 2000-based clusters, you must install hotfix 872790
before you make these changes.Add the following DWORD values to the registry subkey: Value Name: MaxRawWorkItems
To create the MaxWorkItems DWORD value, follow these steps:
Data Type: REG_DWORD Value data: 512 (decimal) Value Name: MaxFreeConnections Data Type: REG_DWORD Value data: 4096 (decimal) Value Name: MinFreeConnections Data Type: REG_DWORD Value data: 100 (decimal) Value Name: MaxWorkItems Data Type: REG_DWORD Value data: 6000 (decimal)
High kernel-mode CPU usageTo troubleshoot high kernel-mode CPU usage, use System Monitor to identify the problem. High kernel mode CPU usage may be caused by the following sources:
High CPU usage that is caused by SNMP agentsThird-party Simple Network Management Protocol (SNMP) agents that run in a cluster may periodically contact the NTFS file system on a shared cluster disk resource. The agents use the CreateFile function to contact NTFS. This behavior can cause significant CPU usage when the SNMP agent caches data on a specific volume.Multicast issuesMulticast issues may occur in a Windows Server 2003 cluster. To troubleshoot multicast issues, disable multicast support in the cluster. For more information about how to disable multicast, click the following article number to view the article in the Microsoft Knowledge Base:307962
(http://support.microsoft.com/kb/307962/
)
Multicast support enabled for the cluster heartbeat
REFERENCES For more information, click the following article numbers
to view the articles in the Microsoft Knowledge Base: 286342
(http://support.microsoft.com/kb/286342/
)
Network failure detection and recovery in Windows Server 2003 Clusters
242600
(http://support.microsoft.com/kb/242600/
)
Network failure detection and recovery in a two-node Windows Server 2000 cluster
258750
(http://support.microsoft.com/kb/258750/
)
Recommended private "Heartbeat" configuration on a cluster server
311081
(http://support.microsoft.com/kb/311081/
)
Troubleshooting multiple cluster symptoms on the same SAN
For more information about DPC, visit the following
MSDN Web site: http://technet2.microsoft.com/WindowsServer/en/Library/7b1c90d1-877e-4c55-ab8c-9c65101a503d1033.mspx
(http://technet2.microsoft.com/WindowsServer/en/Library/7b1c90d1-877e-4c55-ab8c-9c65101a503d1033.mspx)
For advanced information about DPC, see the following
Sysinternals Web site: http://technet.microsoft.com/en-us/sysinternals/bb963898.aspx
(http://technet.microsoft.com/en-us/sysinternals/bb963898.aspx)
Microsoft
provides third-party contact information to help you find technical support.
This contact information may change without notice. Microsoft does not
guarantee the accuracy of this third-party contact information.
APPLIES TO
| Article Translations
|
Back to the top
