![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||||
Solution Type Problem Resolution Sure Solution 1576683.1 : Windows BSOD WHEA_UNCORRECTABLE_ERROR(124) & correctable WHEA events when disabling network ports on X4170M2/X4270M2
In this Document
Applies to:Sun Fire X4270 M2 Server - Version Not Applicable to Not Applicable [Release N/A]Sun Fire X4170 M2 Server - Version Not Applicable to Not Applicable [Release N/A] Windows Server - Version 2008 x64 to 2008 x64 Windows Server - Version 2008 to 2008 Information in this document applies to any platform. SymptomsTo discuss this information further with Oracle experts and industry peers, we encourage you to review, join or start a discussion in the My Oracle Support Community - Sun x86 Systems
When disabling unconnected network ports in Windows 2008 / 2008R2 running on X4170M2 or X4270 M2 hardware you may see a number of correctable WHEA events being logged in the Windows System event logs. This may also lead to uncorrectable WHEA events under heavy network load which will trigger a Windows Blue Screen of Death (BSOD) crash of type WHEA_UNCORRECTABLE_ERROR(124) causing the system to unexpectedly reset. These symptoms are not always seen and may depend on what network ports have been enabled/disabled in windows. For example, we have seen the issue using the following configuration of the on-board network ports : Local Area Connection 1 - ENABLED AND ACTIVE WITH CABLE
Local Area Connection 2 - DISABLED WITH NO CABLE ATTACHED Local Area Connection 3 - DISABLED WITH NO CABLE ATTACHED Local Area Connection 4 - ENABLED AND ACTIVE WITH CABLE
When this is configured you may start to see a number of correctable WHEA events in the Windows system event log similar to this : Log Name: System
Source: Microsoft-Windows-WHEA-Logger Date: 07/03/2013 19:05:16 Event ID: 17 Task Category: None Level: Warning Keywords: User: LOCAL SERVICE Computer: hostname Description: A corrected hardware error has occurred. Component: PCI Express Root Port Error Source: Advanced Error Reporting (PCI Express) Bus:Device:Function: 0x0:0x0:0x0 Vendor ID:Device ID: 0x8086:0x3406 << !! Sometimes reported as 0x8086:0x3409 depending on the port involved Class Code: 0x30000
If heavy network load is placed on the active ports then an uncorrectable WHEA event is possible, causing a windows BSOD crash similar to this:
WHEA_UNCORRECTABLE_ERROR (124)
2: kd> !errrec fffffa8019d9f8d8
Note: This issue may also be seen when disabling network ports on Intel based PCIe network cards on these platforms also, not just the on-board network ports. CauseThe uncorrectable WHEA BSOD crash is caused by a fatal error on the PCIe bus generated by one of the Intel Kawela network ports. This fatal PCIe bus error is triggered by a firmware issue caused by a mis-match of the supported Max Payload Size.
SolutionFirstly this is NOT a hardware fault and no parts should be replaced as it will not resolve the issue.
This issue is fixed by using a specially created MPSTool to update the Intel NIC EEPROM. This tool can be found on the latest Tools & Drivers CD (SW 1.7.3 or later - Patchid 18489051) which can be downloaded from My Oracle Support patch downloads.
On the CD image the tool resides at the following location : tools_and_drivers_CD:/Windows/W2K8R2/Tools/64bit/Network/MPSTools
How to use this tool:
The following workarounds are also available which should prevent the correctable and uncorrectable WHEA events :
1. In Windows leave any unused networks ports enabled so that the adapter reports status as "Network cable unplugged" rather than DISABLED.
OR
2. Change the maximum payload size in the BIOS to 256 :
Press F2 during POST to enter the BIOS and then navigate to :
References<BUG:16507719> - WHEA_UNCORRECTABLE_ERROR(124) WHEN DISABLING NETWORK PORTS IN WIN2008R2 ON LYNX+Attachments This solution has no attachment |
||||||||||||||||||
|