Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
Solution Type: Problem Resolution Sure Solution

1017926.1: Sun Fire[TM] 3800-6800: Troubleshooting NCPQ_TO errors
Previously Published As: 229185

Applies to:
Sun Fire 3800 Server - Version Not Applicable and later
Sun Fire 4800 Server - Version Not Applicable and later
Sun Fire 6800 Server - Version Not Applicable and later
All Platforms

Symptoms

Description: This document aids in troubleshooting Non-Cacheable Pending Queue Time Outs (NCPQ_TO) on Sun Fire 3800/4800/4810/6800 systems. An NCPQ_TO occurs when a data request in non-cacheable address space does not complete its transaction. Non-cacheable address space comprises the Safari device configuration space and the I/O address space.

Symptoms: Error messages indicating that an NCPQ_TO occurred are seen on the Domain Console. The error messages are also stored in the Domain Console Buffer and can be retrieved with the Sun Fire System Controller (SC) command showlogs. If a loghost is configured, the error messages are also stored on the loghost. NCPQ_TOs can occur during normal operation of the Domain or during POST.
Feb 26 10:46:02 systemx DomC.SC: ErrorMonitor:Domain C has a SYSTEM ERROR
Feb 26 10:46:02 systemx DomC.SC: /N0/SB1 encountered the first error
Feb 26 10:46:02 systemx DomC.SC: RepeaterSbbcAsic reported first error on /N0/SB1
Feb 26 10:46:02 systemx DomC.SC: /partition1/domain0/SB1/bbcGroup0/sbbc0:
FE [15:15] : 0x1
ErrSum [31:31] : 0x1
SafErr [09:08] : 0x1 Fireplane device asserted an error
Feb 26 12:20:47 systemx DomC.SC: /partition1/domain0/SB1/bbcGroup0/cpuAB/cpusafariagent0:
AFAR (high)[0x531] : 0x0000063c
AFAR [42:32] [10:00] : 0x63c
AFAR (low)[0x541] : 0xff800000
AFAR_2 (high)[0x571] : 0x0000063c <
AFAR_2 [42:32] [10:00] : 0x63c
AFAR_2 (low)[0x581] : 0xff800000 <
AFSR (high)[0x551] : 0x00080000
PERR [19:19] : 0x1
AFSR_2 (high)[0x591] : 0x00080000
PERR [19:19] : 0x1
EMU B[0x511] : 0x03000000
AID_LK [24:24] : 0x1
NCPQ_TO [25:25] : 0x1

Cause

Interpretation of the example error above: A system error is detected and Domain C is PAUSED. The device path in the error messages shows that the error was detected on SB1 CPU A:
/partition1/domain0/SB1/bbcGroup0/cpuAB/cpusafariagent0
Non-Cacheable Schizo Device Pair Agent ID 1E Leaf B (I/O Boat 9 Slots 0,1,2). Use Document 1006063.1 for decoding.

Possible Causes: There are many possible hardware and software root causes for NCPQ_TOs. They can be caused by faulty CPUs, I/O bridge ASICs (Schizo), or PCI cards, as well as by bugs in the microcode of cPCI/PCI cards.
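The register values in the example log can be cross-checked with a short script. The following is a minimal sketch, not an Oracle tool: the function names are hypothetical, and the only bit positions used are those annotated in the log itself (AFAR address bits [42:32] held in bits [10:00] of the high word, PERR at bit 19, AID_LK at bit 24, NCPQ_TO at bit 25). Mapping the resulting address to a device still requires the tables in Document 1006063.1.

```python
# Bit names exactly as annotated in the example log; other bits are unknown here.
EMU_B_BITS = {24: "AID_LK", 25: "NCPQ_TO"}
AFSR_HIGH_BITS = {19: "PERR"}


def combine_afar(high, low):
    """Join AFAR (high) bits [10:0], which carry address bits [42:32],
    with the 32-bit AFAR (low) word into one 43-bit address."""
    return ((high & 0x7FF) << 32) | (low & 0xFFFFFFFF)


def decode(value, names):
    """Return the names of all set bits in a 32-bit register value.
    Bits without a known name are reported as 'bitN'."""
    return [names.get(bit, "bit%d" % bit)
            for bit in range(32) if (value >> bit) & 1]


# Values taken from the example log above.
afar_2 = combine_afar(0x0000063C, 0xFF800000)
print(hex(afar_2))                         # 0x63cff800000
print(decode(0x03000000, EMU_B_BITS))      # ['AID_LK', 'NCPQ_TO']
print(decode(0x00080000, AFSR_HIGH_BITS))  # ['PERR']
```

The combined AFAR_2 value is the address to look up in the non-cacheable address space tables.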
Solution

Troubleshooting: In general, the device indicated by AFAR_2 is the likely cause of the NCPQ_TO. However, the device reporting the error can also be the cause. Investigate whether the errors began after cPCI or PCI adapters were newly installed or moved, and reseat any such adapters. Also make sure that the drivers for the cards are up to date. If the problem does not involve a newly installed PCI card or a driver issue, collect an extended Explorer (see Document 1019066.1) and open a Service Request with Support Services.
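When many messages have accumulated on a loghost or in saved showlogs output, it can help to list which device paths reported an NCPQ_TO before deciding which adapters to inspect. The sketch below is an illustration only: the function name is hypothetical, and it assumes the log text follows the layout of the example in this document, where a line ending in a device path and a colon is followed by that device's register fields.

```python
import re

# A device path as it appears in the example log, terminated by a colon.
PATH_RE = re.compile(r"(/partition\d+/domain\d+/\S+?):")
# The NCPQ_TO field with its bit range and a value of 0x1.
FLAG_RE = re.compile(r"\bNCPQ_TO\s*\[\d+:\d+\]\s*:\s*0x1\b")


def find_ncpq_to_reporters(lines):
    """Return the device paths whose register dump has NCPQ_TO set."""
    reporters = []
    current_path = None
    for line in lines:
        match = PATH_RE.search(line)
        if match:
            current_path = match.group(1)  # remember the most recent device
        if FLAG_RE.search(line) and current_path is not None:
            reporters.append(current_path)
    return reporters


# Excerpt of the example log from this document.
sample = [
    "Feb 26 10:46:02 systemx DomC.SC: /partition1/domain0/SB1/bbcGroup0/sbbc0:",
    "FE [15:15] : 0x1",
    "Feb 26 12:20:47 systemx DomC.SC: "
    "/partition1/domain0/SB1/bbcGroup0/cpuAB/cpusafariagent0:",
    "EMU B[0x511] : 0x03000000",
    "AID_LK [24:24] : 0x1",
    "NCPQ_TO [25:25] : 0x1",
]
print(find_ncpq_to_reporters(sample))
```

The script only identifies the reporting device. The AFAR_2 value in the same register dump still has to be decoded to find the device that was being addressed.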
To discuss this information further with Oracle experts and industry peers, we encourage you to review, join or start a discussion in an appropriate My Oracle Support Community, Oracle Sun Technologies Community.
Internal Troubleshooting Instructions:

References
<NOTE:1006063.1> - Non-Cacheable Address Space tables for Sun[TM] Fire 3800/4800/4810/6800/E2900/E4900/E6900/V1280 and Netra[TM] 1280/1290 Server
<NOTE:1019066.1> - Sun Fire[TM] v1280, 3800, 4800, 4810, 6800, E2900, E4900, E6900 and Netra[TM] 1280, 2900 servers: How to collect scextended or 1280extended Explorer

Attachments
This solution has no attachment