Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1635267.1
Update Date:2018-05-17
Keywords:

Solution Type  Problem Resolution Sure

Solution  1635267.1 :   Sun Ultra[tm] 45 Workstation: Fatal System Bus Error has occurred.  


Related Items
  • Sun Ultra 45 Workstation
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>Usx/Blade/Netra>SN-SPARC: USx
  •  


NOTICE: SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major

In this Document
Symptoms
Cause
Solution


Created from <SR 3-8345821381>

Applies to:

Sun Ultra 45 Workstation - Version Not Applicable to Not Applicable [Release N/A]
Oracle Solaris on SPARC (64-bit)

Symptoms

PSU FRUIDSEEPROM that has certain checksum bit patterns cause CPU temperature error and Ebus Panic. 

The Workstation randomly panics with a similar stack shown bellow:

Jan 28 12:57:17 Chicago genunix: [ID 843051 kern.info] NOTICE: SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major
Jan 28 12:57:19 Chicago unix: [ID 836849 kern.notice]
Jan 28 12:57:19 Chicago ^Mpanic[cpu1]/thread=30001264ce0:
Jan 28 12:57:19 Chicago unix: [ID 604061 kern.notice] Fatal System Bus Error has occurred
Jan 28 12:57:20 Chicago unix: [ID 100000 kern.notice]
Jan 28 12:57:20 Chicago genunix: [ID 723222 kern.notice] 000002a10007fe70 px:px_err_cb_intr+134 (129e000, 300003b7090, 180c5f0, 2, 1, 2a10007ff20)
Jan 28 12:57:20 Chicago genunix: [ID 179002 kern.notice]   %l0-3: 00000300003d0d30 0000000000000000 0000000000000008 00000300003b7078
Jan 28 12:57:20 Chicago   %l4-7: 00000300000afc40 0000000000000000 000000000183d400 00000300003b7090
Jan 28 12:57:21 Chicago genunix: [ID 723222 kern.notice] 000002a10007ff50 unix:current_thread+170 (e, 29ec7ed4040, 7, 0, 5801, 7bb255e0)
Jan 28 12:57:21 Chicago genunix: [ID 179002 kern.notice]   %l0-3: 00000000010076e4 000002a1004d0eb1 000000000000000e 00000000000007bf
Jan 28 12:57:21 Chicago   %l4-7: 0000000000000001 00000000fef1e500 0000000000000000 000002a1004d1760
Jan 28 12:57:21 Chicago genunix: [ID 723222 kern.notice] 000002a1004d1800 pic16f747:pic_ioctl+104 (0, 600016e8fe8, fec7bf6e, 100003, 0, 5801)
Jan 28 12:57:21 Chicago genunix: [ID 179002 kern.notice]   %l0-3: 0000000000005800 00000000ffffffff 00000000000000fa 00000600016e8ff0
Jan 28 12:57:21 Chicago   %l4-7: 0000029ec7ed4000 0000000000000000 0000000000000000 00000000000000e8
Jan 28 12:57:22 Chicago genunix: [ID 723222 kern.notice] 000002a1004d18d0 genunix:___const_seg_900000212+1c60c (6000173ef40, 5801, fec7bf6e, 100003, 60000c018f8, 11fc6c8)
Jan 28 12:57:22 Chicago genunix: [ID 179002 kern.notice]   %l0-3: 0000060000eab800 0000060000eab800 0000000000000004 000006000164a788
Jan 28 12:57:22 Chicago   %l4-7: 00000000feed0400 0000000000000000 0000000000000000 00000000018a8400
Jan 28 12:57:22 Chicago genunix: [ID 723222 kern.notice] 000002a1004d1990 genunix:ioctl+184 (4, 60000dd8e78, fec7bf6e, ff000000, 5800, 5801)
Jan 28 12:57:23 Chicago genunix: [ID 179002 kern.notice]   %l0-3: 0000000000000000 0000000000000000 0000000000000004 0000000000008c0a
Jan 28 12:57:23 Chicago   %l4-7: 0000000000000001 0000000000000000 0000000000000000 0000000000000000
Jan 28 12:57:23 Chicago unix: [ID 100000 kern.notice]
Jan 28 12:57:23 Chicago genunix: [ID 672855 kern.notice] syncing file systems...
Jan 28 12:57:23 Chicago genunix: [ID 733762 kern.notice]  9
Jan 28 12:57:25 Chicago genunix: [ID 733762 kern.notice]  6
Jan 28 12:57:26 Chicago genunix: [ID 904073 kern.notice]  done
Jan 28 12:57:27 Chicago genunix: [ID 111219 kern.notice] dumping to /dev/dsk/c1t0d0s1, offset 65536, content: kernel
Jan 28 12:57:40 Chicago genunix: [ID 409368 kern.notice] ^M100% done: 54370 pages dumped, compression ratio 3.86,
Jan 28 12:57:40 Chicago genunix: [ID 851671 kern.notice] dump succeeded
Jan 28 12:58:28 Chicago genunix: [ID 540533 kern.notice] ^MSunOS Release 5.10 Version Generic_118833-22 64-bit

 

The FMA also reports several events like this:  ereport.io.fire.jbc.ebus_to

Cause

The issue could be related to software. Indeed, there are some bugs with the same scenarios fixed in Solaris 10 Update 4.

For the Technical Service Engineers: Please check if the system is not hit by any of the following bugs, already fixed in Solaris 10 Update 3 and 4.

Bug 15437926 : SUNBT6628796 PIC STATE MACHINE NEEDS TO BE RESET PRIOR TO ACCESSING PIC FUNCTION
Bug 15440010 : SUNBT6632191 EXCESSIVE FAN NOISE AFTER SYSTEM COOLS DOWN AND FAN RPM SOMETIMES

 

Solution

1) Please check if your system is running Solaris 10 update 4.
2) Furthermore, check if you've recently changed some Hardware or Software configuration in your system (eg: new PCI cards, new patches, etc).
3) If you have installed the Solaris 10 update 4 and you don't have any change in your system, it's possible that it is a hardware issue. Please open a Service Request by one of the following methods:

* Use My Oracle Support to open the service request using active CSI# for the system.

* Call Oracle directly using your local support center

 

 

For the Technical Service Engineers: This issue often is fixed after motherboard replacement, however, in this case I solved the issue after I replaced the PSU.

Action Plan:

1) Ensure that the system is running Solaris 10 Update 4 and there was NOT any Software or Hardware change in the last time.

2) If the problem persists, please provide with the HW onsite activity. Probably motherboard, PCI or PSU could be affected. Please check all events and provide with the analysis before to send the FE onsite with the part or parts affected.

 

 

 


 


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback