Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-75-1395544.1
Update Date:2017-05-15

Solution Type  Troubleshooting Sure

Solution  1395544.1 :   Sun SPARC Enterprise Mx000 Server: How to gather information when the XSCF is inaccessible via the network  


Related Items
  • Sun SPARC Enterprise M9000-32 Server
  • Sun SPARC Enterprise M8000 Server
  • Sun SPARC Enterprise M5000 Server
  • Sun SPARC Enterprise M9000-64 Server
  • Sun SPARC Enterprise M4000 Server
  • Sun SPARC Enterprise M3000 Server
Related Categories
  • PLA-Support>Sun Systems>SPARC>Enterprise>SN-SPARC: Mx000




Applies to:

Sun SPARC Enterprise M3000 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun SPARC Enterprise M4000 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun SPARC Enterprise M9000-64 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun SPARC Enterprise M5000 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun SPARC Enterprise M8000 Server - Version Not Applicable to Not Applicable [Release N/A]
Information in this document applies to any platform.
M3000, M4000, M5000, M8000, M9000, M9000-32, M9000-64



Purpose

This document describes how to gather information from an Mx000 system whose XSCF is inaccessible via the network.

Troubleshooting Steps

If the platform is completely unresponsive and the XSCFU is inaccessible via the network port, the only way to gather any information is via the serial port on the XSCF.

If the XSCF does not boot correctly or fails POST, its network interface is never configured.

Connect a console to the serial port and power on the system.

The console should start to display booting messages similar to the following.

Example 1 - XSCFU POST detects a memory error
------------------------------------------------------------------
XSCF uboot 01080001 (May 8 2009 - 15:09:36)

SCF board boot factor = a080
memory test ..
Memory compare test

memory error detect 80000008, address 00000000 data aaaaaaaa -> deadffef
capture_data hi deadffef lo deadbeef ecc eeee
capture_attributes 01113001 address 00000000   <------ Notice that the XSCF stops here so no further information.
---------

Example 2 - XSCFU corruption
-----------------------------------------
XSCF uboot 01100001 (Feb 3 2011 - 14:43:12)

SCF board boot factor = 4080
memory test ..
Memory compare test
................finish
DDR Real size: 256 MB
DDR: 224 MB

## Booting image at ff800000 ...
Image Name: XSCF kernel 01100001 2.6.11.12-s
Image Type: PowerPC Linux Kernel Image (gzip compressed)
Data Size: 1457363 Bytes = 1.4 MB
Load Address: 00000000
Entry Point: 00000000
Verifying Checksum ... OK
Uncompressing Kernel Image ... OK
## Loading RAMDisk Image at ff980000 ...
Image Name: XSCF rootfs 01100002 ,2011/05/12
Image Type: PowerPC Linux RAMDisk Image (gzip compressed)
Data Size: 5486484 Bytes = 5.2 MB
Load Address: 00000000
Entry Point: 00000000
Verifying Checksum ... Bad Data CRC   <------ Notice that the XSCF stops here so no further information.
----------------

Example 3 - XSCFU internal fault
-----------------------------------------

The system is looping at boot:
## Booting image at ff800000 ...
Image Name: XSCF kernel 01090003 2.6.11.12-s
Image Type: PowerPC Linux Kernel Image (gzip compressed)
Data Size: 1457048 Bytes = 1.4 MB
Load Address: 00000000
Entry Point: 00000000
Verifying Checksum ... OK
Uncompressing Kernel Image ... OK
## Loading RAMDisk Image at ff980000 ...
Image Name: XSCF rootfs 01100000 ,2010/10/07
Image Type: PowerPC Linux RAMDisk Image (gzip compressed)
Data Size: 5479265 Bytes = 5.2 MB
Load Address: 00000000
Entry Point: 00000000
Verifying Checksum ... OK
Loading Ramdisk to 0bac6000, end 0bfffb61 ... OK
XSCF uboot 01080001 (May 8 2009 - 15:09:36)
SCF board boot factor = 40c0
DDR Real size: 256 MB
DDR: 224 MB
XSCF FAULT (reason=0)     <------ Notice that the XSCF stops here so no further information.
----------------


Example 4 - XSCFU has lost the firmware - unable to login as default.
-----------------------------------------------------------------------------------

XSCF uboot  01080000  (Sep 26 2008 - 09:31:29)

XSCF uboot  01080000  (Sep 26 2008 - 09:31:29)

SCF board boot factor = 4080
memory test ..
Memory compare test
................finish
    DDR Real size: 256 MB
    DDR: 224 MB

## Booting image at ff800000 ...
   Image Name:   XSCF kernel 01080001 2.6.11.12-s
   Image Type:   PowerPC Linux Kernel Image (gzip compressed)
   Data Size:    1456856 Bytes =  1.4 MB
   Load Address: 00000000
   Entry Point:  00000000
   Verifying Checksum ... OK
   Uncompressing Kernel Image ... OK
## Loading RAMDisk Image at ff980000 ...
   Image Name:   XSCF rootfs 01080004 ,2009/01/08
   Image Type:   PowerPC Linux RAMDisk Image (gzip compressed)
   Data Size:    5454338 Bytes =  5.2 MB
   Load Address: 00000000
   Entry Point:  00000000
   Verifying Checksum ... OK
   Loading Ramdisk to 0bacc000, end 0bfffa02 ... OK
Linux version 2.6.11.12-sec (gcc version 3.4.4) #1 Thu Jan 8 14:29:31 JST 2009
new message buffer at 0f700000 size 1048576
log_buf_len: 1048576
mpc85xx_cds_setup_arch
Built 1 zonelists
Kernel command line: root=/dev/ram rw console=ttyS0,9600 init=/sbin/init_change_root panic=1 mem=240M
OpenPIC Version 1.2 (1 CPUs and 44 IRQ sources) at fbe79000
PID hash table entries: 1024 (order: 10, 16384 bytes)
Dentry cache hash table entries: 32768 (order: 5, 131072 bytes)
Inode-cache hash table entries: 16384 (order: 4, 65536 bytes)
Memory: 218752k available (2188k kernel code, 668k data, 316k init, 0k highmem)
Mount-cache hash table entries: 512 (order: 0, 4096 bytes)
Freeing initrd memory: 5326k freed
RAMDISK driver initialized: 16 RAM disks of 32768K size 1024 blocksize
i2c-algo-cpm: CPM2 I2C algorithm module version 0.1 (Mar 22, 2005)
FCC ENET Version 0.3
TCP established hash table entries: 8192 (order: 4, 65536 bytes)
TCP bind hash table entries: 8192 (order: 3, 32768 bytes)
ip_tables: (C) 2000-2002 Netfilter core team
arp_tables: (C) 2002 David S. Miller
EXT2-fs warning: checktime reached, running e2fsck is recommended
VFS: Mounted root (ext2 filesystem).
Freeing unused kernel memory: 316k init
switching initrd filesystem, ramdisk to tmpfs
SCF Linux Boot Script 2006/03/04 for ROM boot environment
fsl-sec2 hardware crypt accelerator model3a ver 0.02 enabled

XSCF initial process start (pid=104)

load /scf/modules/lites_ldrv.ko  --  complete
load /scf/modules/drvscftrace.ko  --  complete
load /scf/modules/sec2_rsa.ko  --  complete
load /scf/modules/sec2_md5.ko  --  complete
load /scf/modules/sec2_des.ko  --  complete
load /scf/modules/sec2_arc4.ko  --  complete
load /scf/modules/sec2_aes.ko  --  complete
load /scf/modules/sec2_sha256.ko  --  complete
load /scf/modules/sec2_sha1.ko  --  complete
load /scf/modules/hw_random.ko  --  complete
load /scf/modules/scsi_mod.ko  --  complete
load /scf/modules/sd_mod.ko  --  complete
load /scf/modules/usbcore.ko  --  complete
load /scf/modules/ohci-hcd.ko  --  complete
load /scf/modules/usb-storage.ko  --  complete
load /scf/modules/drvbootfmem.ko  --  complete
load /scf/modules/drvmbc.ko  --  complete


***** WARNING *****
XSCF initialization terminate,
because there is no XSCF-Firmware in this XSCF board.
Please install XSCF-Firmware.

*** SCF_INIT was set FACTORY mode automatically. ***


login: default
login: cannot run /scf/bin/rbash: No such file or directory  <-------- Unable to login as default.
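
The four failure signatures shown in the examples above can be looked for in a saved console capture. The following is a minimal sketch in Python, not an Oracle tool: the function name and the labels are illustrative, and the only strings it matches are the ones that appear in the example output in this document. Other XSCF failures may print different messages, so an empty result does not mean the XSCF is healthy.

```python
# Signature strings are copied from the four console examples in this
# document. The labels and the function name are illustrative only.
SIGNATURES = [
    ("Example 1: XSCFU POST detects a memory error", "memory error detect"),
    ("Example 2: XSCFU corruption", "Bad Data CRC"),
    ("Example 3: XSCFU internal fault", "XSCF FAULT (reason="),
    ("Example 4: XSCFU has lost the firmware",
     "there is no XSCF-Firmware in this XSCF board"),
]


def classify_boot_log(text):
    """Return the label of every known failure signature found in text."""
    return [label for label, needle in SIGNATURES if needle in text]
```

To use it, read the saved capture into a string (for example with open(path, errors="replace").read(), where path is wherever the capture was saved) and pass it to classify_boot_log.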

 

SOLUTION
As part of your data collection when opening a service request with Oracle, connect to the serial port of the XSCF and gather the boot logs of the XSCF to allow the issue to be investigated in a timely manner.
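
Any terminal emulator that can log a session is enough to gather the boot log. Where a scripted capture is preferred, the following Python sketch shows one way to timestamp each console line as it arrives. It is a sketch under stated assumptions: the function name is hypothetical, the serial device is assumed to be already opened in binary mode and configured by the operating system, and the device path depends on the administration host. The kernel command line in Example 4 shows the XSCF console running at 9600 baud.

```python
import io
import sys
from datetime import datetime


def log_console(src, dst, clock=datetime.now):
    """Copy console lines from src to dst, prefixing each with a timestamp.

    src is any iterable of bytes lines, for example a serial device that
    has been opened in binary mode (assumption: the port is already
    configured). dst is a text-mode file object.
    Returns the number of lines written.
    """
    count = 0
    for raw in src:
        # Console output is expected to be ASCII; replace anything else.
        line = raw.decode("ascii", errors="replace").rstrip("\r\n")
        dst.write(f"{clock().isoformat(timespec='seconds')} {line}\n")
        dst.flush()  # keep the log current if the session is interrupted
        count += 1
    return count


if __name__ == "__main__":
    # Stand-in for a real serial device so the sketch runs anywhere.
    sample = io.BytesIO(
        b"XSCF uboot 01080001 (May 8 2009 - 15:09:36)\r\nmemory test ..\r\n"
    )
    log_console(sample, sys.stdout)
```

With a real device the loop runs until the stream ends or the script is interrupted, so start the capture before powering on the system and stop it once the output halts.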

 

In all of these cases, the XSCF had to be replaced to clear the fault.

 ---------------------------------------------------------------

For the M3000, the MBU needs replacement. No system operation is possible until the MBU containing the XSCF is replaced.

For the M4000/M5000, the XSCF needs replacement. No system operation is possible until the XSCF is replaced.

For the M8000/M9000, the indicted XSCF needs replacement. Continue operation on the other XSCF. XSCF redundancy is compromised until the XSCF is replaced.
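
The per-model guidance above can be expressed as a small lookup table, which may be convenient in a triage script. This is an illustrative Python sketch only: the table and function names are hypothetical, and the values simply restate the three paragraphs above.

```python
# (part to replace, can the platform keep running?) per server model,
# transcribed from the guidance above. All names here are illustrative.
REPLACEMENT = {
    "M3000": ("MBU containing the XSCF", False),
    "M4000": ("XSCF", False),
    "M5000": ("XSCF", False),
    "M8000": ("indicted XSCF", True),
    "M9000": ("indicted XSCF", True),
}


def replacement_action(model):
    """Return (part_to_replace, can_continue) for a model such as 'M9000-64'.

    Raises KeyError for a model that is not in the table.
    """
    family = model.strip().upper().split("-")[0]  # "m9000-64" -> "M9000"
    return REPLACEMENT[family]
```

For the M8000/M9000, the True value reflects continued operation on the other XSCF, with redundancy lost until the indicted unit is replaced.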

References

<NOTE:1620578.1> - Sun SPARC(R) Enterprise M3000/M4000/M5000/M8000/M9000 (OPL) Servers: XSCF displays memory error detected during boot

  Copyright © 2018 Oracle, Inc.  All rights reserved.