Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-71-1019438.1
Update Date: 2017-10-06
Keywords:

Solution Type  Technical Instruction Sure

Solution  1019438.1 :   How to gather data from a Sun Netra[TM] CT900 system.  


Related Items
  • Sun Netra CT900 Server
  • Sun Netra CP3260 ATCA Blade Server
  • Sun Netra CP3220 ATCA Blade Server
  • Netra SPARC T3-1B
  • Sun Netra CP3010 Blade Server

Related Categories
  • PLA-Support>Sun Systems>SPARC>Usx/Blade/Netra>SN-SPARC: Netra Cxxxx

PreviouslyPublishedAs
239648


Applies to:

Sun Netra CP3010 Blade Server - Version Not Applicable and later
Sun Netra CP3220 ATCA Blade Server - Version Not Applicable and later
Sun Netra CP3260 ATCA Blade Server - Version Not Applicable and later
Sun Netra CT900 Server - Version Not Applicable and later
Netra SPARC T3-1B - Version Not Applicable and later
All Platforms

Goal

Enable the customer to collect the data needed to diagnose certain Netra CT900 problems.

 

Fix



Description
This document describes how to collect data from the various components of a Sun Netra[TM] CT900 server.

 

Symptoms

  • Unable to log in to the Netra CT900
  • Malfunction of a node in the CT900
  • Chassis configuration issue
  • Power issue
  • Network or communication issues with the CT900


Purpose and Scope

This document details how to gather data from the components of a Netra CT900: the chassis LEDs, the ShMM, the switches, and the node boards.


Steps to Follow
Follow these steps to gather the data needed to diagnose problems on the Netra CT900.

Data can be collected from the following components of the Netra CT900 server:

  1. Chassis-related LEDs

  2. Shelf Management Module (ShMM)

  3. Four switch fabrics (two per switch blade: BASE and FABRIC)

  4. Node boards (these cover a variety of architectures and operating systems)

 

  • Chassis related LEDs

Note the status of the various chassis LEDs.
Refer to <Document: 1019516.1>.


  • Shelf Management Module (ShMM)

The ShMMs run a version of Linux with applications, such as clia, designed specifically to manage the CT900 shelf.

There are two ShMMs: one is active and the other is the backup.  Failover can occur as a result of an error or can be initiated from the command line.  Log in to the active ShMM as root (via serial connection, telnet, or ssh).  See the following documents for cabling instructions.

  • Serial connection <Document: 1019404.1>

  • Network connection <Document: 1019514.1>

Confirm that the connection is to the active ShMM using the clia shmstatus command (see the example below).  This is required unless the problem under investigation is specific to the backup ShMM.  NOTE: Many script commands produce no output when run on the backup ShMM.

# clia shmstatus
Pigeon Point Shelf Manager Command Line Interpreter
Host: "Active"
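
When the session output is being logged, the active/backup check can also be scripted. The following is a minimal Python sketch, assuming the output of clia shmstatus has already been captured as text and that the role always appears on a Host: line in the form shown above. The function names are illustrative and are not part of the ShMM software.

```python
import re
from typing import Optional


def shmm_role(shmstatus_output: str) -> Optional[str]:
    """Return the value quoted on the 'Host:' line, or None if no such line exists."""
    match = re.search(r'^Host:\s*"([^"]+)"', shmstatus_output, re.MULTILINE)
    return match.group(1) if match else None


def is_active_shmm(shmstatus_output: str) -> bool:
    """True only when the Host line reports "Active" (assumed format, see lead-in)."""
    return shmm_role(shmstatus_output) == "Active"


sample = '''Pigeon Point Shelf Manager Command Line Interpreter
Host: "Active"
'''
print(is_active_shmm(sample))
```

Any other value, or a missing Host: line, is treated as "not active", so the check fails safe if the format differs from the example.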

 

  • Run the summary script.

NOTE: The /etc/summary script is the equivalent of the Solaris[TM] explorer tool; it also includes information about the shelf configuration, such as board information.

Example:

# /etc/summary
send debug file /tmp/debug.log to PPS
# 

The script will generate a log file (/tmp/debug.log) which should be provided to Oracle/Sun Support.


  • Capture the verbose output of the event log using the clia sel -v command

Example:

# clia sel -v
Pigeon Point Shelf Manager Command Line Interpreter
0x027A: Event: at: Sep 11 18:30:23 2008; from IPM Controller: 0x96, LUN: 0, Channel: 0
"Temperature" (0x01) sensor # 6
"Threshold" (0x01) event Deasserted
"Upper Non-Critical Going High"
0x027B: Event: at: Sep 11 18:30:29 2008; from IPM Controller: 0x20, LUN: 0, Channel: 0
"IPMB Link" (0xf1) sensor # 26
"Sensor-specific" (0x6f) event Asserted
"Undefined"
OEM code 1: 0x00
OEM code 2: 0x87

NOTE: The output shown above is truncated for clarity of this document
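
For long event logs it can be convenient to reduce the verbose output to one record per event. The following Python sketch parses text in the format of the example above; it assumes every event begins with a line of the form "0x...: Event: at: ...; from IPM Controller: 0x..." and has been checked only against that sample. The function and field names are illustrative.

```python
import re
from typing import Dict, List

HEADER = re.compile(
    r'^(0x[0-9A-Fa-f]+): Event: at: (.+?); from IPM Controller: (0x[0-9A-Fa-f]+)'
)
SENSOR = re.compile(r'^"([^"]+)" \((0x[0-9A-Fa-f]+)\) sensor # (\d+)')
STATE = re.compile(r'event (Asserted|Deasserted)')


def parse_sel(text: str) -> List[Dict[str, str]]:
    """Collect the id, time, controller, sensor and state of each SEL event."""
    events: List[Dict[str, str]] = []
    for raw in text.splitlines():
        line = raw.strip()
        header = HEADER.match(line)
        if header:
            events.append({"id": header.group(1),
                           "time": header.group(2),
                           "controller": header.group(3)})
            continue
        if not events:
            continue  # banner lines before the first event
        sensor = SENSOR.match(line)
        if sensor:
            events[-1]["sensor_type"] = sensor.group(1)
            events[-1]["sensor_number"] = sensor.group(3)
            continue
        state = STATE.search(line)
        if state:
            events[-1]["state"] = state.group(1)
    return events


sample_sel = '''Pigeon Point Shelf Manager Command Line Interpreter
0x027A: Event: at: Sep 11 18:30:23 2008; from IPM Controller: 0x96, LUN: 0, Channel: 0
"Temperature" (0x01) sensor # 6
"Threshold" (0x01) event Deasserted
"Upper Non-Critical Going High"
0x027B: Event: at: Sep 11 18:30:29 2008; from IPM Controller: 0x20, LUN: 0, Channel: 0
"IPMB Link" (0xf1) sensor # 26
"Sensor-specific" (0x6f) event Asserted
"Undefined"
'''
for event in parse_sel(sample_sel):
    print(event["id"], event["controller"], event["sensor_number"], event["state"])
```

The sensor numbers extracted this way are the values needed for the sensordata command.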


  • Various sensor outputs: [clia] sensordata board <slot #> <sensor #>

After reviewing the debug.log file, it may be necessary to examine the readings of particular sensors.  The sensor number can be obtained from the SEL (System Event Log) output above.
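
When several sensors need to be checked, the command lines can be generated rather than typed. This is a small Python sketch that only builds the command strings in the syntax given above; the slot number must be supplied by the operator, since this document does not describe how to derive it from the IPM Controller address in the SEL. The function name and the example slot and sensor values are hypothetical.

```python
from typing import Iterable, List


def sensordata_command(slot: int, sensor: int) -> str:
    """Build 'clia sensordata board <slot #> <sensor #>' for one sensor."""
    if slot < 1 or sensor < 0:
        raise ValueError("slot must be >= 1 and sensor must be >= 0")
    return f"clia sensordata board {slot} {sensor}"


def sensordata_commands(slot: int, sensors: Iterable[int]) -> List[str]:
    """Build one command per sensor number for a single board slot."""
    return [sensordata_command(slot, s) for s in sensors]


# Hypothetical example: slot 3, sensor numbers taken from the SEL output.
for command in sensordata_commands(3, [6, 26]):
    print(command)
```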



  • Switches

A detailed discussion of the switch blade architecture is beyond the scope of this document, but the following steps allow basic configuration and statistics to be collected and provide an introduction to obtaining information from a switch blade.

NOTE: There are four switch fabrics, two per switch blade.  Each switch blade contains a BASE fabric and an EXTENDED fabric (referred to as FABRIC).  For full details, run all of the commands on all of the switch fabrics.
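
Because every command has to be repeated on each fabric, it can help to enumerate the work before starting. The following Python sketch builds a simple checklist of (blade, fabric, command) entries. The blade labels are hypothetical placeholders, and the command list is simply the set of show commands used in this document; adjust both to the actual shelf.

```python
from itertools import product
from typing import List, Tuple

BLADES = ("switch-blade-1", "switch-blade-2")   # hypothetical labels
FABRICS = ("BASE", "FABRIC")
COMMANDS = (
    "show running-config",
    "show hardware",
    "show sysinfo",
    "show port all",
)


def collection_plan() -> List[Tuple[str, str, str]]:
    """Return every (blade, fabric, command) combination to be captured."""
    return [(blade, fabric, command)
            for blade, fabric in product(BLADES, FABRICS)
            for command in COMMANDS]


for blade, fabric, command in collection_plan():
    print(f"{blade:15} {fabric:7} {command}")
```

Ticking entries off this list makes it easier to confirm that no fabric was missed before the logs are sent to support.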

See the following documents for instructions and guidance for cabling

  • Serial connection <Document: 1019404.1>

  • Network connections <Document: 1019514.1>

Log in to the required switch fabric as "admin" and ensure that all output is logged via the Unix script command or equivalent.  The default password is <NULL> (just press the Return key).


  • Obtain a printout of the current switch configuration

First enter privileged mode using the en command (the default password is also <NULL>).

(CP3240H-BEX-Z Base) > en
Password: <NULL>
(CP3240H-BEX-Z Base) #

NOTE: The prompt changed from > to #.

  • Then capture the current running configuration.

Example:

(CP3240H-BEX-Z Base) # show running-config
!Current Configuration:
!
!System Description "CP3240H-BEX-Z Base"
!System Software Version "1.1.8.0"
!
set prompt "v-ct900a-sw-b-b-gmp02"
serviceport protocol none
serviceport ip 129.156.204.81 255.255.252.0 129.156.204.1
vlan database
exit
configure
!
logging buffered
ipv6 router ospf
exit
router rip
exit
lineconfig
exit
spanning-tree configuration name "00-20-13-F1-0D-3D"
router ospf
exit
interface  0/1
exit

NOTE: This output is truncated for clarity.
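
When configurations from all four fabrics have been captured, the header comments give a quick way to compare them. The following Python sketch extracts the "!System ..." header lines from a saved running configuration; it assumes the header format shown in the example above, and the function name is illustrative.

```python
import re
from typing import Dict

SYSTEM_LINE = re.compile(r'^!System (.+?) "([^"]*)"\s*$')


def config_header(running_config: str) -> Dict[str, str]:
    """Map each '!System <name> "<value>"' header line to name -> value."""
    info: Dict[str, str] = {}
    for line in running_config.splitlines():
        match = SYSTEM_LINE.match(line.strip())
        if match:
            info[match.group(1)] = match.group(2)
    return info


sample_config = '''!Current Configuration:
!
!System Description "CP3240H-BEX-Z Base"
!System Software Version "1.1.8.0"
!
set prompt "v-ct900a-sw-b-b-gmp02"
'''
print(config_header(sample_config))
```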

  • Obtain the switch's physical and other properties

(CP3240H-BEX-Z Base) # show hardware

System Description............................. CP3240H-BEX-Z Base
Machine Type................................... CP3240H-BEX-Z
Machine Model.................................. CP3240H-BEX-Z
Serial Number.................................. 1544DTI-0742330074
FRU Number..................................... 375-3523-01
Part Number.................................... 375-3523-01
Maintenance Level.............................. A
Manufacturer................................... 0x34b7
Burned In MAC Address.......................... 00:20:13:F1:0E:6D
Software Version............................... 1.3.3.0
Operating System............................... Linux 2.4.20_mvl31
Network Processing Device...................... BCM56504 REV 1

Additional Packages............................ FASTPATH QOS
                                                FASTPATH Multicast
                                                FASTPATH IPv6

(CP3240H-BEX-Z Base) # show sysinfo
System Description............................. ATS1160 Base
System Name....................................
System Location................................
System Contact.................................
System Object ID............................... lvl7
System Up Time................................. 25 days 0 hrs 45 mins 12 sec

NOTE: This output is truncated for clarity.
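
The dotted "name....... value" layout used by show hardware and show sysinfo is straightforward to turn into a lookup table. The following Python sketch assumes that layout, with indented continuation lines (as under Additional Packages) belonging to the preceding field. It has been checked only against the sample output in this document.

```python
import re
from typing import Dict, Optional

FIELD = re.compile(r'^(\S.*?)\.{2,}\s*(.*)$')


def parse_dotted(text: str) -> Dict[str, str]:
    """Parse 'Name........ value' lines; indented lines extend the previous value."""
    fields: Dict[str, str] = {}
    last_key: Optional[str] = None
    for line in text.splitlines():
        if not line.strip():
            continue
        match = FIELD.match(line)
        if match:
            last_key = match.group(1).strip()
            fields[last_key] = match.group(2).strip()
        elif line[0].isspace() and last_key is not None:
            fields[last_key] += "; " + line.strip()
    return fields


sample_hw = '''(CP3240H-BEX-Z Base) # show hardware

System Description............................. CP3240H-BEX-Z Base
Serial Number.................................. 1544DTI-0742330074
Software Version............................... 1.3.3.0
Additional Packages............................ FASTPATH QOS
                                                FASTPATH Multicast
                                                FASTPATH IPv6
System Name....................................
'''
hardware = parse_dotted(sample_hw)
print(hardware["Serial Number"], hardware["Software Version"])
```

Fields with no value, such as an unset System Name, are kept with an empty string so that missing settings remain visible.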

  • Obtain a summary of the switch port configuration

(CP3240H-BEX-Z Base) # show port all
                Admin    Physical  Physical   Link    Link    LACP
Intf    Type    Mode     Mode      Status     Status  Trap    Mode
------  ------  -------  --------  ---------  ------  ------  ------
0/1             Enable   Auto      100 Full   Up      Enable  Enable
0/2             Enable   Auto      1000 Full  Up      Enable  Enable
0/3             Disable  Auto                 Down    Enable  Enable
0/4             Disable  Auto                 Down    Enable  Enable
0/5             Disable  Auto                 Down    Enable  Enable
0/6             Disable  Auto                 Down    Enable  Enable
0/7             Disable  Auto                 Down    Enable  Enable
0/8             Disable  Auto                 Down    Enable  Enable
0/9             Disable  Auto                 Down    Enable  Enable
0/10            Disable  Auto                 Down    Enable  Enable
0/11            Enable   Auto      1000 Full  Up      Enable  Enable
0/12            Enable   Auto      100 Full   Up      Enable  Enable
0/13            Enable   Auto      1000 Full  Up      Enable  Enable
0/14            Enable   Auto      100 Full   Up      Enable  Enable
0/15            Disable  Auto                 Down    Enable  Enable
0/16            Disable  Auto                 Down    Enable  Enable
0/17            Enable   Auto                 Down    Enable  Enable
0/18            Enable   Auto                 Down    Enable  Enable
0/19            Enable   Auto                 Down    Enable  Enable
0/20            Enable   Auto      100 Full   Up      Enable  Enable
0/21            Enable   Auto                 Down    Enable  Enable
0/22            Enable   Auto                 Down    Enable  Enable
0/23            Enable   Auto                 Down    Enable  Enable
0/24            Enable   Auto                 Down    Enable  Enable
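
Ports that are administratively enabled but have no link are often the first place to look for a connectivity problem. The following Python sketch picks them out of captured show port all output. It assumes, as in the sample above, that the Type column is empty and that the last three fields of each row are Link Status, Link Trap, and LACP Mode. The names are illustrative.

```python
import re
from typing import List, NamedTuple

PORT_ID = re.compile(r'^\d+/\d+$')


class Port(NamedTuple):
    intf: str
    admin_mode: str
    physical_mode: str
    physical_status: str   # e.g. "100 Full"; empty when the link is down
    link_status: str


def parse_ports(text: str) -> List[Port]:
    """Parse the data rows of 'show port all', skipping headers and rulers."""
    ports: List[Port] = []
    for line in text.splitlines():
        tokens = line.split()
        if len(tokens) < 6 or not PORT_ID.match(tokens[0]):
            continue
        # Layout: intf, admin, physical mode, [speed duplex], link, trap, lacp
        ports.append(Port(tokens[0], tokens[1], tokens[2],
                          " ".join(tokens[3:-3]), tokens[-3]))
    return ports


def enabled_but_down(ports: List[Port]) -> List[str]:
    """Return the interfaces with Admin Mode 'Enable' and Link Status 'Down'."""
    return [p.intf for p in ports
            if p.admin_mode == "Enable" and p.link_status == "Down"]


sample_ports = '''0/1 Enable Auto 100 Full Up Enable Enable
0/2 Enable Auto 1000 Full Up Enable Enable
0/3 Disable Auto Down Enable Enable
0/17 Enable Auto Down Enable Enable
'''
print(enabled_but_down(parse_ports(sample_ports)))
```

An enabled port without link is not necessarily a fault (the slot may simply be empty), so the result is a list of ports to review rather than a list of errors.
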
  • Gather statistics of individual ports

Where the problem points to particular ports, it may be necessary to gather statistics for those ports individually.

(CP3240H-BEX-Z Base) # show interface 0/1
Packets Received Without Error................. 1430658
Packets Received With Error.................... 0
Broadcast Packets Received..................... 1141
Packets Transmitted Without Errors............. 22407251
Transmit Packet Errors......................... 0
Collision Frames............................... 0
Time Since Counters Last Cleared............... 44 day 3 hr 4 min 12 sec

(CP3240H-BEX-Z Base) # show interface ethernet 0/1

Total Packets Received (Octets)................ 32041289
Packets Received 64 Octets..................... 500635
Packets Received 65-127 Octets................. 0
Packets Received 128-255 Octets................ 0
Packets Received 256-511 Octets................ 0
Packets Received 512-1023 Octets............... 1
Packets Received 1024-1518 Octets.............. 0
Packets Received > 1522 Octets................. 0
Packets RX and TX 64 Octets.................... 133197800
Packets RX and TX 65-127 Octets................ 6160159
Packets RX and TX 128-255 Octets............... 2369932
Packets RX and TX 256-511 Octets............... 573424
Packets RX and TX 512-1023 Octets.............. 210512
Packets RX and TX 1024-1518 Octets............. 0
Packets RX and TX 1519-2047 Octets............. 0
Packets RX and TX 2048-4095 Octets............. 0
Packets RX and TX 4096-9216 Octets............. 0

Total Packets Received Without Errors.......... 500635
Unicast Packets Received....................... 0
Multicast Packets Received..................... 0
Broadcast Packets Received..................... 500635
Total Packets Received with MAC Errors......... 1
Jabbers Received............................... 0
Fragments/Undersize Received................... 0
Alignment Errors............................... 0
FCS Errors..................................... 1
Overruns....................................... 0
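
With this many counters per port, a quick filter for non-zero error counters can save time. The following Python sketch scans captured show interface output for counters whose names suggest an error condition. The keyword list is an assumption based on the counter names in the samples above, not an exhaustive list from the switch documentation.

```python
import re
from typing import Dict

COUNTER = re.compile(r'^(\S.*?)\.{2,}\s*(\d+)\s*$')
# Assumed keywords, taken from the counter names shown in this document.
ERROR_WORDS = ("Error", "Jabbers", "Fragments", "Overruns", "Collision")


def nonzero_error_counters(text: str) -> Dict[str, int]:
    """Return error-type counters with a value greater than zero."""
    hits: Dict[str, int] = {}
    for line in text.splitlines():
        match = COUNTER.match(line.strip())
        if not match:
            continue
        name, value = match.group(1), int(match.group(2))
        # "... Without Error(s)" counters count good packets, so skip them.
        is_error = any(word in name for word in ERROR_WORDS) and "Without" not in name
        if is_error and value > 0:
            hits[name] = value
    return hits


sample_stats = '''Total Packets Received Without Errors.......... 500635
Broadcast Packets Received..................... 500635
Total Packets Received with MAC Errors......... 1
Jabbers Received............................... 0
Alignment Errors............................... 0
FCS Errors..................................... 1
Overruns....................................... 0
'''
print(nonzero_error_counters(sample_stats))
```

Counters are cumulative since they were last cleared, so a small non-zero value over a long period may be harmless; comparing two captures taken some time apart shows whether errors are still increasing.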

The output of the above commands should be sufficient to provide initial information when contacting the Sun Support Center about a Netra CT900 switch or connectivity problem.



  • Node Boards

These boards can be of a variety of architectures and can run different operating systems. Use the appropriate tool to collect data, such as explorer and snapshot for Solaris, or sosreport for Red Hat 5.

Note that the ShMM data collected in the section above includes information about board power cycles, hardware type, and slots.
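
On a shelf with mixed node boards, the choice of tool can be recorded in a small lookup table. The following Python sketch lists only the tools named in this document; the operating system labels used as keys are illustrative, and any other operating system returns an empty result so that the gap is noticed rather than guessed at.

```python
from typing import Dict, Tuple

# Tools named in this document, keyed by an illustrative OS label.
COLLECTION_TOOLS: Dict[str, Tuple[str, ...]] = {
    "Solaris": ("explorer", "snapshot"),
    "Red Hat 5": ("sosreport",),
}


def tools_for(os_name: str) -> Tuple[str, ...]:
    """Return the data collection tools for a node board's operating system."""
    return COLLECTION_TOOLS.get(os_name, ())


for os_name in ("Solaris", "Red Hat 5", "Other"):
    print(os_name, "->", ", ".join(tools_for(os_name)) or "no tool listed")
```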

References:

http://download.oracle.com/docs/cd/E19217-01/index.html

 



Product
Sun Netra CT 900 Server
Netra CP3010 Board
Netra CP3020 Board
Netra CP3060 Board
Sun Netra CP3140 ATCA Switch Blade
Sun Netra CP3220 ATCA Blade Server
Sun Netra CP3260 ATCA Blade Server
Netra CP3240 Switch

Netra SPARC T3-1BA



@ Internal Comments
@ This document contains normalized content and is managed by the Domain Lead(s) of the respective domains. To notify content owners of a knowledge gap in this document, or before updating this document, please contact the domain engineers who manage it via the "Document Feedback" alias(es) listed below:

@ Domain Lead: Dencho.Kojucharov@sun.com
@ Feedback Alias: blade_normalizers@sun.com

@ normalized, CT900, Netra, ATCA, shelf, clia, summary, pigeon point, CP3010, CP3020, CP3060, CP3220, CP3250, CP3260, CP3140, CP3240, ShMM, ShMc,

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.