Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1509311.1
Update Date:2017-09-13
Keywords:

Solution Type  Technical Instruction Sure

Solution  1509311.1 :   How to isolate disk problems on an Adaptec RAID controller (Cougar)  


Related Items
  • Sun Netra T5440 Server
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>CMT>SN-SPARC: T5xx0
  •  




Created from <SR 3-6405677831>

Applies to:

Sun Netra T5440 Server - Version Not Applicable and later
Information in this document applies to any platform.

Goal

 Determine how to isolate disk issues on an Adaptec RAID controller (Cougar)

Solution

Explorer message files & console logs are not usually helpful to isolate disk failures when placed in a hardware RAID volume.  Explorer 6.5 collects Adaptec RAID controller event logs (event_1.out), configuration file (getconfig_1.out), & other files in the RAIDmanager directory.  Explosum will parse these files to provide a summary of the configuration & problems.

If the customer has an older version of explorer or does not wish to submit an explorer, please request output from the following commands:

   cd /opt/StorMan
   ./arcconf getversion
   ./arcconf getstatus 1
   ./arcconf getconfig 1
   ./arcconf getlogs 1 event
   ./arcconf getlogs 1 device

Also obtain files:

   /opt/StorMan/RaidEvt.log
   /opt/StorMan/RaidEvtA.log
   /opt/StorMan/RaidErr.log
   /opt/StorMan/RaidErrA.log

If the boot disk is affected, then you must boot via the Live CD which is obtained by Intel's site.  Please choose the correct executable for the system & burn to CD.

The configuration file contains the following information:

   Logical Dev: 0  Simple_volume  Optimal  285686 MB  Present (0,8)       Bootable
   Logical Dev: 1  Simple_volume  Optimal  285686 MB  Present (0,9)    
   Logical Dev: 2  5  Optimal  857078 MB  Present (0,10 0,11 0,12 0,13)   
   Logical Dev: 3  5  Optimal  857078 MB  Present (0,14 0,15 0,16 0,17) 
     
   Phys Dev: 0  Online       0,8  SEAGATE  ST930003SSUN300G  0868  000949719S9K      286102 MB 
   Phys Dev: 1  Online       0,9  SEAGATE  ST930003SSUN300G  0868  000949718QTX      286102 MB     
   Phys Dev: 2  Online      0,10  SEAGATE  ST930003SSUN300G  0868  00094970Y0JG      286102 MB     
   Phys Dev: 3  Online      0,11  SEAGATE  ST930003SSUN300G  0868  000949702LKZ      286102 MB     
   Phys Dev: 4  Online      0,12  SEAGATE  ST930003SSUN300G  0868  000949702LD0     286102 MB      
   Phys Dev: 5  Online      0,13  SEAGATE  ST930003SSUN300G  0868  000949702L62     286102 MB     S.M.A.R.T.   
   Phys Dev: 6  Online      0,14  SEAGATE  ST930003SSUN300G  0868  00094970XZH3     286102 MB      
   Phys Dev: 7  Online      0,15  SEAGATE  ST930003SSUN300G  0868  000949702LKX      286102 MB     
   Phys Dev: 8  Online      0,16  SEAGATE  ST930003SSUN300G  0868  00094970XZFF      286102 MB     
   Phys Dev: 9  Online      0,17  SEAGATE  ST930003SSUN300G  0868  00094971A037      286102 MB     
   Phys Dev: 10  Hot Spare  0,18  SEAGATE  ST930003SSUN300G  0868  00094970XX39     286102 MB      
   Phys Dev: 11  Hot Spare  0,19  SEAGATE  ST930003SSUN300G  0868  00094970XZ66    286102 MB

Please notice that the info above contains more data, but is a summary of the configuration.  It also indicates if the volumes & physical devices are online or failed.  It also indicates if the controller determine that a device has encountered a SMART error, which predicts possible failure in the future.

 

The RaidEvt.log contains a daily list of installed devices & an event of disk failure, removal, & insertion, as follows:

   October 10, 2012 1:15:46 PM GMT ERR 401:A01C0S12L-- txslimsccf4 Failed drive: controller 1, enclosure 0, slot 4, S/N 00100271BM4V 3SE1BM4V (Vendor: SEAGATE Model: ST930003SSUN300G).
   October 16, 2012 6:33:58 AM GMT INF 408:A01C0S12L-- txslimsccf4 Physical drive removed: controller 1, enclosure 0, slot 4, S/N 00100271BM4V 3SE1BM4V.
   October 16, 2012 6:33:58 AM GMT INF 408:A01C0S12L-- txslimsccf4 Physical drive removed: controller 1, enclosure 0, slot 4, S/N 00100271BM4V 3SE1BM4V.
   October 16, 2012 6:34:47 AM GMT INF 407:A01C0S12L-- txslimsccf4 Physical drive added: controller 1, enclosure 0, slot 4, S/N 00113074NPJH 6SE4NPJH.
   October 16, 2012 6:34:47 AM GMT ERR 401:A01C0S12L-- txslimsccf4 Failed drive - Device will not spin up: controller 1, enclosure 0, slot 4, S/N 00113074NPJH 6SE4NPJH (Vendor: SEAGATE Model: ST930003SSUN300G).

These messages indicate that the disk in slot 4 has failed & was replaced 6 days later.  Please notice that a newly inserted drive usually creates a failure to spin up, which should be ignored.


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback