Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1543382.1
Update Date:2018-01-05
Keywords:

Solution Type  Technical Instruction Sure

Solution  1543382.1 :   Sun Storage 7000 Unified Storage System: How To check if a clustron card is faulted  


Related Items
  • Sun ZFS Storage 7420
  •  
  • Sun Storage 7110 Unified Storage System
  •  
  • Sun Storage 7210 Unified Storage System
  •  
  • Sun Storage 7410 Unified Storage System
  •  
  • Sun Storage 7310 Unified Storage System
  •  
  • Sun ZFS Storage 7320
  •  
  • Sun Storage 7720 Unified Storage System
  •  
Related Categories
  • PLA-Support>Sun Systems>DISK>ZFS Storage>SN-DK: 7xxx NAS
  •  


This document is intended to show when a faulted clustron card must be replaced

In this Document
Goal
Solution
References


Created from <SR 3-6888103048>

Applies to:

Sun Storage 7310 Unified Storage System - Version All Versions and later
Sun Storage 7720 Unified Storage System - Version All Versions and later
Sun ZFS Storage 7320 - Version All Versions and later
Sun Storage 7110 Unified Storage System - Version All Versions and later
Sun Storage 7410 Unified Storage System - Version All Versions and later
7000 Appliance OS (Fishworks)
Be able to check if a clustron card is faulted - this only happens when in cluster configuration

Goal

To discuss this information further with Oracle experts and industry peers, we encourage you to review, join or start a discussion in the My Oracle Support Community - Disk Storage ZFS Storage Appliance

Process to check if a clustron card is faulted - this only happens when in cluster configuration

 

Solution

NOTE: To confirm that the cluster 'links' cabling is correctly configured - See Document ID 2081179.1

 

Here are the symptoms we would observe in that situation :

 1.  From one of the cluster nodes,  the clustron card is shown as faulted using the appliance CLI command : maintenance hardware show

slot-001     PCIe 0                 faulted   Sun Microsystems, Inc.  Fishworks CLUSTRON 100                                    unknown                                                                                                                          
On the other cluster node :

slot-001     PCIe 0                 ok        Sun Microsystems, Inc.  Fishworks CLUSTRON 100                                    unknown                                          

Moreover, using the maintenance problems show CLI command, we would also find a message like :

> maintenance problems select problem-002 show                                                                
Properties:                                                                                                                         
                          uuid = 35dc30f6-787b-c2b7-c519-d768fe3b94fd                                                               
                          code = PCI-8000-CA                                                                                        
                     diagnosed = 2013-3-1 23:24:49                                                                                  
                   phoned_home = 2013-3-1 23:27:45                                                                                  
                      severity = Major                                                                                              
                          type = Fault                                                                                              
                           url = http://sun.com/msg/PCI-8000-CA
                   description = A device is failing to respond                                                                     
                        impact = Possible loss of services provided by the                                                          
                                 device instances associated with this fault                                                        
                      response = One or more device instances may be disabled                                                       
                        action = Contact your service provider for proper                                                           
                                 repair procedures.     

Components:                                                                                                                         
 
component-000   33%  SSI-ST-TC7310-1-upper: PCIe 0 (faulted)                                                                        
                     Manufacturer: Sun Microsystems, Inc.                                                                           
                     Part number: 371-3024                                                                                          
                     Model: Fishworks CLUSTRON 100                                                                                  
component-001   33%  SSI-ST-TC7310-1-upper: PCIe 0 (degraded)                                                                       
                     Manufacturer: Sun Microsystems, Inc.                                                                           
                     Part number: 371-3024                                                                                          
                     Model: Fishworks CLUSTRON 100                                                                                  
component-002   33%  SSI-ST-TC7310-1-upper: PCIe 0 (degraded)                                                                       
                     Manufacturer: Sun Microsystems, Inc.                                                                           
                     Part number: 371-3024                                                                                          
                     Model: Fishworks CLUSTRON 100   

In this situation, the customer should gather a supportbundle from each cluster node and contact Oracle support.

The support engineer should first try to clear the error using fma commands, since this may merely be a soft error reported on the PCI bus. Once the error has been cleared, monitor for a couple a days to check if the issue re-appears. In that case, the PCI card must be replaced.

 

References

<NOTE:1447284.1> - Sun Storage 7000 Unified Storage System: How to upgrade a clustered system

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback