Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1606800.1
Update Date:2017-10-05
Keywords:

Solution Type  Technical Instruction Sure

Solution  1606800.1 :   Sun SPARC[TM] M5-32 and Sun SPARC[TM] M6-32 servers: Field Replaceable Unit (FRU) Replacement Methods  


Related Items
  • SPARC M5-32
  •  
  • SPARC M6-32
  •  
  • Oracle SuperCluster M6-32 Hardware
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>Enterprise>SN-SPARC: Mx-32
  •  




Applies to:

SPARC M5-32 - Version All Versions and later
SPARC M6-32 - Version All Versions and later
Oracle SuperCluster M6-32 Hardware - Version All Versions and later
Information in this document applies to any platform.

Goal

The purpose of this document is to summarize the Field Replacable Units (FRUs) in the SPARC Mx-32 hardware platform, and for each FRU list its type of replacement and if needed an explanatory note. The source of information for this document is the SPARC M5-32 and SPARC M6-32 Servers Service Manual . For more details related to servicing the Mx-32 hardware in general, and a specific FRU in detail, please visit the relevant sections of the SPARC M5-32 and SPARC M6-32 Servers Service Manual. This document can be used as a quick reference to find out which service method is to be used,  given the system configuration and the FRU which requires service. If this document causes doubt or confusion, the SPARC M5-32 and SPARC M6-32 Servers Service Manual is leading.

 

To discuss this information further with Oracle experts and industry peers, we encourage you to review, join or start a discussion in the My Oracle Support Community - M Series Servers

Solution


Types of FRU replacement

Field Replaceable Units (FRUs) on the SPARC M5-32 and SPARC M6-32 Servers can be replaced using one of two possible replacement methods:

Cold Replacement :

Each FRU can be replaced through Cold Service replacement. Cold service replacement involves the entire platform to be powered off, including the removal from AC input power to the platform.  If you do not power off the entire platform, then power off the Physical Domain (PDomain) that contains the FRU.  The table below contains a note for each FRU, showing which other FRUs and PDomains are affected by the replacement.

Hot Replacement :

A number of FRUs can be replaced through Hot Service replacement. Hot Service FRUs can be removed while the PDomain(s) are running. Hot-swappable FRUs do not require any preparation prior to servicing,  where as Hot-pluggable FRUs do require preparation prior to servicing. The method to prepare a FRU for removal can be different for each FRU. Preparation for removal includes one or more of pushing the Attention (ATTN) button, running software commands on the affected PDomains and running software commands on the active Service Processor (SP) . After physically replacing the FRU, it must be reconfigured back into its PDomain. The steps to reconfigure the FRU back into the PDomain include one or more of pushing the Attention (ATTN) button, running software commands on the affected PDomains and running software commands on the active Service Processor (SP). Any special consideration will be listed in a note for each FRU in the table below.

 

Before and after the actual hardware intervention

Before the actual FRU replacement takes places, it is the customer's responsability to take all necessary steps to make the FRU available for replacement.
What steps have to be taken depends on the chosen FRU replacement method. Hot FRU Replacement requires some preparation and sometimes requires one or more PDomains to be stopped, Cold FRU Replacement requires a lot of preparation and planning, and one of more PDomains need to be stopped.
How much time it takes to actually get tot the point where we can start the FRU replacement, or how much time it takes to stop and start affected PDomains, is very much depending on how the Mx-32 hardware is divided over the various DCU, and which DCUs make up which PDomain. Such stop and start times are out-of-scope for this document at this point in time, customers should have a fairly good idea as to how long it takes to stop and start their PDomains. Future version of this document might contain a few examples on PDomain stop and start times, but the best option will always be to rely on customer information.

What replacement method is available for this FRU ?

For each FRU, the document lists the FRU name, the available replacement methods, and if needed a specific note. If in doubt, please do consult the SPARC M5-32 and SPARC M6-32 Servers Service Manual, collaborate in  the My Oracle Support Community - M Series Servers or contact Oracle Support.

FRUReplacement methodChassis Outage NeededNotes

Power Signal Distribution Board (PSDB)

Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Power Supply (PS) Hot-Swappable No Replace 1 (one) PS at the time
Fan Module (FM) Hot-Pluggable No Stop FM before replacing, replace 1 (one) FM at the time
Scalability Switch Board (SSB) Hot-Pluggable No If SSB current_config_state = Enabled, all PDomains with expandable = true must be stopped prior to SSB replacement
Clock Board (CLK) Hot-Pluggable No Replace 1 (one) CLK at the time, only Standby CLK can be replaced, if faulty CLK is Active CLK, stop all PDomains  and failover CLK. Stop faulty CLK before replacing
Service Processor (SP) Hot-Pluggable No With PDomains running, only Standby SP can be replaced. Removing both SPs will shutdown the system
Input Output Switch Board (IOB) Hot-Pluggable No The PDomain using the IOB must be stopped before replacing the IOB
PCIe Hot-Plug Carrier (CAR) Hot-Pluggable No Turn off both CAR and its PICe IO Card before replacing, using the ATT button or the appropriate hotplug commands.   
Oracle SuperCluster M6-32 Hardware does not support PCIe card hot replacement (aka hotplug).  The PDomain which owns the PCIe slot must be shutdown to replace the PCIe carrier.
 
PCIe IO Card Hot-Pluggable No

Turn off both CAR and its PICe IO Card before replacing, using the ATT button or the appropriate hotplug commands.

Oracle SuperCluster M6-32 Hardware does not support PCIe card hot replacement (aka hotplug).  The PDomain which owns the PCIe slot must be shutdown to replace the PCIe card.
Sun Flash Accelerator  PCIe Card Cold No The PDomain using the Sun Flash card must be stopped before replacing the Sun Flash card.
EXpress Module SAS (EMS) Hot-Pluggable No Turn off the EMS before replacing, using the ATT button or the appropriate hotplug commands. If EMS cannot be turned off, the PDomain using it must be stopped.
Hard Disk Drive (HDD) Hot-Pluggable No Unconfigure the HDD before replacing, using the cfgadm command.
Solid State Drive (SSD) Hot-Pluggable No Unconfigure the HDD before replacing, using the cfgadm command.
Front LED Panel Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Upper Fan Cage Assembly Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Lower Fan Cage Assembly Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Scalability Card Cage Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Fan Power Cable Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Power System Cage Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Faulty AC Input Filter Hot-swappable No Before replacing AC input filter, remove the AC power cord. If in doubt which power cord to remove, stop all PDomains, remove all AC-input power, optionally disconnect all AC-input power cables.
Faulty AC Power Cord Hot-swappable No Before disconnecting the AC power cord from the AC input filter, power off the external circuit breaker for the AC power cord. If in doubt which power cord to remove, stop all PDomains, remove all AC-input power, optionally disconnect all AC-input power cables.
Rear LED Panel Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Service Processor Proxy (SPP) Hot-Pluggable No The PDomain using the SPP must be stopped before replacing the SPP. 
CPU Memory Unit (CMU) Hot-Pluggable No The PDomain using the CMU must be stopped before replacing or inserting the CMU.
CPU memory board filler panel Hot-Swappable No --
16GB/32GB Dimm Hot-Pluggable No The PDomain using the CMU which contains the Dimm, must be stopped before replacing the Dimm.
Input Output Unit (IOU) , a.k.a. I/O tower
Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Hard Disk Drive Cage Cold No The PDomain using the Hard Disk Drive cage must be stopped before replacing the Hard Disk Drive cage.
Scalability Assembly Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
IO Power Cable Assembly Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Scalability Midplane to PSDB Cable for Fans Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Scalability Midplane to PSDB Cable CABLE ASSY, SCMP-PSDB, LINK Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Scalability Card Cage Fans Cable Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
IO Data Cable Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Midplane Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
System Bus Bar Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Filler Panel Assembly Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Cabled Lower Bus Bar Assembly Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Side Panels Cold No Unless there is sufficient space around the server to remove the side panels, all PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables.
Front/Rear Cabinet Door Hot-Swappable No --
Scalability Assembly EMI plate Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Rear LED Panel Cable Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Battery in a Service Processor (SP) Hot-Pluggable No With PDomains running, only Standby SP can be replaced. Removing both SPs will shutdown the system
Battery in a Service Processor Proxy (SPP) Hot-Pluggable No The PDomain using the SPP must be stopped before replacing the SPP.
Cable Management Assembly Hot-Swappable No --
Bus Bar Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Midplane Power Cable Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables
Internal Link to Front LED Panel Cable Cold Yes All PDomains need to be stopped, remove all AC-input power, optionally disconnect all AC-input power cables

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback