Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1528030.1
Update Date:2017-09-13
Keywords:

Solution Type  Technical Instruction Sure

Solution  1528030.1 :   How to Replace a SPARC T5-4 or T5-8 Server Main Module Motherboard:ATR:1528030.1:2  


Related Items
  • Oracle SuperCluster T5-8 Full Rack
  •  
  • SPARC T5-8
  •  
  • Oracle SuperCluster T5-8 Half Rack
  •  
  • SPARC T5-4
  •  
  • Oracle SuperCluster T5-8 Hardware
  •  
  • Oracle Exalytics T5-8
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: SPARC-CAP VCAP
  •  




In this Document
Goal
Solution
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: FRU CAP

Applies to:

Oracle SuperCluster T5-8 Hardware - Version All Versions and later
SPARC T5-8 - Version All Versions and later
Oracle SuperCluster T5-8 Full Rack - Version All Versions and later
Oracle SuperCluster T5-8 Half Rack - Version All Versions and later
SPARC T5-4 - Version All Versions and later
Information in this document applies to any platform.

Goal

How to remove and replace a SPARC T5-4 or T5-8 Server Main Module Motherboard

*************************************************************
To report errors or request improvements on this procedure,
please Add a comment on Doc ID: 1528030.1
*************************************************************

Solution

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED:
Qualified service technicians with SPARC T5-4 / T5-8 experience/training.

Time Estimate: 120 minutes

Task Complexity: 2

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS

PROBLEM OVERVIEW: replace a faulty SPARC T5-4 or T5-8 Main Module Motherboard

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :
The server must be fully shut down and the power cords disconnected.

 

ESD Caution:

  • Circuit boards and drives contain electronic components that are extremely sensitive to static electricity. Ordinary amounts of static electricity from clothing or the work environment can destroy the components located on these boards. Do not touch the components along their connector edges.
  • Use a Antistatic Wrist strap. Attach one end of the strap to your wrist and the other end to the chassis, depending on what type of strap you use, with the adhesive end or the metal plug.
  • Use an Antistatic Mat. Place ESD-sensitive components such as motherboards, memory, and other PCBs on an antistatic mat

 

 

DAMAGE ALERT: Perform a visual inspection of the replacement part to make sure that there are no damaged components, connectors, bent pins, damaged packages during shipping, etc. If the part is damaged, don't install it into the system, order a new part. Handle with caution and package carefully the return part just as the new part was packaged to avoid any damages during shipping.
NOTE: The LDOM configuration data (if any exists) needs to be saved before motherboard replacement to avoid loss of LDOM configuration, refer to doc 1019720.1 for details.


WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE:

Main Module Motherboard Overview

When replacing the main module motherboard, remove the service processor and System Configuration PROM from the old motherboard and install these components on the new motherboard. The service processor contains the Oracle ILOM server configuration data and the System Configuration PROM contains the server host ID and MAC address. Transferring these components will preserve the server-specific information stored on these modules.

System firmware consists of two components: a service processor component and a host component. The service processor component is located on the service processor and the host component is located on the motherboard. In order for the server to operate correctly, these two components must be compatible.

After replacing the motherboard, the host firmware on the motherboard might be incompatible with the service processor firmware on the service processor that you transferred to the new motherboard. In this case, the server firmware must be loaded as described in Install the Main Module Motherboard.

 

The main module motherboard is a cold-service component that can be replaced only by authorized service personnel.

1. Remove the main module from the server.

   Squeeze the release latches together on the two extraction levers, and pull the extraction levers out to disengage the main module from the server.

   Pull the main module halfway out of the server. Press the levers back toward the center of the main module. This will keep the levers from being damaged when the main module is outside
   the server.

  

Caution – Due to the weight of the main module, the following step requires two people to perform. Do not attempt to lift the main module alone.

   Remove the main module completely from the server.

   Remove the cover from the main module. Press the green button near the front edge of the cover and slide the cover back and up off the main module.


2. Make a note of the locations of the components in the main module before removing any of them.

The server software keeps track of the location of some components, such as the hard drives and RAID expansion modules, so you might have to reinstall the software on some components if they are moved to different locations in the main module. Make a note of the location of the components in the main module and install these components in the same locations in the new main module so that you do not have to reinstall software on any of these components.

3. Take the necessary ESD precautions.

4. Remove the following components from the main module:

- Remove a Hard Drive
- Remove the Service Processor Card
- Remove the System Battery
- Remove the Front I/O Assembly
- Remove a Storage Backplane

5. Remove the system configuration PROM from the faulty motherboard.

NOTE: Put the system configuration PROM aside to install it on the replacement motherboard.

B) Install the Main Module Motherboard

1. Install the system configuration PROM from the old motherboard onto the replacement motherboard.

2. Replace these components on the main module:

- Install a Storage Backplane
- Install the Front I/O Assembly
- Install the System Battery
- Install the Service Processor Card
- Install a Hard Drive

NOTE: Install the components in the same slots that you removed them from in the old main module to keep from having to reinstall software on these components.

Caution – Due to the weight of the main module, the following step requires two people to perform. Do not attempt to lift the main module alone.

3. Insert the main module back into the server.

    Open the levers so that they are fully open.

    Insert the main module into its slot in the server until the levers begin to engage.

    Press the levers back together, toward the center of the module, then press the levers firmly against the module to fully seat the module back into the server.

4. Connect a terminal or a terminal emulator (PC or workstation) to serial management port.

If the service processor detects that the new host firmware component is incompatible with service processor firmware component, further action will be suspended and the following message will be delivered over the serial management port.

Unrecognized Chassis: This module is installed in an unknown or unsupported chassis. You must upgrade the firmware to a newer version that supports this chassis.

If you see this message, go on to Step 5.

5. Download the system firmware.

a. If needed, configure the service processor’s network port to enable the firmware image to be downloaded.
b. Download the system firmware. You can load any supported system firmware version, including the firmware revision that had been installed prior to the replacement of the motherboard. However, Oracle strongly recommends installing the newest version of the system firmware.

Follow the related instructions in the Oracle ILOM documentation, available at the following link: http://docs.oracle.com/cd/E37444_01/

NOTE: In some rare cases when updating the SysFW via CLI (and during CLI execution), it may fail with the following error:

Hostname: <your hostname>

-> load -source tftp://xxx.xxx.xxx.xxx/Sun_System_Firmware-9_4_2_e-SPARC_T5-4+T5-8.pkg
Cannot read power state

load: Command Failed

If that happens, use the below steps (confirmed by FSEs to be a successful workaround to resolve the issue):

- Returning original part
- Install the last firmware with the old part
- Install again the new part
- Try to re-flash the last firmware

NOTE: part could be either MM, MB or SP

This failure is due to bug 24804437, it is just filed to get engineering attention and assist us in determining the technical issue or perhaps provide an alternative/appropriate solution.

Please add a defect to bug 24804437 in your SR.

C) Verify Main Module Motherboard Functionality

NOTE: A limited number of spare Main Module Motherboards may require a LSI NVDATA FW Update. The update is available via the MOS patched listed below. Details are in the README.

T5-4 Patch 16801172

T5-8 Patch 16801179

1. Verify that the OK LED is lit on the main module and that the Fault LED is not lit.

2. Verify that the front and rear Service Required LEDs are not lit.

3. If the previous steps indicate that no faults have been detected, then the main module motherboard has been replaced successfully.

NOTE: Re-activate any RAID volumes that existed prior to replacing the motherboard. Only perform this task if your system had RAID volumes prior to replacing the motherboard. For details refer to doc 1387771.1 and the SPARC T5-4 and SPARC T5-8 Service Manual

4. Set ILOM date/time  (-> set /SP/clock datetime=MMDDhhmmYYYY.ss)

NOTE: In case the FW upgrade via ILOM BUI fails, please use the CLI method (see example in doc 2019500.1). You may need a source of TFTP in order to upgrade firmware via CLI commands, for details refer to: Solaris[TM] Operating System: How to set-up a TFTP server. (Doc ID 1004474.1). See also doc 2076315.1 in References.


OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTION DOES THE FE/ADMINISTARTOR NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:
Boot system and monitor boot sequence for errors. Test functionality of system:
1. Run the Solaris "fmadm faulty" and SP/ILOM "show faulty" command (if only ALOM is supported run "showfaults -v" command) to verify that the fault has been cleared.
2. Perform one of the following tasks based on your verification results:
   * If the previous steps did not clear the fault, refer to doc 1004229.1 for information about the tools and methods you can use to diagnose and clear
     component faults.
   * If the previous steps indicate that no faults have been detected, the component has been replaced successfully. No further action is required
3. Restart software applications per applicable administration guides to resume system operation.

PARTS NOTE:
refer to:  https://support.oracle.com/handbook_partner/Systems/SPARC_T5_4/components.html#SystemBoard

             https://support.oracle.com/handbook_partner/Systems/SPARC_T5_8/components.html#SystemBoard



REFERENCE INFORMATION:
SPARC T5-4 Service Manual: http://docs.oracle.com/cd/E29659_01/pdf/E29663.pdf

SPARC T5-8 Service Manual: https://docs.oracle.com/cd/E35078_01/html/E35082/index.html

See also:  Oracle Integrated Lights Out Manager (ILOM) 3.2.1 Documentation Collection: http://docs.oracle.com/cd/E37444_01/


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback