Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1526068.1
Update Date:2018-03-29
Keywords:

Solution Type  Technical Instruction Sure

Solution  1526068.1 :   How to Remove and Replace SPARC T5-2 Memory Risers and/or DIMMS:ATR:1526068.1:0  


Related Items
  • SPARC T5-2
  •  
  • SPARC T5-2
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: SPARC-CAP VCAP
  •  




In this Document
Goal
Solution


Applies to:

SPARC T5-2 - Version All Versions and later
Information in this document applies to any platform.

Goal

How to Remove and Replace SPARC T5-2 Memory Risers and DIMM

*********************************************************************
To report errors or request improvements on this procedure,
please Add a comment on Doc ID: 1526068.1
*********************************************************************

Solution

DISPATCH INSTRUCTIONS

WHAT SKILLS ARE REQUIRED?: No special skills required, Customer Replaceable Unit (CRU) procedure

Time Estimate: 180 minutes

Task Complexity: 0

REMOVAL/REPLACEMENT INSTRUCTIONS:

PROBLEM OVERVIEW: Memory Risers and DIMMs replacing.

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

ESD Caution:

  • Circuit boards and drives contain electronic components that are extremely sensitive to static electricity. Ordinary amounts of static electricity from clothing or the work environment can destroy the components located on these boards. Do not touch the components along their connector edges.
  • Use a Antistatic Wrist strap. Attach one end of the strap to your wrist and the other end to the chassis, depending on what type of strap you use, with the adhesive end or the metal plug.
  • Use an Antistatic Mat. Place ESD-sensitive components such as motherboards, memory, and other PCBs on an antistatic mat

 

 

DAMAGE ALERT: Perform a visual inspection of the replacement part to make sure that there are no damaged components, connectors, bent pins, damaged packages during shipping, etc. If the part is damaged, don't install it into the system, order a new part. Handle with caution and package carefully the return part to avoid any damages during shipping.


The customer should conduct an orderly software system shutdown. Then power down system and disconnect the power cords. A data backup is not a pre-requisite but is a wise precaution.

WHAT ACTIONS ARE REQUIRED?:

The server includes eight memory risers. Four DIMM slots are on each memory riser. Four memory risers are associated with each CMP in the server. A label is next to next memory riser that shows the number of the CMP and of the riser.

NOTE: The server fails to boot unless all memory riser slots are populated.

DIMM FRU names are based on the location of the memory riser in the server and the DIMM slot on the memory riser. For example, the full FRU name for the top-most DIMM slot (BOB1/CH0/D0) on the first memory riser (CM0/CMP/MR0) is:

/SYS/MB/CM0/CMP/MR0/BOB1/CH0/D0

Memory Riser and DIMM Population Rules

The memory riser configuration rules for the server are as follows:
    - The server contains eight memory risers. Four memory risers are supported per CPU.
    - Each of the eight memory riser slots in the server must be filled with a memory riser.

The DIMM configuration rules for each DIMM are as follows:
    - Each slot on a memory riser must be filled with a DIMM or a DIMM filler panel.
    - All memory risers assigned to a CPU must contain the same type and number of DIMMs.
    - In a typical configuration, DIMMs are installed in all four slots on every memory riser.
    - If memory risers are half-populated, DIMMs must be in the two slots with a black slot body.
    - If memory risers are quarter-populated, a DIMM must be in the slot with a black ejector and a black slot body.

 

Locate a Failed DIMM (LEDs)

Each memory riser has a Remind button, a Power LED, and Fault LEDs adjacent to each DIMM. This procedure describes how to identify a faulty DIMM using these buttons and LEDs.

1. Press the System Remind button to identify the memory riser that contains the faulty DIMM.
2. Lift and remove the faulty memory riser.
3. Press the Remind button on the memory riser to identify the faulty DIMM. An amber Fault LED will light next to the faulty DIMM.

NOTE: The front and rear panel Service Required LEDs are also lit when the system detects a DIMM fault.

 

Locate a Failed DIMM (Oracle ILOM)

The Oracle ILOM show faulty command displays current system faults, including DIMM failures:

1. Type show faulty at the -> prompt:

-> show faulty
Target                 | Property              | Value
--------------------+---------------------+-------------------
/SP/faultmgmt/0  | fru                      | /SYS/MB/CM0/CMP/MR1/BOB1/CH0/D0
/SP/faultmgmt/0  | timestamp           | Dec 21 16:40:56
/SP/faultmgmt/0/ | timestamp           | Dec 21 16:40:56 faults/0
/SP/faultmgmt/0/ | sp_detected_fault | /SYS/MB/CM0/CMP/MR1/BOB1/CH0/D0
faults/0               |                           | Forced fail(POST)

2. Locate the DIMM that corresponds to the listed name.

In this example, /SYS/MB/CM0/CMP/MR1/BOB1/CH0/D0 indicates the memory riser that is second farthest from the power supplies and the DIMM in a slot with white ejector tabs and a black slot.

 

Remove a Memory Riser and DIMM

This procedure can be performed by customers. The system must be completely powered down before performing this procedure.

CAUTION: These procedures require that you handle components that are sensitive to ESD. This sensitivity can cause the component to fail. To avoid damage, ensure that you follow antistatic practices as described in ESD Measures.

1. Prepare for servicing:
     a. Attach an antistatic wrist strap.
     b. Power off the server and unplug power cords from the power supplies.
     c. Extend the server to the maintenance position.
     d. Remove the top cover.
2. Identify the memory riser with the faulty DIMM by pressing the blue "Fault Remind button" located on the air divider.
     - If the memory riser Service Action Required LED is off, all DIMMs on this riser are operating properly.
     - If the memory riser Service Action Required LED is on (amber), one or more of the DIMMs installed on this riser is faulty or misconfigured
3. Lift the memory riser that has its Service Action Required LED lit straight up to remove the memory riser from the memory module socket.
4. Identify the faulty or misconfigured DIMM(s) by pressing the Remind button on the memory riser.
5. On DIMMs that display an amber Fault LED, remove the DIMMs.
     a. Press down both DIMM slot ejector tabs as far as they will go.
     b. Carefully lift the DIMM straight up.

CAUTION: Whenever you remove a memory riser or DIMM, you should replace it with another memory riser or a DIMM or a filler panel; otherwise, the server might overheat due to improper airflow.

Install a DIMM and a Memory Riser

1. Attach an antistatic wrist wrap and unpack the DIMMs and place them on an antistatic mat.
2. Install the DIMMs into the memory riser by performing the following tasks.
     a. Ensure that the ejector levers at both ends of the memory module slot are in a fully open position.
     b. Align each DIMM with the empty connector slot, aligning the notch in the DIMM with the key in the connector.
        The notch ensures that the DIMM is oriented correctly.
     c. Gently press the DIMM into the slot until the ejector tabs lock the DIMM in place.
        Repeat these steps until each DIMM has been installed.
3. Push the memory riser module into the associated CPU memory riser slot until the riser module locks in place.
4. Return the server to operation:
     a. Install the top cover.
     b. Return the server to the normal rack position.
     c. Reinstall the power cords to the power supplies and power on the server.
5. The following link to the T5-2 Service Guide, can be used as a guideline for verifying Replacement DIMMs: http://docs.oracle.com/cd/E28853_01/html/E28854/z40012f81428990.html#scrolltoc

 

OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTIONS ARE REQUIRED TO RETURN THE SYSTEM TO AN OPERATIONAL STATE?:
Boot system and monitor boot sequence for errors. Test functionality of system:
1. Run the Solaris "fmadm faulty" and SP/ILOM "show faulty" command to verify that the fault has been cleared.
2. Perform one of the following tasks based on your verification results:
   * If the previous steps did not clear the fault, refer to doc 1004229.1 for information about the tools and methods you can use to diagnose and clear
     component faults.
   * If the previous steps indicate that no faults have been detected, the component has been replaced successfully. No further action is required
3. Restart software applications per applicable administration guides to resume system operation.

PARTS NOTE:

https://support.oracle.com/handbook_private/Systems/SPARC_T5_2/components.html#Memory


REFERENCE INFORMATION:

SPARC T5-2 Service Manual:  http://docs.oracle.com/cd/E28853_01/html/E28856/index.html

Oracle Integrated Lights Out Manager 3.2 Documentation
http://docs.oracle.com/cd/E37444_01/


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback