Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-71-1956844.1
Update Date:2018-01-11
Keywords:

Solution Type  Technical Instruction Sure

Solution  1956844.1 :   How to Replace a SPARC T7-2 Memory Riser and/or DIMM [VCAP]  


Related Items
  • SPARC T7-2
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: SPARC-CAP VCAP
  • Microlearning>Video>ML-VID-VCAP

In this Document
Goal
Solution
References


Applies to:

SPARC T7-2 - Version All Versions and later
Information in this document applies to any platform.

Goal

How to Replace a SPARC T7-2 Memory Riser and/or DIMM

Solution





 

**************************************************************************************
To report errors or request improvements on this procedure, please add a comment on Doc ID: 1956844.1
**************************************************************************************

ESD Caution:

  • Circuit boards and drives contain electronic components that are extremely sensitive to static electricity. Ordinary amounts of static electricity from clothing or the work environment can destroy the components located on these boards. Do not touch the components along their connector edges.
  • Use an antistatic wrist strap. Attach one end of the strap to your wrist and the other end to the chassis, using the adhesive end or the metal plug, depending on the type of strap.
  • Use an antistatic mat. Place ESD-sensitive components such as motherboards, memory, and other PCBs on an antistatic mat.

Contamination Caution:

  • Dust particles from packaging material are the number one cause of datacenter contamination. Remove all packaging material except the ESD-safe packaging while you are still outside the datacenter.

 

DISPATCH INSTRUCTIONS

WHAT SKILLS ARE REQUIRED?: No special skills required, Customer Replaceable Unit (CRU) procedure

Time Estimate: 30 minutes

Task Complexity: 0

REMOVAL/REPLACEMENT INSTRUCTIONS:

PROBLEM OVERVIEW: SPARC T7-2 Memory Riser and/or DIMM Replacement

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

DAMAGE ALERT: Visually inspect the replacement part for damaged components, damaged connectors, bent pins, packages damaged during shipping, and so on. If the part is damaged, do not install it in the system; order a new part. Handle the return part with caution and package it carefully to avoid damage during shipping.
  • Until further notice the T7-2 Memory Riser must be submitted through the CPAS process. The FE may not always see the CPAS note, so please make sure you alert the FE to add it in the task too. Note: SPARC T7-2 Memory Risers (P/N: 7319165) are CRU parts, but need to be dispatched with an FE on-site task and instructions to have the FE CPAS the part.
  • Refer to doc 2076330.1 for details: Mandatory NCAT/CPAS for Specific SPARC T8 Series Servers FRU's/CRU's

The customer should conduct an orderly software system shutdown, then power down the system and disconnect the power cords. A data backup is not a prerequisite but is a wise precaution.

NOTE: For ALL scenarios where an AC power down or AC power cycle is required for a T7-x server, please always use the steps in doc 1571054.1 prior to physically removing AC power cables from the server.

 
WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE:

The server includes eight memory risers. Each memory riser has four DIMM slots. Four memory risers are associated with each CMP in the server. A label next to each memory riser shows the number of the CMP and of the riser.

NOTE: The server fails to boot unless all memory riser slots are populated.
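
The riser and slot counts above imply the totals worked out in the sketch below, which also shows a simple bookkeeping check for the rule that every riser slot must be populated. The (cmp_index, riser_index) pairs and the function names are hypothetical and used only for counting; they are not the numbers printed on the service labels.

```python
# Figures stated in the procedure text.
TOTAL_RISERS = 8
DIMM_SLOTS_PER_RISER = 4
RISERS_PER_CMP = 4

# Derived totals.
cmp_count = TOTAL_RISERS // RISERS_PER_CMP                   # CMPs in the server
total_dimm_slots = TOTAL_RISERS * DIMM_SLOTS_PER_RISER       # DIMM slots in the server
dimm_slots_per_cmp = RISERS_PER_CMP * DIMM_SLOTS_PER_RISER   # DIMM slots behind one CMP


def missing_risers(installed):
    """Return the riser positions that are not installed.

    ASSUMPTION: positions are zero-based (cmp_index, riser_index) pairs
    invented for this sketch, not the official riser numbering.
    """
    expected = {(c, r) for c in range(cmp_count) for r in range(RISERS_PER_CMP)}
    return sorted(expected - set(installed))


def all_riser_slots_populated(installed):
    """True when no riser position is empty (the stated condition for booting)."""
    return not missing_risers(installed)
```

For example, a set of positions with one riser left out makes `missing_risers` return that single position, which is the riser to reinstall before powering on.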

DIMM FRU names are based on the location of the memory riser in the server and the DIMM slot on the memory riser. For example, the full FRU name for the top-most DIMM slot (BOB1/CH0/D0) on the first memory riser (CM0/CMP/MR0) is:

/SYS/MB/CM0/CMP/MR0/BOB1/CH0/D0
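
As an illustration of how such a FRU name breaks down, the following sketch splits a DIMM FRU name into its components. The pattern is inferred only from the example name above, and the function names `parse_dimm_fru` and `riser_fru` are hypothetical; names with a different layout are rejected rather than guessed at.

```python
import re

# ASSUMPTION: pattern inferred from the single example FRU name in the text.
_DIMM_FRU = re.compile(
    r"^/SYS/MB/CM(?P<cm>\d+)/CMP/MR(?P<riser>\d+)"
    r"/BOB(?P<bob>\d+)/CH(?P<channel>\d+)/D(?P<dimm>\d+)$"
)


def parse_dimm_fru(name):
    """Split a DIMM FRU name into integer fields; raise ValueError if it does not match."""
    match = _DIMM_FRU.match(name.strip())
    if match is None:
        raise ValueError(f"not a recognized DIMM FRU name: {name!r}")
    return {key: int(value) for key, value in match.groupdict().items()}


def riser_fru(name):
    """Return the memory riser portion of a DIMM FRU name."""
    parts = parse_dimm_fru(name)
    return f"/SYS/MB/CM{parts['cm']}/CMP/MR{parts['riser']}"
```

Calling `riser_fru("/SYS/MB/CM0/CMP/MR0/BOB1/CH0/D0")` yields `/SYS/MB/CM0/CMP/MR0`, the riser to look for on the service labels.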

 

Locate a Failed DIMM (LEDs)

Each memory riser has a Remind button, a Power LED, and Fault LEDs adjacent to each DIMM. This procedure describes how to identify a faulty DIMM using these buttons and LEDs.

1. Press the System Remind button to identify the memory riser that contains the faulty DIMM.
2. Remove the faulty memory riser: loosen the captive screw that secures the memory riser to the chassis, open the latch, and lift the memory riser straight up out of the memory riser socket. A 2.5 mm hex tool is needed.
3. Press the Remind button on the memory riser to identify the faulty DIMM. An amber Fault LED will light next to the faulty DIMM.

NOTE: The front and rear panel Service Required LEDs are also lit when the system detects a DIMM fault.

 

Locate a Failed DIMM (Oracle ILOM)

The Oracle ILOM show faulty command displays current system faults, including DIMM failures:

1. Type show faulty at the -> prompt:

-> show faulty
Target              | Property          | Value
--------------------+-------------------+---------------------------------
/SP/faultmgmt/0     | fru               | /SYS/MB/CM0/CMP/MR1/BOB1/CH0/D0
/SP/faultmgmt/0     | timestamp         | Dec 21 16:40:56
/SP/faultmgmt/0/    | timestamp         | Dec 21 16:40:56
faults/0            |                   |
/SP/faultmgmt/0/    | sp_detected_fault | /SYS/MB/CM0/CMP/MR1/BOB1/CH0/D0
faults/0            |                   | Forced fail(POST)

2. Locate the DIMM that corresponds to the listed name.

In this example, /SYS/MB/CM0/CMP/MR1/BOB1/CH0/D0 identifies the memory riser that is fourth farthest from the power supplies, and the DIMM in the black slot with white ejector tabs.

NOTE: Unlike the T5-2, the memory risers are not numbered sequentially. See the service labels.
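
When the show faulty output has been captured to a text file or a ticket, the faulted FRU names can be pulled out with a few lines of script. The sketch below assumes the three-column, pipe-separated layout shown in the example above; `faulty_frus` and `SAMPLE` are hypothetical names, and output in any other layout simply yields an empty list.

```python
# Abbreviated sample in the same layout as the example output above.
SAMPLE = """\
-> show faulty
Target              | Property          | Value
--------------------+-------------------+---------------------------------
/SP/faultmgmt/0     | fru               | /SYS/MB/CM0/CMP/MR1/BOB1/CH0/D0
/SP/faultmgmt/0     | timestamp         | Dec 21 16:40:56
"""


def faulty_frus(show_faulty_output):
    """Return the distinct values of every 'fru' row, in order of appearance.

    ASSUMPTION: rows are 'Target | Property | Value'; lines that do not
    split into exactly three columns (prompt, separator) are ignored.
    """
    frus = []
    for line in show_faulty_output.splitlines():
        columns = [column.strip() for column in line.split("|")]
        if len(columns) == 3 and columns[1] == "fru" and columns[2] not in frus:
            frus.append(columns[2])
    return frus


if __name__ == "__main__":
    for fru in faulty_frus(SAMPLE):
        print(fru)
```

The returned name still has to be matched against the service labels, since the risers are not numbered sequentially.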

 

Remove a Memory Riser and DIMM

This procedure can be performed by customers. The system must be completely powered down before performing this procedure.

CAUTION: These procedures require that you handle components that are sensitive to ESD. This sensitivity can cause the component to fail. To avoid damage, ensure that you follow antistatic practices as described in ESD Measures.

1. Prepare for servicing:
     a. Attach an antistatic wrist strap.
     b. Power off the server and unplug power cords from the power supplies.
     c. Extend the server to the maintenance position.
     d. Remove the top cover.
2. Identify the memory riser with the faulty DIMM by pressing the blue "Fault Remind button" located on the air divider.
     - If the memory riser Service Action Required LED is off, all DIMMs on this riser are operating properly.
     - If the memory riser Service Action Required LED is on (amber), one or more of the DIMMs installed on this riser is faulty or misconfigured.
3. Loosen the captive screw that secures the memory riser to the chassis. Open the latch and lift the memory riser straight up to remove it from the memory riser socket. A 2.5 mm hex tool is needed.
4. Identify the faulty or misconfigured DIMM(s) by pressing the Remind button on the memory riser.
5. Remove each DIMM that displays an amber Fault LED:
     a. Press down both DIMM slot ejector tabs as far as they will go.
     b. Carefully lift the DIMM straight up.

CAUTION: Whenever you remove a memory riser or DIMM, replace it with another memory riser, DIMM, or filler panel; otherwise, the server might overheat due to improper airflow.

Install a DIMM and a Memory Riser

1. Attach an antistatic wrist strap, unpack the DIMMs, and place them on an antistatic mat.
2. Install the DIMMs into the memory riser by performing the following tasks.
     a. Ensure that the ejector levers at both ends of the memory module slot are in a fully open position.
     b. Align each DIMM with the empty connector slot, aligning the notch in the DIMM with the key in the connector.
        The notch ensures that the DIMM is oriented correctly.
     c. Gently press the DIMM into the slot until the ejector tabs lock the DIMM in place.
        Repeat these steps until each DIMM has been installed.
3. Push the memory riser module into the associated CPU memory riser slot until the riser module locks in place.
4. Return the server to operation:
     a. Install the top cover.
     b. Return the server to the normal rack position.
     c. Reinstall the power cords to the power supplies and power on the server.
5. Use the SPARC T7-2 Server Service Manual (listed under REFERENCE INFORMATION) as a guideline for verifying the replacement DIMMs.

OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:
Boot system and monitor boot sequence for errors. Test functionality of system:
1. Run the Solaris "fmadm faulty" command and the SP/ILOM "show faulty" command to verify that the fault has been cleared.
2. Perform one of the following tasks based on your verification results:
   * If the previous steps did not clear the fault, refer to doc 1004229.1 for information about the tools and methods you can use to diagnose and clear component faults.
   * If the previous steps indicate that no faults have been detected, the component has been replaced successfully. No further action is required.
3. Restart software applications per applicable administration guides to resume system operation.
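
The decision in step 2 can be sketched as a small check over the captured text of both commands. This is a heuristic only: it assumes the faulted FRU name appears verbatim in the output while the fault is active, which matches the ILOM example earlier in this document but is an assumption for "fmadm faulty". The function names are hypothetical, and the check looks only for the replaced FRU, not for unrelated faults.

```python
def fault_still_reported(fmadm_output, ilom_output, fru):
    """True if the FRU name still appears in either captured command output.

    ASSUMPTION: an active fault lists the FRU name verbatim in the output.
    """
    return fru in fmadm_output or fru in ilom_output


def next_step(fmadm_output, ilom_output, fru):
    """Map the verification result onto the two outcomes listed in step 2."""
    if fault_still_reported(fmadm_output, ilom_output, fru):
        return "Fault still reported: refer to doc 1004229.1 to diagnose and clear it."
    return "No fault reported for this FRU: replacement successful, no further action."


if __name__ == "__main__":
    replaced = "/SYS/MB/CM0/CMP/MR1/BOB1/CH0/D0"
    # Paste the captured command output into these two strings.
    print(next_step("", "", replaced))
```

Reading both outputs in full remains the authoritative check, since a different component may have faulted during the service action.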

PARTS NOTE:

SPARC T7-2 Server  https://support.oracle.com/handbook_private/Systems/SPARC_T7_2/components.html#Memory


REFERENCE INFORMATION:

SPARC T7-2 Server Service Manual http://docs.oracle.com/cd/E54983_01/html/E54987/index.html



Oracle Integrated Lights Out Manager 3.2 Documentation
http://docs.oracle.com/cd/E37444_01/

 

 

 


References

<NOTE:1571054.1> - Performing an AC power cycle on the T3/T4/T5/S7/T7/T8 Servers

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.