Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-2056023.1
Update Date:2017-10-11
Keywords:

Solution Type  Technical Instruction Sure

Solution  2056023.1 :   M8-8 / M7-8 / M7-16 How to replace a Faulty DIMM [VCAP]  


Related Items
  • Oracle SuperCluster M7 Hardware
  •  
  • SPARC M7-16
  •  
  • SPARC M8-8
  •  
  • SPARC M7-8
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: SPARC-CAP VCAP
  •  
  • Microlearning>Video>ML-VID-VCAP
  •  




In this Document
Goal
Solution
 Determine Which DIMM Is Faulty 
 Remove the DIMM 
 Install the DIMM
 Install the CMIOU into the server
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: this is now a FRU

Applies to:

SPARC M7-16 - Version All Versions and later
Oracle SuperCluster M7 Hardware - Version All Versions and later
SPARC M8-8 - Version All Versions and later
SPARC M7-8 - Version All Versions and later
Information in this document applies to any platform.

Goal

CAP PROBLEM OVERVIEW: M8-8 / M7-8 / M7-16 DIMM in a CMIOU chassis - DIMM failure

*********************************************************************
To report errors or request improvements on this procedure, please go to
My Oracle Support, and put a comment on Doc ID: 2056023.1

*********************************************************************

ESD Caution:

  • Circuit boards and drives contain electronic components that are  extremely sensitive to static electricity. Ordinary amounts of static electricity from clothing or the work environment can destroy the components located on these boards. Do not touch the components along their connector edges.
  • Use a Antistatic Wrist strap. Attach one end of the strap to your wrist and the other end to the chassis, depending on what type of strap you use, with the adhesive end or the metal plug.
  • Use an Antistatic Mat. Place ESD-sensitive components such as motherboards, memory, and other PCBs on an antistatic mat.

 

Contamination Caution:

  • Dust particles of packaging material are number one cause of datacenter contamination. Make sure to remove all packaging material, up to the ESD safe packaging material, while still being outside the datacenter.

 

Solution

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE ENGINEER NEED: M8-8 / M7-8 / M7-16 product training

TASK COMPLEXITY: 2

TIME ESTIMATE: 90 minutes

HOT replacement

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? : The Physical Domain to which the CMIOU is assigned must be shut down (HOST stopped). 

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:

 

Determine Which DIMM Is Faulty 

1. Use one of these Oracle ILOM commands to display faulty components:

  1.  -> show faulty
  2.  -> show /System/Open_Problems
  3.  faultmgmtsp> fmadm faulty

2. Locate the CMIOU with the faulty DIMM by the CMIOU amber Service Required LED

3. Prepare the CMIOU with the faulty DIMM for removal, check M8-8 / M7-8 / M7-16 - How to replace a Faulty CMIOU in a CMIOU chassis (Doc ID 1951961.1) , "Preparing a CMIOU for Removal" for details

4. Remove the CMIOU with the faulty DIMM from the chassis, check M8-8 / M7-8 / M7-16 - How to replace a Faulty CMIOU in a CMIOU chassis (Doc ID 1951961.1) , "Removing a CMIOU" for details

5. Locate, press, and hold the blue Fault Remind button on the motherboard.

  1. An illuminated green Fault Remind Power LED indicates that there is power available to light the faulty DIMM LED. Any faulty DIMM is identified by an associated amber LED until you
    release the button. Note - If LEDs for two DIMMs light, replace both of the DIMMs.

5. Confirm that the DIMM next to the illuminated DIMM Fault LED is the same DIMM that was reported to be faulty by the fmadm faulty command.

6. Visually check to ensure that all of the other DIMMs are seated properly in their slots.

 

Remove the DIMM 

Use caution when pressing the DIMM ejector latch to ensure that you do not come into contact with the adjacent DCDC power board. Doing so might loosen, or otherwise adversely impact, the DCDC board, causing the server to report system errors.

1. Push down on the ejector tabs on each side of the DIMM until the DIMM is released.

DIMMs and heat sinks on the motherboard might be hot.

2. Grasp the top corners of the faulty DIMM, lift it out of its slot, and place it on an antistatic mat.

 

Install the DIMM

1. Unpack the replacement DIMM, and place it on an antistatic mat.

2. Ensure that the ejector tabs on the connector that will receive the DIMM are in the open position.

3. Align the DIMM notch with the key in the connector.

Ensure that the orientation is correct. The DIMM might be damaged if the orientation is reversed.

4. Push the DIMM into the connector until the ejector tabs lock the DIMM in place.

 

Install the CMIOU into the server

1. Install the CMIOU back into the server, check M8-8 / M7-8 / M7-16 - How to replace a Faulty CMIOU in a CMIOU chassis (Doc ID 1951961.1) , "Installing a CMIOU" for details

 

Return the faulted component to Oracle.

Caution - The removed DIMM must be properly repackaged to prevent damage during return transportation to Oracle.  The DIMM should be repackaged in identical fashion as the delivered FRU. See the Service Manual for details about proper repackaging.
  1. If the replacement included safety covers on the connectors, install the covers on the component that you are
  2. In the shipping container that contained the replacement component:
    1. Using the same material used to pack the replacement component, position the component so that it is not free to move.
    2. Add any required paperwork or other documentation in the container.
    3. Except when packing CMIOUs, include any tools that were loaned to you by Oracle. Do not place tools inside a container that is being used to return a CMIOU.
  3. Close the shipping container and seal it with the packaging tape supplied by Oracle.
  4. Apply the shipping label to the shipping container.
  5. Notify Oracle or an authorized shipper that the carton container is ready for pickup.

  

 

OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTION DOES THE CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:

Verify that there is no faulty components

  1. -> show faulty
  2.  -> show /System/Open_Problems
  3.  faultmgmtsp> fmadm faulty

Perform one of the following tasks based on your verification results

  1. If the previous steps did not clear the fault, refer to doc 1309092.1 for information about the tools and methods you can use to diagnose and clear component faults.
  2. If the previous steps indicate that no faults have been detected, the component has been replaced successfully. No further action is required

If required, change the values for the /HOSTx/diag default_level and hw_change_level before starting the host.

-> set /HOST/diag/ default_level=max
-> set /HOST/diag/ hw_change_level=max

Restart the PDomain you stopped in M8-8 / M7-8 / M7-16 - How to replace a Faulty CMIOU in a CMIOU chassis (Doc ID 1951961.1) , "Preparing a CMIOU for Removal" ,  step 3.5, and monitor/capture console HOST console output (use two windows):

-> start /Servers/PDomains/PDomain_z/HOST/console
-> start /Servers/PDomains/PDomain_z/HOST

Restart software applications per applicable administration guides to resume system operation.

After all PDomains have restarted, repeat the steps to verify that there is no faulty components, to ensure starting the PDomains has not triggered new faults.

Return the /HOSTx/diag default_level and hw_change_level to their original value.

 

SPARC M7 Series Servers Service Manual http://docs.oracle.com/cd/E55211_01/html/E55215/index.html


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback