Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-2360588.1
Update Date:2018-05-10
Keywords:

Solution Type  Technical Instruction Sure

Solution  2360588.1 :   How to Replace an Exadata X7-2 Storage Cell Server Memory DIMM  


Related Items
  • Exadata X7-2 Hardware
  •  
  • Exadata X7-8 Hardware
  •  
  • Oracle SuperCluster M8 Hardware
  •  
  • Zero Data Loss Recovery Appliance X7 Hardware
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: x64-CAP VCAP
  •  




Oracle Confidential PARTNER - Available to partners (SUN).
Reason: Exadata internal only for Oracle support engineers use and approved HW partners

Applies to:

Exadata X7-8 Hardware - Version All Versions to All Versions [Release All Releases]
Zero Data Loss Recovery Appliance X7 Hardware - Version All Versions to All Versions [Release All Releases]
Oracle SuperCluster M8 Hardware - Version All Versions to All Versions [Release All Releases]
Exadata X7-2 Hardware - Version All Versions to All Versions [Release All Releases]
x86_64

Goal

How to Replace an Exadata X7-2 Storage Cell Server Memory DIMM.

Solution

DISPATCH INSTRUCTIONS:

WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED?: Exadata X7-2 Training

TIME ESTIMATE: 30 minutes

TASK COMPLEXITY: 2

 

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:

PROBLEM OVERVIEW:  An Exadata X7-2 Storage Cell Server Memory DIMM needs replacement

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY?:

IMPORTANT NOTE TO TSC ENGINEER:  CUT & PASTE the “CUSTOMER ACTIVITY” sections of the Pre-Replacement and Post-Replacement steps into a SR Note and ensure the customer is aware to do these steps prior to the scheduled field engineer activity, and during and after the replacement activity.

CUSTOMER ACTIVITY:

Shutdown of the storage cell is required prior to the part replacement:

Complete Steps 1 to 6 of Note ID 1188080.1 “Steps to shut down or reboot an Exadata storage cell without affecting ASM”.

Where noted, the SQL, CellCLI and commands under ‘root’ should be run by the Customers DBA, unless the Customer provides login access to the Field Engineer

These steps are also provided in the documentation:
   https://docs.oracle.com/cd/E80920_01/DBMMN/maintaining-exadata-storage-servers.htm#DBMMN21128

 

WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE?:

Prepare the Server for Service

The customer should have already prepared the server and powered it off.  If not, provide them the instructions in the previous section.

1. Extend the server to the maintenance position
2. Disconnect the power cords from the power supplies.
3. Attach an anti-static wrist strap to your wrist and to a metal area on the chassis or the rack.
4. Remove the server top cover. Use a Torx T10 screwdriver to unlock the release button latch.

Caution - Ensure that all power is removed from the server before removing or installing the memory DIMM. You must disconnect the power cables from the system before performing these procedures.

 

Caution - These procedures require that you handle components that are sensitive to electrostatic discharge. This sensitivity can cause the components to fail. To avoid damage, ensure that you follow anti-static practices as described in Electrostatic Discharge Safety.


Removing the Faulted Memory DIMM

1. Open the latch on the SuperCap tray and swing it up from the air baffle.

2. Lift up and remove the air baffle.

3. Identify and note the location of the failed DDR4 DIMM by pressing the Fault Remind button on the motherboard. The Fault Remind button is blue and located in the middle of the motherboard, near the handle and entrance to the center cable routing channel. Faulty DIMMs are identified with a corresponding amber LED on the motherboard next to the DIMM slot.

Note - When you press the Fault Remind button, an LED located next to the Fault Remind button lights green, indicates that there is sufficient voltage in the fault remind circuit to light any fault LEDs that were lit due to a failure. If this LED fails to light when you press the Fault Remind button, it is likely that the capacitor powering the fault remind circuit lost its charge. This can happen if you press the Fault Remind button for a long time with fault LEDs lit, or if power was removed from the server for more than 15 minutes.

4. Rotate both DIMM socket ejectors outward as far as they will go. The DIMM is partially ejected from the socket.

5. Carefully lift the DIMM straight up to remove it from the socket.


Installing the Replacement Memory DIMM

1. Unpack the replacement DDR4 DIMM and place it on an anti-static mat.

2. Ensure that the replacement DDR4 DIMM matches the size of the DIMM it is replacing.

3. Ensure that the ejector tabs are in the open position.

4. Align the notch in the replacement DIMM with the connector key in the connector socket.

5. Push the DDR4 DIMM into the connector socket until the ejector tabs lock the DIMM in place.

6. Lower the air baffle back into place over the DIMMs and CPU's.

7. Lower and clip the SuperCAP tray back into the air baffle.


Return the Server to Operation

1. Install the server top cover. Use a Torx T10 screwdriver to lock the release button latch.
2. Reconnect the power cords to the server power supply and connect any other cables to their original locations.
3. Return the server to the normal rack position.
4. Once the power cords have been re-attached and the ILOM has booted you will see a slow blink on the green LED for the server. Power on the
server by pressing the power button on the front of the unit.
5. Connect to the server console via the ILOM and monitor the boot.
      By default the ILOM serial console displays the primary console output.
      In the event of unexpected boot behavior, it is advisable to connect to both ILOM serial and ILOM graphics consoles at the same time and monitor.

 

OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE?:

FIELD SERVICE ENGINEER and CUSTOMER ACTIVITY:

1. Verify all expected hardware is visible to the server and the fault is cleared. Assistance from the customer for server login access will be required.

2. Verify there are no outstanding faults in ILOM:

# ipmitool sunoem cli 'show faulty'
Connected. Use ^D to exit.
-> show faulty
Target | Property | Value
-------------------+-----------------------+-----------------------------------

-> Session closed
Disconnected
#

If ILOM is still showing the memory DIMM as faulted, then manually clear the DIMM fault as follows:

a. login to the ILOM CLI and launch a Fault Management Shell session

-> start /SP/faultmgmt/shell
Are you sure you want to start /SP/faultmgmt/shell (y/n)? y

faultmgmtsp>

b. use the "fmadm faulty -a" command to list out all active faulty components

faultmgmtsp> fmadm faulty -a

c. If the DIMM that was just replaced is still listed as a fault then indicate that the DIMM has been replaced by using the command "fmadm replaced <fru|cru|uuid>" for example:

faultmgmtsp> fmadm replaced /SYS/MB/P0/D2

d. confirm that the faults are cleared and then exit out of the fault management shell
  

3. Verify there are no outstanding alerts in the Cell:

# cellcli -e list alerthistory

4. Re-activate the Storage Cell grid disks.  Follow Steps 7 to 10 of Note ID 1188080.1 “Steps to shut down or reboot an Exadata storage cell without affecting ASM”.

Where noted, the SQL, CellCLI and commands under ‘root’ should be run by the Customers DBA, unless the Customer provides login access to the Field Engineer.

These steps are also provided in the documentation:
 https://docs.oracle.com/cd/E80920_01/DBMMN/maintaining-exadata-storage-servers.htm#DBMMN21128

 

PARTS NOTE:

7330697 [F] 16GB DDR4-2666 Registered DIMM

 

REFERENCE INFORMATION:

Oracle Exadata Database Machine Maintenance Guide: https://docs.oracle.com/cd/E80920_01/DBMMN/maintaining-exadata-storage-servers.htm#DBMMN21128

Oracle Server X7-2L Documentation https://docs.oracle.com/cd/E72463_01/index.html

Steps to shut down or reboot an Exadata storage cell without affecting ASM (Doc ID 1188080.1)

 


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback