![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||
Solution Type Technical Instruction Sure Solution 2347200.1 : How to Replace an Oracle Server X7-8 Memory DIMM
In this Document
Applies to:Oracle Server X7-8 - Version All Versions to All Versions [Release All Releases]Information in this document applies to any platform. GoalHow to Replace an Oracle Server X7-8 Memory DIMM. SolutionDISPATCH INSTRUCTIONS WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED?: TIME ESTIMATE: 60 minutes TASK COMPLEXITY: 0 FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS: PROBLEM OVERVIEW: An Oracle Server X7-8 Memory DIMM needs replacement WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? Note - Oracle Server X7-8 can be configured as either a one 4-socket server, two independent 4-socket servers, or one 8-socket server. Depending on your Customers configuration will depend on if you need to bring the whole system down or not:
A data backup is not a prerequisite but is a wise precaution. If the server module (SMOD) with effective CMOD, CPU, or DIMM is still up and functioning, Customer should perform an orderly and graceful shutdown of applications and OS on the server module (SMOD) being replaced. This includes performing a "-> stop /System" of this SMOD to bring it down into "standby mode". In Standby power mode, the Power/OK LED on the front panel of the server module will begin flashing. WHAT ACTION DOES THE ENGINEER NEED TO TAKE: Caution - Loss of service or component damage. Do not replace any components except for CMODs and internal CMOD subcomponents while the server is in warm service mode.
Caution - Data Loss. Do not remove more than one fan module from a column while the system is in Main power mode. This action removes power from the CMODs and causes an immediate shutdown. On an eight-CMOD system, this applies to all fan modules. On a four-CMOD system, this applies to the fan modules in the left-hand fan frame.
This procedure describes how to prepare the server for warm service. Use warm service to remove and replace CMOD, DIMM, and processor components without accessing the server back panel to disconnect AC power cords or shutting down Oracle ILOM. When Oracle ILOM detects that two fan modules in a single cooling zone (a vertical column) have been removed, the SP removes power from the CMODs, allowing you to service CMODs and their subcomponents without removing the power cords. Oracle ILOM remains available in warm service mode. Preparing a CPU Module (CMOD) for Removal Note - This procedure can also be completed as a cold service procedure.
Caution - Data loss. Removing the power cords when the server is in Main power mode results in an immediate shut down of the server. Do not remove the power cord if the server is in Main power mode. Power off the server to Standby power mode first.
1. Log in to the SP Oracle ILOM CLI. Log in as a user with root or administrator privileges. For example, open an SSH session, and at the command line type: ssh root@sp_ip_address 2. Power off the server gracefully to Standby power mode. -> stop /System
1. Identify which group of fan modules (left or right) to remove to access the CMOD. 2. Remove the fan modules. a. To unlock a fan module, push in the green release button. Caution - Data Loss. Do not remove more than one fan module from a column while the system is in Main power mode. This action removes power from the CMODs and causes an immediate shutdown. On an eight-CMOD system, this applies to all fan modules. On a four-CMOD system, this applies to the fan modules in the left-side fan frame.
b. To remove the fan module, pull it out of the slot. 3. Remove a fan frame. a. To remove the fan frame, hold it by the green labels at the center of the frame and pull it out of the server. 4. Identify the CMOD. 5. To unlock the CMOD, squeeze together the green tabs on the end of the CMOD lever. 6. To disconnect the CMOD from the connector on the midplane, rotate the CMOD lever down and away from the CMOD. Caution - Pinch point. Keep your fingers clear of the underside of the lever.
The lever disconnects the CMOD from the midplane and its DPCC. 7. Use the lever to slide the CMOD partially out of the server until you can grab it with two hands. Caution - The CMOD is heavy. Be prepared to hold it firmly when it is clear of the slot.
8. To remove the CMOD from the chassis, slide the CMOD completely out of the server. 9. Remove the CMOD top cover. 10. Lift the CMOD top cover away from the CMOD. Caution - These procedures require that you handle components that are sensitive to electrostatic discharge. This sensitivity can cause the components to fail. To avoid damage, ensure that you follow antistatic practices as described in Electrostatic Discharge and Static Prevention Measures.
Identifying and Removing a DIMM DIMM Physical Layout 1. Identify and note the location of the faulty DDR4 DIMM by pressing the Fault Remind button on the motherboard.
To locate the failed DIMM, press and hold the Fault Remind button. This procedure uses the DIMM fault remind test circuit in the CMOD to identify the failed DIMM. The circuit is a charged, time-limited circuit. Once power is removed from the server you have 10 minutes to use the circuit for troubleshooting. 2. Verify that the green Fault Remind Power LED indicator is lit. The Fault Remind circuit remains charged for about 10 minutes after power is removed from the CMOD, either by disconnecting power from the server, or by removing the CMOD from the chassis. When you press the Fault Remind button, the Charge Status indicator lights if there is enough power to use the fault remind circuit. Otherwise it remains unlit. 3. With the Fault Remind button pressed, look for a lit DIMM fault LED indicator. 4. Rotate both DIMM socket ejectors outward as far as they will go. 5. Remove the faulty DIMM from the CMOD. Note - Replace each faulty DIMM with either another DIMM of the same rank size (quad-rank or dual-rank) or leave the socket empty.
Installing a DIMM Note - Use this procedure to install DIMMs for a memory upgrade or a configuration change, or as part of a DIMM reset (removal and installation).
1. Unpack the replacement DDR4 DIMM and place it on an antistatic mat. 3. Locate the DIMM slot. a. Ensure that the ejector tabs are in the open position. 4. To install the DIMM in the slot, simultaneously press down on both edges of the DIMM. 5. Verify that the DIMM sits evenly in the slot and is locked. 6. Install the CMOD cover.
1. Ensure that the CMOD lever is in the fully-open position. a. Squeeze together the green tabs on the end of the lever. 2. Position the CMOD in the slot. Caution - The CMOD is heavy. Be prepared to hold it firmly until it is securely supported in its slot.
3. On the front-facing side, ensure that the hinge for the lever is at the bottom. 4. Slide the CMOD into the slot until it stops. 5. To install the CMOD, rotate the lever up until it locks into place and is flush with the front of the CMOD. Caution - Pinch point. When operating the lever, keep your fingers clear of the back side and hinged end of the lever.
6. Install the fan frame. a. Position the fan frame at the opening in the front of the server with the air vane hinges at the top. Note - The center of the fan frame is marked with green labels. The labels indicate where to grab the frame when you want to install or remove it.
b. Slide the fan fame into the server until it stops and is flush with the front of the server. 7. Install the four fan modules. a. Align the fan module with the slot. Caution - Component damage. Do not apply excessive force when sliding the fan module into the server. Ensure that the connector on the CMOD and the connector on the fan module are aligned correctly.
b. To install the fan module, slide it into the slot until it stops and gently push it inward until the fan module locks into place. 8. Ensure that all external front and back components are fully installed. 9. Ensure that all cables are connected to the back of the server. 10. Connect all AC power cables to their inlets on the back of the server and verify that they are locked. 11. If necessary, connect the other end of the AC power cables to the supply outlet. 12. Ensure that the server is powering into Standby power mode. 13. Power on the server module. -> start /System
14. Log out of Oracle ILOM. 15. Verify that the Power/OK indicator led lights steady on.
Log in to the ILOM CLI and launch a Fault Management Shell session -> start /SP/faultmgmt/shell
Are you sure you want to start /SP/faultmgmt/shell (y/n)? y faultmgmtsp> use the "fmadm faulty -a" command to list out all active faulty components faultmgmtsp> fmadm faulty -a
If the DIMM that was just replaced is still listed as a fault then indicate that the DIMM has been replaced by using the command "fmadm replaced <fru|cru|uuid>" for example: faultmgmtsp> fmadm replaced /SYS/MB/P0/D2
confirm that the faults are cleared and then exit out of the fault management shell faultmgmtsp> fmadm faulty -a
No faults found faultmgmtsp> exit -> Enter the following command to check status is normal status: -> show /System/Memory/DIMMs/DIMM_x
note: the "x" represents the DIMM number of the DIMM replaced Example: ->show /System/Memory/DIMMs/DIMM_0 /System/Memory/DIMMs/DIMM_0 Targets: Properties: health = OK health_details = - part_number = 001-0003-01,M393B2G70DB0-YK0 serial_number = 00CE011412225F9C48 location = P0/D0 (CPU 0 DIMM 0) manufacturer = Samsung memory_size = 16 GB Check if any error output from event log -> show /SP/logs/event/list
PARTS NOTE: 7330697 [C] 16GB DDR4-2666 Registered DIMM REFERENCE INFORMATION: Sun Server X7-8 Service Manual
Attachments This solution has no attachment |
||||||||||||||
|