Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1983205.1
Update Date:2018-05-15
Keywords:

Solution Type  Technical Instruction Sure

Solution  1983205.1 :   How to Replace an Exadata X5/X6 CPU and Heatsink  


Related Items
  • Oracle SuperCluster T5-8 Full Rack
  •  
  • Big Data Appliance X5-2 Starter Rack
  •  
  • Oracle SuperCluster M7 Hardware
  •  
  • Big Data Appliance X5-2 Full Rack
  •  
  • Zero Data Loss Recovery Appliance X6 Hardware
  •  
  • Exadata X5-8 Hardware
  •  
  • Exadata SL6 Hardware
  •  
  • Exadata X5-2 Full Rack
  •  
  • Exadata X5-2 Hardware
  •  
  • Oracle SuperCluster T5-8 Half Rack
  •  
  • Exadata X6-2 Hardware
  •  
  • Exadata X6-8 Hardware
  •  
  • Exadata X5-2 Eighth Rack
  •  
  • Exadata X5-2 Quarter Rack
  •  
  • Exadata X4-8 Hardware
  •  
  • Big Data Appliance X5-2 Hardware
  •  
  • Exadata X5-2 Half Rack
  •  
  • Zero Data Loss Recovery Appliance X5 Hardware
  •  
  • Big Data Appliance X6-2 Hardware
  •  
  • Oracle SuperCluster M6-32 Hardware
  •  
  • Oracle SuperCluster T5-8 Hardware
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: x64-CAP VCAP
  •  




In this Document
Goal
Solution
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: FRU replacement on Engineered system

Applies to:

Oracle SuperCluster M7 Hardware - Version All Versions and later
Zero Data Loss Recovery Appliance X6 Hardware - Version All Versions and later
Exadata X5-2 Full Rack - Version All Versions and later
Exadata X5-2 Half Rack - Version All Versions and later
Exadata X5-2 Quarter Rack - Version All Versions and later
Information in this document applies to any platform.

Goal

 How to Replace a Exadata X5/X6 CPU & Heatsink

Solution

 CAP PROBLEM OVERVIEW: Exadata X5/X6 CPU & HEAT SINK REPLACEMENT

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE ENGINEER NEED:

Exadata Trained

Caution - The correct color-coded processor removal and replacement tool must be used
to remove and replace a processor; otherwise, the processor or the processor socket might
be damaged. Additionally, do not use a tool that is designed for earlier generations of Intel
processors (CPUs). If you use an earlier generation tool, the processor will only be partially
held by the tool and it could fall from the tool, which could damage the processor socket

X5-2 Service guide : http://docs.oracle.com/cd/E41059_01/html/E48312/napsm.gnriy.html#scrolltoc

X5-2L Service guide : http://docs.oracle.com/cd/E41033_01/html/E48325/cnpsm.gnqav.html#scrolltoc

TIME ESTIMATE: 60 minutes

TASK COMPLEXITY: 3-FRU

FIELD ENGINEER INSTRUCTIONS

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

If the system is still up and functioning, the customer should perform an orderly and graceful shutdown of applications and OS. Then power
off the server and remove the AC power cords from the system.
A data backup is not a prerequisite but is a wise precaution.

Refer to Doc: How to shutdown and startup Exadata X5 compute node and storage cells when performing hardware
maintenance (Doc ID 1982342.1)

For Extended information on this section, check MOS Note 1093890.1 Steps To Shutdown/Startup The Exadata & RDBMS
Services and Cell/Compute Nodes On An Exadata Configuration. (https://support.oracle.com/epmos/faces/ui/km
/SearchDocDisplay.jspx?id=1093890.1&type=DOCUMENT )


For a documentation reference, in the Exadata Owner’s Guide (E13874), use the section of chapter 7 titled “Non-Emergency Power
Procedures” section “Powering Off Oracle Exadata Rack” sub-section “Powering off Database Servers” available on the customer's
cell server image in the /opt/oracle/cell/doc.
http://amomv0115.us.oracle.com/archive/cd_ns/E13877_01/doc/doc.112/e13874/maintenance.htm#autoId18

 

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:


1. Prepare the server for service.
Power off the server and disconnect the power cords from a. the power supplies.
b. Extend the server to the maintenance position in the rack.
c. Attach an anti-static wrist strap.
d. Remove the top cover.

e. If the server has an Oracle PCIe NVMe switch card installed, disconnect the
switch card cables from the disk backplane, carefully pull them through the
chassis mid-wall, and move them to side away from the air baffle.

f. Lift the air baffle up and out of the server and put it aside.

Removing the CPU.


a.Identify the location of the faulty CPU. To identify a faulty CPU, press and hold the blue Fault Remind button on the
motherboard. (LED is off: CPU is operating properly, LED is on (amber): CPU is faulty and should be replaced.)

b. Gently press down on the top of the heat sink to counteract the pressure of the captive spring-loaded screws that secure the
heat sink to the motherboard. Loosen the four Phillips captive screws in the heat sink for the faulty CPU. Using a No. 2 Phillips
screwdriver turn the screws counter clockwise alternately one and one half turns until they are fully removed.

c. To separate the heat sink from the top of the CPU, gently wiggle the heat sink left and right, while pulling upward. A thin layer
of thermal compound separates the heat sink and the CPU. This compound also acts as an adhesive. Note –Do not allow the
thermal compound to contaminate the work space or other components.

d. Place the heat sink upside down on a flat surface.

e. Use an alcohol pad to clean the thermal grease from both the bottom of the heat sink and the top of the CPU. Caution
–Handle CPU socket pins with extreme care. CPU and socket pins are very fragile. A light touch can bend the CPU socket pins
and damage the board beyond repair.

f. Disengage the processor release lever on the right side of the processor socket
(viewing the server from the front) by pushing down on the lever and moving it
to the side away from the processor, and then rotating the lever upward

g. Disengage the processor release lever on the left side of the processor socket
(viewing the server from the front) by pushing down on the lever and moving it
to the side away from the processor, and then rotating the lever upward

h. To lift the ILM assembly load plate off of the processor socket, rotate the
processor release lever on the right side of the processor toward the closed
position (the ILM assembly load plate is lifted up as the release lever is lowered
toward the closed position) and carefully swing the ILM load plate to the fully
open position

i. To remove the processor from the processor socket, acquire the processor
removal and replacement tool.

- Locate the button in the center of the top of the processor removal and
replacement tool and press it to the down position

- Properly position the tool over the processor socket and lower it into place
over the processor socket

To properly position the tool over the processor socket, rotate the tool until the green
triangle on the side of the tool is facing the front of the server and it is over the left side of
the processor socket when viewing the server from the front.

- Press the release lever on the tool to release the center button and engage
the processor

- Grasp the tool by the sides and remove it from the server ,Turn the tool upside down and verify that it contains the CPU

j. While holding the CPU tool up side down, press the center button on the tool to release the CPU. Carefully grasp the CPU on the
front and back edges, lift it out of the tool and carefully place the removed CPU into an anti-static container.

 

Installing the CPU.

a. Unpack the replacement CPU and place it on an anti-static mat.


b. Ensure that the two CPU socket release levers are in the fully open position and that the CPU pressure frame is in the fully open
position.

c. Press the button in the center of the removal/insertion tool to the down position. Turn the tool upside down, grasp the new CPU
by its front and back edges and position the CPU (circuit side up) in the tool. Ensure that the triangle on the corner of the CPU
aligns with the triangle on the side of the CPU removal/insertion tool. Lower the CPU into the tool. Then press the tool release
lever to release the center button and engage the CPU. An audible click indicates that the CPU is locked in place.

d. Properly position the tool over the CPU socket and lower it into place. Carefully align the CPU over the CPU socket. Ensure that
the notches on the sides of the CPU align with the keys on the socket. To properly position the tool in the CPU socket, rotate the
tool until the green triangle on the side of the tool is facing the front of the server and it is over the left side of the CPU socket
(when viewing server from the front) and lower the tool onto the CPU socket.

e. Press the center button in the tool down to release the CPU so that it is placed in the CPU socket. Caution – Do not press down
on the CPU. Irreparable damage to the CPU or motherboard might occur from excessive downward pressure. Do not forcibly
seat the CPU into the socket. Excessive downward pressure might damage the socket pins.

f. Remove the processor removal/insertion tool.

g. Visually check the alignment. Close the pressure frame ensure that the pressure frame sits flat around the periphery of the CPU.

h. Engage the release levers by rotating them downward and slipping them under the catch left first and then the right one.

Install the CPU Heat Sink Assembly.

a. Use the syringe to apply approximately 0.1 ml of thermal grease to the center of the top of the CPU. Do not distribute the
grease.

b. Inspect the heat sink for dust and lint. Clean if necessary.

c. Carefully position the heat sink on the CPU, aligning it with the mounting posts to reduce movement after it makes initial contact
with the layer of thermal grease. Caution – When handling the heat sink, be careful not to get the thermal compound grease on
your fingers so as not to transfer the grease to other server components.

d. Lower the heat sink onto the CPU. Caution – Avoid moving the heat sink after it has contacted the top of the CPU. Too much
movement could disturb the layer of thermal compound, causing voids, and leading to ineffective heat dissipation and
component damage.

e. Tighten the Phillips screws alternately one-half turn until they are fully seated.

 

Return the Server to operation

a. Replace a. the air baffle

b. If the server has a NVME switch card installed, place the switch card cables in the cable
troughs in the metal motherboard removal bracket and the air baffle, carefully
pull the cables through the chassis mid-wall, and reconnect the cables to the
disk backplane.

c. Replace the top cover

d. Return the server to it's normal operating position within the rack.

e. Re-install the AC power cords and any data cables that were removed.

f. Power on server. Verify that the Power/OK indicator led lights steady on.

 

OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTION DOES THE CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:

Boot up system and verify full functionality

Refer to doc How to shutdown and startup Exadata X5 compute node and storage cells when performing hardware
maintenance (Doc ID 1982342.1)

 

REFERENCE INFORMATION:

How to shutdown and startup Exadata X5 compute node and storage cells when performing hardware
maintenance (Doc ID 1982342.1)

Oracle Server X5-2 service guide: http://docs.oracle.com/cd/E41059_01/html/E48312/napsm.html#scrolltoc

Oracle server X5-2L service guide : http://docs.oracle.com/cd/E41033_01/html/E48325/cnpsm.html#scrolltoc


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback