Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-2347910.1
Update Date:2018-02-20
Keywords:

Solution Type  Technical Instruction Sure

Solution  2347910.1 :   How to Replace a CPU in a Private Cloud Appliance X7 Oracle Server Upgrade  


Related Items
  • Private Cloud Appliance X7-2 Server Upgrade
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: x64-CAP VCAP
  •  




In this Document
Goal
Solution
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: internal CAP for FRU

Applies to:

Private Cloud Appliance X7-2 Server Upgrade - Version All Versions and later
x86_64

Goal

How to Replace a CPU in a Private Cloud Appliance X7 Oracle Server Upgrade

Solution


CAP PROBLEM OVERVIEW: X7-2 CPU REPLACEMENT

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE ENGINEER NEED:

Oracle Server X7-2 Training

TIME ESTIMATE: 60 minutes

TASK COMPLEXITY: 3-FRU

FIELD ENGINEER INSTRUCTIONS

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

Caution  -  Ensure that all power is removed from the server before removing or installing a processor. You must disconnect the power cables from the system before performing these procedures.

X7-2 CPU Removal and Replacement Procedure:
https://docs.oracle.com/cd/E72435_01/html/E72445/gqtmk.html#scrolltoc



If the system is still up and functioning, the customer should perform an orderly and graceful shutdown of applications and OS. Then power off the server and remove the AC power cords from the system.
A data backup is not a prerequisite but is a wise precaution.
 

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:

1. The server should be placed in maintenance mode via OVM manager.

An Oracle VM Server can be placed into maintenance mode to perform hardware or software maintenance.
When an Oracle VM Server is placed in maintenance mode, any virtual machines running on the Oracle VM Server are automatically migrated to other Oracle VM Servers in the server pool, if they are available.
If the Oracle VM Server is the master Oracle VM Server in the server pool, this role is moved to another Oracle VM Server in the server pool, if available, after the server is shutdown.
If any of these automatic processes fail, check the Oracle VM Server event log for reasons why the failure occurred.

a. To place an Oracle VM Server into maintenance mode using GUI :

Logging into the Oracle VM Manager Web UI
To open the Login page of the Oracle VM Manager Web UI, enter the following address in a Web browser:

https://manager-vip:7002/ovm/console

Where, manager-vip refers to the virtual IP address, or corresponding host name, that you have configured for your management nodes during installation.
By using the virtual IP address, you ensure that you always access the Oracle VM Manager Web UI on the active management

b. In the Servers and VMs tab, select the Oracle VM Server in the navigation pane.
c. Click Edit Server in the management pane toolbar.

2. The Edit Server dialog box is displayed. Select the Maintenance Modecheck box to place the Oracle VM Server into maintenance mode.

3. Log into the ILOM of the faulty node and and power down the node. The ILOM IP address can be found by looking at the node via the Dashboard .

See How to determine the compute node and ILOM service processor IP addresses in OVCA. (Doc ID 1591197.1) if needed.


2. Prepare the server for service.

  1. Power off the server and disconnect the power cords from the power supplies.
  2. Extend the server to the maintenance position in the rack.
  3. Attach an antistatic wrist strap to your wrist, and then to a grounded area on the rack.
  4. Remove the server top cover.
  5. Lift the air baffles up and out of the server and set them aside.

Note - On single-processor systems, processor socket 1 (P1) is shipped without a heatsink and processor filler cover. To protect the delicate processor socket pins, do not remove the processor cover that was installed on the motherboard.

3. Identify the location of the faulty processor by pressing the Fault Remind button on the motherboard.  The processor fault LED for the faulty processor lights. The processor fault LEDs are located next to the processors.  If the processor fault LED is off, then the processor is operating properly.  If the processor fault LED is on (amber), then the processor is faulty and must be replaced.

Note - When you press the Fault Remind button, an LED located next to the Fault Remind button lights green, indicates that there is sufficient voltage in the fault remind circuit to light any fault LEDs that were lit due to a failure. If this LED fails to light when you press the Fault Remind button, it is likely that the capacitor powering the fault remind circuit lost its charge. This can happen if you press the Fault Remind button for a long time with fault LEDs lit, or if power was removed from the server for more than 15 minutes.

4. Using the 12.0 in-lbs Torx T30 driver, loosen the four captive nuts that secure the processor-heatsink module to the socket: fully loosen nut 4, then 3, then 2, then 1.

5. Lift the processor-heatsink module from the socket.  Always hold the processor-heatsink module along the ends of the fins to prevent damage, not the flat sides.

6. Separate the processor from the heatsink:

  1. Flip over the processor-heatsink module, place it on a flat surface, and locate the thermal interface material (TIM) breaker slot.
  2. While holding down the processor-heatsink module by the edges, insert a flat blade screwdriver into the TIM breaker slot.  The blade of the screwdriver goes into the slot between the heatsink and processor carrier, not between the processor and processor carrier.
  3. Using a rocking motion, gently pry the corner of the processor carrier away from the heatsink.
  4. Remove the processor carrier with processor from the heatsink by prying or pinching the plastic latch tabs that attach the processor to the heatsink.

Note - A thin layer of thermal grease separates the heatsink and the processor. This grease acts as an adhesive. Do not allow the thermal grease to contaminate the work space or other components.

Caution - Be careful not to touch the processor socket pins. The processor socket pins are very fragile. A light touch can bend the processor socket pins beyond repair.

7. If you plan on reusing either the heatsink or processor, use an alcohol pad to clean the thermal grease on the underside of the heatsink and on the top of the processor.

Caution - Failure to clean thermal grease from the heatsink could result in the accidental contamination of the processor socket or other components. Also, be careful not to get the grease on your fingers, as this could contaminate components.

Caution - Whenever you remove a processor, replace it with another processor and reinstall the processor heatsink; otherwise, the server might overheat due to improper airflow.


Installing the CPU.

1. Unpack the replacement processor but do not remove it from the packaging tray.

2. Ensure that the replacement processor is identical to the failed processor that was removed.

3. Prepare the new processor for installation:

  1. Use the syringe supplied with the new or replacement processor to apply 0.3 cc of thermal interface material (TIM) in an "X" pattern to the processor contact area of the heatsink.
  2. Note - Do not distribute the TIM; the pressure of the heatsink will do so for you when you install the heatsink.

4. Install the new processor:

  1. Align the pin 1 indicators between the heatsink and processor carrier in the packaging tray, and place the heatsink (thermal side down) onto the processor carrier until it snaps in place and lies flat.  Note - The processor carrier has latching posts at each corner: two that insert into heatsink holes and two that attach to the edge of the heatsink.
  2. Lift the processor-heatsink module out of the packaging tray.
  3. Align the processor-heatsink module to the processor socket bolster plate on the motherboard, matching the pin 1 location (a triangle indicator).
  4. Place the processor-heatsink module on the socket on the motherboard.  The socket bolster plate has alignment pins that go into holes on the processor-heatsink module to help center the module during installation.
  5. Ensure that the processor-heatsink module lies evenly on the bolster plate and that the captive screws align with the threaded socket posts.
  6. Using a 12.0 in-lbs (inch-pounds) torque driver (part number 7352217) with a Torx T30 bit, tighten the processor-heatsink module to the socket. First, fully tighten captive nuts 1 and 2. Then fully tighten nuts 3 and 4.  As you tighten nuts 3 and 4, some resistance occurs as the bolster leaf spring rises and comes in contact with the heatsink.
  7. If a 12.0 in-lbs torque driver is not available, the CPU heatsink can be safely installed with the following guidelines. Using a torx T30 hand tool (not electric), with a simple screwdriver type handle, gently tighten each of the screws until they bottom-out, at which point a sharp increase in resistance will be felt. At that point apply only a modest tightening torque by hand, such as you would apply when turning a door-knob to open a door.

5. Return the Server to operation:

  1. Install the air baffles.
  2. Install the server top cover.
  3. Return the server to the normal rack position.
  4. Reconnect the power cords to the server power supplies and reconnect the data cables.
  5. Power on the server.

6. Use Oracle ILOM to clear server processor faults.

     a. To show server faults, log in to the server as root using the Oracle ILOM CLI, and type the following command to list all known faults on the system:

-> show /SP/faultmgmt

The servers lists all known faults, for example:

-> show /SP/faultmgmt
Targets:
shell
0 (/SYS/MB/P0)
Properties:
Commands:
cd
show

     b. To clear the fault on processor 0, type the following command as an example:

-> set /SYS/MB/P0 clear_fault_action=true

For example:

-> set /SYS/MB/P0 clear_fault_action=true
Are you sure you want to clear /SYS/MB/P0 (y/n)? y
Set ‘clear_fault_action’ to ‘true’


OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTION DOES THE CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:

Boot up system and verify full functionality

 When you have finished performing maintenance on the Oracle VM Server and you are ready for it to rejoin the server pool, perform the same procedure and uncheck the Maintenance Mode check box

 

REFERENCE INFORMATION:

Oracle Server X7-2 Service Manual:
https://docs.oracle.com/cd/E72435_01/html/E72445/index.html


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback