Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1684984.1
Update Date:2018-05-15
Keywords:

Solution Type  Technical Instruction Sure

Solution  1684984.1 :   How to Replace a Sun Server X4-8, Oracle Server X5-8 CPU Module (CMOD)  


Related Items
  • Sun Server X4-8
  •  
  • Oracle Server X5-8
  •  
  • Exadata X5-8 Hardware
  •  
  • Exadata X6-8 Hardware
  •  
  • Exadata X4-8 Hardware
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: x64-CAP VCAP
  •  




In this Document
Goal
Solution
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: FRU - Oracle and Partner only CAP

Applies to:

Sun Server X4-8 - Version All Versions and later
Exadata X4-8 Hardware - Version All Versions and later
Oracle Server X5-8 - Version All Versions and later
Exadata X5-8 Hardware - Version All Versions and later
Exadata X6-8 Hardware - Version All Versions and later
Information in this document applies to any platform.

Goal

How to Replace a Sun Server X4-8, Oracle Server X5-8 CPU Module (CMOD).

Solution

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED?:
Sun Server X4-8, Oracle Server X5-8 Training

TIME ESTIMATE: 60 minutes

TASK COMPLEXITY: 3

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:

PROBLEM OVERVIEW: A Sun Server X4-8, Oracle Server X5-8 CPU Module (CMOD) needs replacement

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

System should be powered down.

Note: If this server is part of an Exadata, please follow shutdown instructions in DOC ID: 1982342.1

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:


How to Remove a Compute Module (CMOD)

================================================
Caution - The CMOD is not a hot-swap component. Power off the system before removing.
================================================


1.Shutdown OS and power off server.

2.Remove the AC power cords for Cold service.

3.Identify which group of fan modules (left or right) to remove to access the CMOD.  CMOD 0-3 (left fan group), CMOD 4-7 (right fan group).

4.Remove the 4 fan modules.

5.Remove the fan frame.

6.Identify the CMOD.

7.To unlock the CMOD, push down on the green labeled ejector handle latch release.

8.To disconnect the CMOD from the connector on the midplane, rotate the CMOD lever downward and away from the CMOD.

9.Use the lever to pull the CMOD partially out of its slot until you can grab it with both hands.  Then pull it the rest of the way out of the server.

10.Rotate the lever inward until it's closed and latched.

 

================================================
Caution - To protect components from damage, always use an anit-static mat and wear and anti-static wrist strap.
================================================


How to Remove the Compute Module (CMOD) Cover

1.Press the cover release button on the top of the cover.

2.Slide the cover toward the back of the CMOD and lift the cover off the CMOD.

================================================
Caution - Component Damage - CMOD components are extremely sensitive to electro-static discharge. Wear a wrist strap and use an anti-static mat.
================================================


Then you should check if below parts are installed and need to be removed.

Reference doc:

Remove DIMM

How to Remove a DIMM
https://docs.oracle.com/cd/E40591_01/html/E40317/gnvar.html#scrolltoc


Remove CPU and heatsink

How to Remove a CPU and Heatsink Assembly (FRU)
https://docs.oracle.com/cd/E40591_01/html/E40317/gnsdy.html#scrolltoc



After removing all parts from Compute Module, then install all parts to new Compute Module.

Install CPU and heatsink

How to Install a CPU and Heatsink Assembly (FRU)
https://docs.oracle.com/cd/E40591_01/html/E40317/gnsdn.html#scrolltoc


Install DIMM
https://docs.oracle.com/cd/E40591_01/html/E40317/gnsei.html#scrolltoc



After installing all parts to the new Compute Module, then install the CMOD cover.

1.Set the cover on top of the CMOD with the cover release button toward the front of the module and with approximately 1 inch of the cover overhanging the rear of the module.  This leaves a gap of approximately 1 inch between lead edge of the cover and front top edge of the CMOD chassis. The cover should sit evenly on top of the module with the pins aligned with the slots in the sidewall.

2.Slide the cover toward the front of the module until it locks in place.  When the cover is properly installed, this action produces a click sound as the cover latch engages and locks the cover.

3.Ensure that the cover is locked in place.  The cover should not move unless the release button is pressed.


Install a CPU Module (CMOD)

1.Locate the module slot that you need to populate.

2.If necessary, remove the filler or CMOD that occupies the slot.

3.Open the CMOD lever to the fully open position by pushing down on the green labeled ejector handle latch release and rotating the handle downward, away from the center of the module.

4.Orient the CMOD so that the cover faces the right side and the connector at the top.

5.Carefully slide the module into the chassis until it stops.  In this position, the pawl at the lever hinge is aligned with the slot in the server.

6.To latch and lock the CMOD, rotate the lever upward until it locks into place and is flush with the front of the CMOD.

=============================================================================
Caution - Pinch point. Keep your fingers clear of the back side and hinged end of the lever.
=============================================================================


This action pushes the module into the chassis and engages the connector on the back of the module with the connector on the interior midplane. When the handle is locked, you cannot lower the lever without first releasing the lock on the handle.


7.Install the fan frame.

8.Install the four fan modules.

9.After removing a CMOD the DPCCs associated with the CMOD removed/replaced must also be reseated to ensure they have not become slightly unseated.

10.Prepare the server for operation.


OBTAIN CUSTOMER ACCEPTANCE
   WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE?:

How to verify the CMOD is working properly.

Power on server and log in ILOM to confirm if CPU is working properly.

1.Check CMOD status from ILOM.

1.1 Show /System/Processors/CPUs/CPU_x
note: the "x" means the CPU number which correlates with the CMOD number you replaced.

example

->show /System/Processors/CPUs/CPU_0
    Targets:

    Properties:
        health = OK  <------------CPU health is good
        health_details = -
        part_number = CM80636
        serial_number = Not Available
        location = P0 (CPU 0)
        model = Intel(R) Xeon(R) CPU E7-8895 v2 @ 2.80GHz
        max_clock_speed = 2.800 GHz
        total_cores = 15
        enabled_cores = 15
        temperature = Not Supported




1.2 Check if any error output from event log.

example

-> show /SP/logs/event/list



1.3 Check if any faults exist.

-> start /SP/faultmgmt/shell
Are you sure you want to start /SP/faultmgmt/shell (y/n)? y



example

faultmgmtsp> fmadm faulty
No faults found



If faults exist, follow below doc and try to clear fault.

Doc ID 1381773.1
How to clear FMA logs on the ILOM or Solaris:ATR:1381773.1:1 (Doc ID 1381773.1)
https://support.us.oracle.com/oip/faces/secure/km/DocumentDisplay.jspx?id=1381773.1

2.Check if CMOD is working normal on the Operating System side.

Note: If this server is part of an Exadata, please follow start up instructions in DOC ID: 1982342.1

For Solaris:

2.1 Use prtdiag -v command

example

# prtdiag -v
System Configuration: Sun Server X4-8
BIOS Configuration: American Megatrends Inc. 29011300 08/26/2010
BMC Configuration: IPMI 2.0 (KCS: Keyboard Controller Style)

==== Processor Sockets ====================================

Version Location Tag
-------------------------------- --------------------------
Intel(R) Xeon(R) CPU E7-8895 v2 @ 2.80GHz CPU 1
Intel(R) Xeon(R) CPU E7-8895 v2 @ 2.80GHz CPU 2
Intel(R) Xeon(R) CPU E7-8895 v2 @ 2.80GHz CPU 3
Intel(R) Xeon(R) CPU E7-8895 v2 @ 2.80GHz CPU 4
.
.




2.2 Check FMA info if any error exists.

example

fmadm faulty -a
STATE RESOURCE / UUID
-------- ----------------------------------------------------------------------



2.3 Check /var/adm/messages file if any error exists.

example

bash-3.00$ pwd
/var/adm

#grep -i warning messages
#grep -i error messages
#grep -i fail messages

PARTS NOTE:

REFERENCE INFORMATION:
Sun Server X4-8 Service Manual
https://docs.oracle.com/cd/E40591_01/html/E40317/index.html

Oracle Server X5-8 Service Manual
https://docs.oracle.com/cd/E56301_01/html/E56311/index.html

Compute Module (CMOD) Designations
https://docs.oracle.com/cd/E40591_01/html/E40317/gnsch.html#XFESMgnsfh

How to Shutdown and Startup Exadata X5 compute nodes and storage cells when performing hardware maintenance (includes Supercluster X5 storage cells) (Doc ID 1982342.1)


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback