Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-71-2052836.1
Update Date:2018-01-08
Keywords:

Solution Type  Technical Instruction Sure

Solution  2052836.1 :   How To Replace A CPU/HEATSINK In An Oracle ZFS Storage ZS4-4


Related Items
  • Oracle ZFS Storage Appliance Racked System ZS4-4
  • Oracle ZFS Storage ZS4-4
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: DISK-CAP VCAP

In this Document
Goal
Solution
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: Partners may need to swap CPUs and need this document.

Applies to:

Oracle ZFS Storage Appliance Racked System ZS4-4 - Version All Versions to All Versions [Release All Releases]
Oracle ZFS Storage ZS4-4 - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.

Goal

To be able to remove and replace an Oracle ZFS Storage ZS4-4 CPU MODULE & HEATSINK

Solution

 

DISPATCH INSTRUCTIONS

- WHAT SKILLS DOES THE ENGINEER NEED: ZFS appliance / X64 server HW knowledge.

- TIME ESTIMATE: 75 minutes

- TASK COMPLEXITY: 3


FIELD ENGINEER INSTRUCTIONS

- PROBLEM OVERVIEW

What: CPU / Heat sink.
Where: Chassis SN and CPU ID to be specified by TSC. FE to confirm via BUI / CLI.
Why: The CPU/HEAT SINK reported failed and TSC has recommended replacement.


WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY?

The affected appliance head should be powered off.
NOTE: If this is a clustered appliance and the partner head is to remain up with all resources online, the partner head should already have completed a successful takeover.

 

HINT: "Online Help Maintenance Procedures" can be viewed before attending site via https://bluegill.us.oracle.com:215/ak8-rel/index.php/Maintenance [This section is not visible to customers.]

 

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:

Caution – These procedures require that you handle components that are sensitive to static discharge. Take appropriate electrostatic precautions to avoid component failures.


1. Prepare the appliance for service.

   1.1 Power off the appliance and disconnect the power cord(s) from the power supply(s).

   1.3 Remove the appliance from the rack.


2. Attach an antistatic wrist strap.

   2.1 Remove the top cover.

   2.2 Press and hold the system Fault Remind button.

       The Fault Remind button is located on the divider between cooling zone 1 and cooling zone 2.

   2.3 To locate a failed CPU, look for the lit memory riser (MR) card Fault indicators and the lit CPU Fault indicator. For more information, see CPU Fault Indicators in the Sun Server X4-4 Service Manual.

       When a CPU is in a fault state, the Fault indicators for the CPU and both MR cards associated with the CPU light when the system Fault Remind button is pressed.

 

3. Remove a Heatsink and CPU (FRU)

   3.1 Remove the two memory riser cards associated with the failed CPU.

   3.2 Remove the heatsink:

   3.2.1 Unscrew the four Phillips screws from the heatsink. Turn the screws alternately one and one-half turns until they are fully removed.

         To remove the heatsink, break the seal created by the thermal compound by slightly twisting the heatsink left and right while pulling it upward.

         Do not allow the thermal compound to contaminate other components. Retain the heatsink. You need to reuse it.

 

4. Lift the heatsink filler panel out of the chassis. Place the heatsink upside down on the work space.

 

5. Open the spring-loaded CPU load plate release levers by pushing them down and moving them slightly toward the CPU socket and away from their retaining clips

       The levers are numbered by their required order of operation. The left-side lever (when viewed from the front of the server) must be opened first.

 

   5.1 Rotate the levers to the fully-open position.

       When the second lever is in its fully-open position, the load plate is unlocked.

   5.2 To open the load plate, lift the unhinged end to its fully-open position.

       Caution - Component damage. The pins of the CPU socket can be easily damaged. To remove the CPU, use the correct CPU replacement tool.

 

6. To remove the CPU, use the CPU replacement tool:

      Note - Ensure that you use the correct CPU replacement tool. The correct tool has part number G29477-002 affixed to the side, and it has a green label.

      However, the label color alone is not an indicator of the correct tool. Verify that the part number is correct.

      The tool is used to remove and install the CPU in the socket. The top side of the replacement tool has a button in the center and a tab on one side.

      Pressing down on the button opens the tool. Pressing the tab closes the tool (and releases the button).

 

   6.1 Press down on the release button on top of the replacement tool.

          This action opens the tool.

          On one corner of the tool there is a label with a downward pointing triangle. Likewise, the CPU is marked with a triangle on one of its corners.

          This is a key that aids in correctly positioning the tool and the CPU with the CPU socket. The tool and the CPU are correctly positioned with the socket when all of the triangles are aligned.


   6.2 Orient the bottom of the tool over the CPU, ensuring that the triangle on the tool aligns with the triangle on the CPU.

          Lower the tool onto the CPU, ensuring that it sits evenly on the CPU.
          Push the release tab away from the center button.

          This action is accompanied by a click sound as the tool closes and grabs the CPU.

          To remove the CPU, lift the tool upward and out of the server.

 

8. Depending on whether you are adding a new CPU and heatsink, or replacing one or both of these components because they are damaged, your kit might contain the following:

 

   ■ CPU and pre-greased heatsink
   ■ Pre-greased heatsink only
   ■ CPU only, with syringe to apply thermal grease to existing heatsink

 

  8.1 At the CPU socket, ensure that the CPU load plate and both load plate release levers are in their fully open position.

        To install a CPU, use the CPU replacement tool.

           Note - Ensure that you use the CPU replacement tool, part number G29477-002. The part number is printed on the side of the tool. The tool is shipped with a new CPU.

           The tool is used to remove and install the CPU in the socket.

           The top side of the replacement tool has a button in the center and a tab on one side. Pressing down on the button opens the tool. Pressing the tab releases the button and closes the tool.

           Full details of usage are in the Sun Server X4-4 Service Manual.

 

   8.2 Inspect the CPU to ensure that it sits evenly within the socket.

   8.3 Close the CPU load plate.

   8.4 Lower and lock the right side lever, ensuring that the lever is secured under its retaining clip and that the bend in the lever locks the cover plate.

   8.5 The right side lever must be closed first.

   8.6 Lower and lock the left side load plate lever, ensuring that it is secured under its retaining clip.
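
   The lever ordering in steps 5 and 8.4 to 8.6 (open left then right, close right then left) behaves like a stack. The following is only an illustrative sketch of that ordering rule. The class name LoadPlateLevers and its methods are hypothetical and are not part of any Oracle tool.

```python
from dataclasses import dataclass, field


@dataclass
class LoadPlateLevers:
    """Hypothetical model of the two CPU load plate release levers."""

    opened: list = field(default_factory=list)  # used as a stack

    def open(self, lever: str) -> None:
        # Opening order from step 5: left-side lever first, then right.
        if len(self.opened) == 2:
            raise ValueError("both levers are already open")
        expected = ("left", "right")[len(self.opened)]
        if lever != expected:
            raise ValueError(f"open the {expected} lever next, not {lever}")
        self.opened.append(lever)

    def close(self, lever: str) -> None:
        # Closing order from steps 8.4 to 8.6: right first, then left,
        # which is the reverse of the opening order.
        if not self.opened or self.opened[-1] != lever:
            raise ValueError(f"cannot close the {lever} lever now")
        self.opened.pop()

    @property
    def load_plate_unlocked(self) -> bool:
        # The load plate is unlocked only when both levers are open.
        return len(self.opened) == 2
```

   In this sketch, attempting to close the left lever while the right lever is still open raises an error, which mirrors the requirement that the right side lever must be closed first.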

 

9. Re-fitting the heatsink

    9.1 To apply the thermal compound, dispense the contents of the syringe as a single dollop in the center on the top of the CPU.

          Do not spread the thermal compound. The pressure applied during the heatsink installation performs this action.

 

   9.2 To install the heatsink:

       9.2.1 Align the captive spring-loaded heatsink screws with the threaded standoffs on the motherboard.

       9.2.2 Set the heatsink on top of the CPU.

                   Note: Once the heatsink is in contact with the CPU, avoid extra movement of the heatsink.

       9.2.3 Use a number 2 Phillips screwdriver and alternately tighten each screw one-half turn until all screws are completely tightened.
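
       The alternating pattern in step 9.2.3 can be pictured as a round-robin pass over the four screws, one-half turn at a time. The sketch below is illustrative only. The screw labels and the turns_needed value are assumptions made for the example, because the procedure does not state how many turns are required.

```python
from itertools import cycle


def tightening_sequence(screws, turns_needed, step=0.5):
    """Yield (screw, total_turns_so_far) in round-robin order.

    turns_needed is a hypothetical value; in practice, tighten until
    each screw is completely tightened.
    """
    progress = {screw: 0.0 for screw in screws}
    for screw in cycle(screws):
        # Stop once every screw has reached the target.
        if all(turns >= turns_needed for turns in progress.values()):
            return
        if progress[screw] < turns_needed:
            progress[screw] = min(progress[screw] + step, turns_needed)
            yield screw, progress[screw]


# Example with four hypothetical screw labels and an assumed 1.0 turn target.
for screw, turns in tightening_sequence(["A", "B", "C", "D"], 1.0):
    print(f"screw {screw}: {turns} turns")
```

       The same generator with step=1.5 would describe the removal pattern in step 3.2.1, where the screws are turned alternately one and one-half turns.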

 

10. Return the server to operation

   10.1 Install the top cover.

   10.2 Return appliance head to normal rack position.

   10.3 Reconnect power cord(s) to power supply and power on appliance.

              Verify AC Present LED is lit.

   10.5 Confirm that the appliance powers on and boots, and that the BUI is available.

              NOTE: If the CPU being installed is replacing a faulty CPU, manually clear the CPU fault using Oracle ILOM.

              For instructions on clearing server faults, refer to the Oracle Integrated Lights Out Manager (ILOM) 3.2.2 User's Guide.


NOTE: If clustered, ensure that the head rejoins the cluster and perform a failback if required.


OBTAIN CUSTOMER ACCEPTANCE

- WHAT ACTION DOES THE FIELD ENGINEER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:

Confirm that no faults are reported in the appliance BUI under Maintenance - Problems or Maintenance - Hardware - show details - CPU.
"Mark Repaired" any prior faults related to the just-replaced CPU if required.


PARTS NOTE:
Check the System Handbook for the correct part number.

 

REFERENCES

 

Online help documentation for Oracle's Sun™ ZFS Storage 7000: https://bluegill.us.oracle.com:215/ak8-rel/index.php/Maintenance [This section is not visible to customers.]


NOTE:1379117.1 - Sun Storage 7000 Unified Storage System: How To Shutdown ZFSSA Cluster
NOTE:1416406.1 - Sun ZFS Storage Appliances Troubleshooting Resource Center
NOTE:1167013.1 - Sun Storage 7000 Unified Storage System: 'Level 2 CPU Cache Fault'
NOTE:1378725.1 - Sun Storage 7000 Unified Storage System: How to Identify a broken CPU
Oracle Unified Storage Systems Documentation : http://www.oracle.com/technetwork/documentation/oracle-unified-ss-193371.html

X4-4 Service Manual - http://docs.oracle.com/cd/E38212_01/html/E38221/index.html

 


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.