Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-71-2066641.1
Update Date: 2017-10-03
Keywords:

Solution Type  Technical Instruction Sure

Solution  2066641.1 :   M7-8 / M7-16 How to replace a Faulty PDU  


Related Items
  • Oracle SuperCluster M7 Hardware
  • SPARC M7-16
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: SPARC-CAP VCAP


This document describes how to replace a faulty M7-16 PDU.

In this Document
Goal
Solution
 Prepare for PDU removal
 Remove the failing PDU
 Install the replacement PDU
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: M7 Information required for M7 series servers

Applies to:

Oracle SuperCluster M7 Hardware - Version All Versions and later
SPARC M7-16 - Version All Versions and later
Information in this document applies to any platform.

Goal

 CAP PROBLEM OVERVIEW: M7-16 PDU replacement

*********************************************************************
To report errors or request improvements on this procedure, please go to
My Oracle Support, and put a comment on Doc ID: 2066641.1

*********************************************************************

ESD Caution:

  • Circuit boards and drives contain electronic components that are extremely sensitive to static electricity. Ordinary amounts of static electricity from clothing or the work environment can destroy the components located on these boards. Do not touch the components along their connector edges.
  • Use an antistatic wrist strap. Attach one end of the strap to your wrist and the other end to the chassis, using either the adhesive end or the metal plug, depending on the type of strap.
  • Use an antistatic mat. Place ESD-sensitive components such as motherboards, memory, and other PCBs on an antistatic mat.

 

Contamination Caution:

  • Dust particles from packaging material are the number one cause of data center contamination. While still outside the data center, remove all packaging material except the ESD-safe packaging.

Solution

 

WHAT SKILLS DOES THE ENGINEER NEED: M7-16 Product Training/Experience

TASK COMPLEXITY: 4

TIME ESTIMATE: 90 minutes

COLD replacement

FIELD ENGINEER INSTRUCTIONS

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

Physical Domains must be shut down. Server must be powered off. 

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:

 

HOT PDU replacement is strongly discouraged because of the number of I/O connections that might be disturbed. The official procedure requires COLD replacement. If the customer insists on HOT replacement, discuss the following challenges with the customer before proceeding:

  • I/O cables routed on the side of the rack where the PDU needs replacement must be temporarily relocated outside of the wire-form cable management guides adjacent to the PDU. This increases the likelihood of an accidental I/O disconnection.
  • There must be room for the PDU to travel around the cable management guides. Ensure that the weight of the cables can be supported outside of the cable guides so that no cables are accidentally disconnected from the running server.
  • Before replacing the PDU, ensure that all of the power supplies connected to the remaining PDU are running with no faults. During the PDU replacement there is no PSU redundancy: any failure of the PSUs connected to the remaining PDU, or any power fluctuation on the inputs to the remaining PDU, will cause the system to crash.
  • If the system is running on a minimum of N-1 PSUs before the PDU replacement, a HOT PDU replacement leaves the system running on half of the available PSUs. System performance is then impacted because the CPUs are automatically throttled back to consume less power.

 

 

Prepare for PDU removal

1. Familiarize yourself with all power specifications and requirements.

2. Determine which PDU requires service.

These PDUs are not flagged by FMA and should be replaced only on the advice of an M7 product specialist.

3. Ensure that you have powered off all hosts and the server. To power down the server (requires the Oracle ILOM Reset and Host Control (r) user role):

-> stop /System

Stopping the server can take some time, and you must wait until the following message appears on the host(s) console before proceeding to the next step.
-> SP NOTICE: Host is off


4. Disconnect the PDU input power cords that connect the faulted PDU to the facility AC power source.

5. Unpack the replacement PDU on a static-safe mat, open the rear server door, and attach an antistatic wrist strap.

6. Confirm that all PDU circuit breakers are switched off.

Ensure that the circuit breakers on both the faulted and replacement PDUs are turned completely off. 
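
Step 3 above requires waiting for the host-off notice before continuing. If the console session is being captured to a text log, a check along the following lines could confirm that the notice has appeared. This is only a sketch: the function name host_is_off and the sample capture are hypothetical, and the sample contains only the strings quoted in step 3.

```python
HOST_OFF_NOTICE = "SP NOTICE: Host is off"


def host_is_off(console_text: str) -> bool:
    """Return True if the power-off notice appears on any line of the capture."""
    return any(HOST_OFF_NOTICE in line for line in console_text.splitlines())


# Hypothetical capture containing only the strings quoted in step 3.
sample = "-> stop /System\n-> SP NOTICE: Host is off\n"

print(host_is_off(sample))                 # True
print(host_is_off("-> stop /System\n"))    # False: keep waiting
```

The check is deliberately a plain substring match, so it does not depend on any other console output format.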

 

 

Remove the failing PDU

1. Ensure that you have prepared the faulty PDU for removal as described in "Prepare for PDU removal".

2. Shut down and power off any ancillary equipment installed in the rack.

3. Remove power using the circuit breakers on the appropriate PDU. Turn the breakers off in the following sequence:

    R4, R5, L5, L4
    R0, R1, R2, L8, L7, L6
    R6, R7, R8, L2, L1, L0

    where R indicates the right PDU from the rear of the server, L indicates the left PDU from the rear of the server, and the number represents the PDU group number.

Caution - Because standby power is always present in the server, you must switch off the circuit breakers on the PDUs before accessing any cold-serviceable components.

 

4. Disconnect any power jumper cords connected to the faulty PDU from equipment in the rack.

Check that all AC power cords to and from the PDU are labeled so that they can be reconnected correctly once the PDU has been replaced.

 

5. Cut any tie-wraps securing the faulty PDU power input lead cords to the tie-down brackets.

6. Disconnect the grounding strap connecting the top of the faulty PDU to the rack.

7. If the rack included a factory-installed PDU, use a T-25 Torx wrench to remove the four M5 screws and washers securing the faulty PDU to the mounting brackets.

8. Carefully lift the faulty PDU up and off the mounting brackets. Remove the PDU from the rack and place it on a clean work table.
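
The breaker sequence in step 3 interleaves breakers from both PDUs. When only one PDU is being serviced, it can help to list just that PDU's breakers while keeping the documented order. The following is a minimal sketch, assuming the sequence is read row by row, left to right, as printed in step 3; the function name breakers_for is hypothetical.

```python
from typing import List

# Breaker sequence copied from step 3, one inner list per printed row.
BREAKER_SEQUENCE = [
    ["R4", "R5", "L5", "L4"],
    ["R0", "R1", "R2", "L8", "L7", "L6"],
    ["R6", "R7", "R8", "L2", "L1", "L0"],
]


def breakers_for(side: str) -> List[str]:
    """Return the breakers on one PDU ('L' or 'R') in the documented order."""
    if side not in ("L", "R"):
        raise ValueError("side must be 'L' or 'R'")
    return [b for row in BREAKER_SEQUENCE for b in row if b.startswith(side)]


print(breakers_for("L"))  # ['L5', 'L4', 'L8', 'L7', 'L6', 'L2', 'L1', 'L0']
print(breakers_for("R"))  # ['R4', 'R5', 'R0', 'R1', 'R2', 'R6', 'R7', 'R8']
```

The list is filtered rather than re-sorted, so the relative order given in the procedure is preserved.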

 

Install the replacement PDU

1. Lift the replacement PDU and, while ensuring that the circuit breakers are facing the rear of the rack, carefully set the replacement PDU's standoff bolts into the keyhole slots of the top and bottom brackets.

2. (Optional) Use a T-25 Torx wrench and four M5 shipping screws and washers to secure the replacement PDU to the mounting brackets.

3. Route the power input lead cords between the rear RETMA rail and side panel.

Caution - Never twist, kink, or tightly bend a power input lead. 

 

4. Using tie-wraps, secure the replacement PDU input lead cables to the cable routing brackets.

5. Ensure that you have switched off every PDU circuit breaker on the replacement PDU.


6. Locate the replacement PDU input lead cord connectors. Depending on how you routed the cords when you installed the PDUs, route these cords either out the bottom of the rack or out the top.


7. Connect the replacement PDU power lead cords to the facility AC power source. If your rack contains two PDUs, ensure that each PDU is connected to a different AC power source circuit. Reinstall the jumper cords in the same locations from which you removed them.

 

8. Switch on the PDU circuit breakers. Turn the breakers on in the following sequence:

R4, R5, L5, L4
R0, R1, R2, L8, L7, L6
R6, R7, R8, L2, L1, L0

where R indicates the right PDU from the rear of the server, L indicates the left PDU from the rear of the server, and the number represents the PDU group number.

9. Restart the server:

-> start /System

Return the faulted component to Oracle.
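
The removal procedure asks that all AC power cords be labeled, and the installation procedure asks that jumper cords go back to the same locations. One way to make that check explicit is to record the cord-to-location mapping before removal and compare it after reinstallation. The sketch below assumes a simple label scheme; the cord and outlet names and the function name reconnection_errors are hypothetical and not taken from the service manual.

```python
from typing import Dict, List


def reconnection_errors(before: Dict[str, str], after: Dict[str, str]) -> List[str]:
    """Compare cord locations recorded before removal with those after reinstallation."""
    problems = []
    for cord, expected in sorted(before.items()):
        actual = after.get(cord)
        if actual is None:
            problems.append(f"{cord}: not reconnected (expected {expected})")
        elif actual != expected:
            problems.append(f"{cord}: connected to {actual}, expected {expected}")
    return problems


# Hypothetical labels recorded before the faulty PDU was removed.
before = {"cord-1": "outlet-A", "cord-2": "outlet-B"}
# Hypothetical state after reinstallation, with the two cords swapped.
swapped = {"cord-1": "outlet-B", "cord-2": "outlet-A"}

for problem in reconnection_errors(before, swapped):
    print(problem)
```

An empty result means every recorded cord is back in its original location.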

 

OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTION DOES THE CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:

1. Verify that the fault has been cleared and the replaced component is operational.

2. Verify that the Power OK LED is lit, and that the Fault LED and front and rear Service Required LEDs are not lit.

3. Verify that there are no faulty components:

  1. -> show faulty
  2. -> show /System/Open_Problems
  3. faultmgmtsp> fmadm faulty

4. Perform one of the following tasks based on your verification results:

  1. If the previous steps did not clear the fault, refer to Doc ID 1309092.1 for information about the tools and methods you can use to diagnose and clear component faults.
  2. If the previous steps indicate that no faults have been detected, the component has been replaced successfully. No further action is required.

 

======================== Other info =====================

REFERENCE INFORMATION:  Service Manual: http://docs.oracle.com/cd/E55211_01/html/E55215/index.html


References

<NOTE:1423708.1> - Replacement procedure for How to Replace a Power Distribution Unit on an Engineered System using Sun Rack II
http://www.oracle.com/technetwork/documentation/oracle-periph-serv-work-190023.html#cabracks

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.