![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||||
Solution Type Technical Instruction Sure Solution 1532486.1 : Mx-32 - How to Replace a Faulty Service Processor Proxy Module (SPP)
Removing and Replacing Service Processor Proxy modules and servicing SPP battery In this Document
Oracle Confidential INTERNAL - Do not distribute to customer (OracleConfidential). Applies to:SPARC M5-32 - Version All Versions and laterOracle SuperCluster M6-32 Hardware - Version All Versions and later SPARC M6-32 - Version All Versions and later Information in this document applies to any platform. In a full up 4 DCU Mx-32 platform with a single PDomain, all four SPPs operate as a single configured unit. If an SPP fails, the SP attempts to: ■ Automatically halt the server ■ Unconfigure the failed SPP from the set ■ Configure the remaining SPPs into a new set ■ Restart the server with the reduced SPP configuration. There is a backup battery installed on each SP/SPP. BATT,LITH,3V,125MA,1632,DISK otherwise known as CR1632 GoalCAP PROBLEM OVERVIEW: Mx-32 - SPP Failure ********************************************************************* *********************************************************************
ESD Caution:
Contamination Caution:
SolutionDISPATCH INSTRUCTIONS WHAT SKILLS DOES THE ENGINEER NEED: Mx-32 Product Training/Experience TASK COMPLEXITY: 3 TIME ESTIMATE: 60 minutes WARM replacement FIELD ENGINEER INSTRUCTIONS Caution – Caution – Do not turn off an SPP while it is configured into an active PDomain. Doing so will cause the associated PDomain to panic. WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? : The Blue 'ready to remove' LED on the face of the failed SPP should be lit. The physical domain in which the SPP was previously configured may be up or down. As long as the SPP to be replaced is properly configured out of the system it can be replaced. If it is not configured out, then the physical domain into which it is configured must be shut down and 'stopped'. WHAT ACTION DOES THE ENGINEER NEED TO TAKE: Removing a SPP1. Determining if a Service Processor Proxy Module can be removed without further intervention: a. Use one of these Oracle ILOM commands to display faulty components: -> show faulty or -> show /System/Open_Problems b. If there is a failed or faulted SPP, it will show up, and one should continue with step 2 below. If you see nothing then they fault condition for which the service action was filed has been cleared already, and the pending service action to replace a SPP is now suspect. 2. Determine if you need to save the TPM keystore from the faulty SPP. (See the Service Manual for full details with examples) a. In Oracle ILOM, see if TPM is activated. -> show -d properties /Servers/PDomains/PDomain_x/HOST/tpm mode ■ If mode = off, go to Step 3. b. At the PDomain host, determine if TPM is in use. $ tpmadm keyinfo ■ If you see an error message, TPM is not in use. Go to Step 3. c. Verify that TPM is migratable, and the TPM key blob is available. $ tpmadm keyinfo 00000000-0000-0000-0000-00000000000b d. Export the TPM key blob and the TPM authorization key to the hard drive. $ tpmadm migrate export 00000000-0000-0000-0000-0000000000b The default file name of the TPM key blob is tpm-migration.dat. The default file name of the TPM authorization key is tpm-migration.key. Both files are located in /var/tpm/system/.
3. Review affected components status' and LED states a. Determine to which DCU the SPP belongs by referring to the fault report displayed in Step 1. In the line that lists the name of the faulted SPP (e.g /SYS/SPPx), 'x' is the DCU number. b. Physical inspection of the SPP, the DCU and the IOU associated with the failed SPP is optimal. If all components are off, and the Blue 'Ready to Remove' LED is lit on the suspect SPP, then one may proceed with the replacement of the failed SPP in step 3. If physical inspection is for some reason not practical, the following commands will provide further information. c. Determine DCU and SPP status -> show /System/DCUs d. Determine failed DCU status. -> show /System/DCUs/DCU_x If the above command fails on the same DCU_x as the failed SPPx, then it is likely the SPP / DCU is powered off. e. Review SPP fault status. The 'show /SYS/SPPx' output contains a list of Properties one can use to confirm the fault state, power on status, part and serial number information, etc. Here is an example from a faulted SPP1: -> show /SYS/SPP1 Properties:
-> stop /HOSTx g. Stop the affected SPP -> stop /SYS/SPPx 4. Replace failed SPP module a. Use an approved and tested ground strap to protect the equipment from ESD damage. b. Remove the SPP: Use a T20 Torx screwdriver to loosen the captive screws (1) on the extraction levers. c. Swing the extraction levers 90 degrees out. d. Pull the SPP out of the slot. Place it on a static-safe mat.
Install a SPP
8. Restart the PDomain. -> start /HOSTx OBTAIN CUSTOMER ACCEPTANCE WHAT ACTION DOES THE CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE: Restart software applications per applicable administration guides to resume system operation. Use one of these Oracle ILOM commands to display faulty components. Attachments This solution has no attachment |
||||||||||||||||||
|