![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||
Solution Type Technical Instruction Sure Solution 1988106.1 : M8-8 / M7-8 / M7-16 How to replace a Faulty Service Processor (SP) in a CMIOU chassis or in a Switch chassis [VCAP]
In this Document
Oracle Confidential PARTNER - Available to partners (SUN). Applies to:SPARC M7-16 - Version All Versions and laterSPARC M7-8 - Version All Versions and later Oracle SuperCluster M7 Hardware - Version All Versions and later SPARC M8-8 - Version All Versions and later Information in this document applies to any platform. GoalCAP PROBLEM OVERVIEW: M8-8 / M7-8 / M7-16 Service Processor (SP) in a CMIOU chassis or in a Switch chassis - SP Failure ********************************************************************* ********************************************************************* ESD Caution:
Contamination Caution:
Solution
DISPATCH INSTRUCTIONS WHAT SKILLS DOES THE ENGINEER NEED: M8-8 / M7-8 / M7-16 Product Training/Experience TASK COMPLEXITY: 3 TIME ESTIMATE: 60 minutes HOT replacement FIELD ENGINEER INSTRUCTIONS WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? : Depending on server model and populated CMIOU slots it may be necessary to stop the HOST controlled by the SPM on the target SP for removal. ILOM will not permit SPM role failover for Degraded pcie path SPMs (e.g., half populated M7-8 SuperCluster chassis with CMIOU occupying only slots 0,3,5, and 7). The bug fix for 24449320 (9.7.3.b and higher) will permit you to failover the SPM role to a Degraded pcie path SPM when the HOST is stopped. WHAT ACTION DOES THE ENGINEER NEED TO TAKE: Important note : After replacing the SP, make sure that the FPGA version is up to date and updated as appropriate - Refer to doc 2085572.1 Determining which SP requires service 1. Use one of these Oracle ILOM commands to display faulty components:
2. Locate the faulty SP by its amber Service Required LED Preparing to remove an SP It is recommended that you replace one SP in the system at a time.
1. Determine which SP is managing system activity
SPMs in SPARC M7-8 servers support PCIe connections. On these servers, hosts must either be powered off or must be running the Oracle Solaris OS for an SP to be removed. Do not remove SP PCIe devices when the server is booting or when the host is at the Open Boot prompt. 2. Ensure that the SPM on the SP is not managing hardware
3. Determine the next step
4.Change the Active SP assignment
5. Prepare the SPM for removal
6. For a SPARC M7-8 server with 2 PDomains only, change DCU1 SP assignment, when necessary
NOTE: It may be necessary to stop the HOST controlled by the SPM on the target SP for removal. ILOM will not permit SPM role failover for Degraded pcie path SPMs (e.g., half populated M7-8 SuperCluster chassis with CMIOU occupying only slots 0,3,5, and 7). The bug fix for 24449320 (9.7.3.b and higher) will permit you to failover the SPM role to a Degraded pcie path SPM when the HOST is stopped. The following ILOM command can be helpful to see the status of all SPM and the health of pcie paths 7. Determine the next step
8. Verify that the PCIe devices have been taken offline
NOTE: The following ILOM command can be helpful to see the status of all SPM and the health of pcie paths 9. Verify that the SPM on the SP is ready for removal
10. Prepare the SP for removal
When the SP is ready to remove, the health value will display Offline, and the blue Ready to Remove LED will light. 11. If you can access the SP, back up the configuration information
Removing an SP Only remove an SP when you have verified that the blue Ready to Remove LED on the SP is lit.
1. Use a grounding strap to protect the equipment from ESD damage 2. Locate the lit blue Ready to Remove LED from the rear of the server 3. Label, disconnect and relocate the cables attached to the serial and network ports 4. Pinch the ejector latches and open the ejector arms 5. Pull the SP halfway out of the SP tray 6. Close the ejector arms 7. Carefully remove the SP from the SP tray, using two hands, and avoid bumping the rear connectors 8. Place the SP on an antistatic mat
Installing an SP 1. Insert the SP into the slot and slide it in until the extraction levers start to close 2. Close the extraction levers fully until they lock into place Note: FE should connect to the serial port of the newly installed SP in order to collect POST and ILOM startup output. In case od unexpected FRU behavior the data should be uploaded into the SR 3. Refer to SPARC M7 Series Servers : SP or SPP FPGA firmware update (Doc ID 2085572.1) and check if any FPGA update is required. 4. Reinstall the serial management and network management cables
OBTAIN CUSTOMER ACCEPTANCE WHAT ACTION DOES THE CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:
Verify that there is no faulty components
Perform one of the following tasks based on your verification results
The newly installed SP will update its system firmware from the Active-SP in the system.
Verify that the SP date is correct.
If TPM was initialized on the replaced SP, the proper steps should be completed. See "Securing Systems and Attached Devices in Oracle® Solaris 11.3" in the Solaris documentation. Verify that the Versaboot fallback image is installed.
Return the faulted component to Oracle.
======================== Other info ===================== REFERENCE INFORMATION: Service Manual: http://docs.oracle.com/cd/E55211_01/html/E55215/index.html Attachments This solution has no attachment |
||||||||||||||||
|