![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||
Solution Type Technical Instruction Sure Solution 1954706.1 : How to Replace a SPARC T7-1 Motherboard [VCAP]
In this Document
Oracle Confidential PARTNER - Available to partners (SUN). Applies to:SPARC T7-1 - Version All Versions to All Versions [Release All Releases]Information in this document applies to any platform. GoalHow to replace a SPARC T7-1 Motherboard Solution
************************************************************************************** NOTE: For ALL scenarios where an AC power down or AC power cycle is required for a T7-x server, please always use the steps in doc 1571054.1 prior to physically removing AC power cables from the server.
ESD Caution:
Contamination Caution:
DISPATCH INSTRUCTIONS PROBLEM OVERVIEW: SPARC T7-1 Motherboard Replacement NOTE: Customer should perform an orderly and graceful shutdown of applications and OS to get the OpenBoot PROM prompt. Run the printenv command and make a note of any OpenBoot PROM variables that have been modified. For ALL scenarios where an AC power down or AC power cycle is required for a T7-x server, please always use the steps in doc 1571054.1 prior to physically removing AC power cables from the server.
DAMAGE ALERT: Perform a visual inspection of the replacement part to make sure that there are no damaged components, connectors, bent pins, damaged packages during shipping, etc). If the part is damaged, don't install it into the system, order a new part. Handle with caution and package carefully the return FRU just as the new FRU was packaged, to avoid any damages during shipping.
Note - The LDOM configuration (if any) needs to be saved before motherboard replacement to avoid loss of LDOM configuration, refer to doc 1019720.1 for details.
Note - A data backup is not a prerequisite but is a wise precaution.
WHAT ACTION DOES THE ENGINEER NEED TO TAKE: Replace the Motherboard 1. Log into the ILOM and check the fruid container values and sync them if needed. a. To avoid mismatched fruid values causing a failure after a motherboard replacement the fruid data should be confirmed to have matching data in at least the Primary (DBP) and Backup1 (SPM) containers so that the motherboard will have it's container updated automatically after replacement. Go into restricted mode and use the showpsnc command to check this. -> set SESSION mode=restricted
WARNING: The "Restricted Shell" account is provided solely to allow Services to perform diagnostic tasks. [(restricted_shell) t7-1-bur09-a-sp:~]# showpsnc Primary: fruid:///SYS/DBP Backup 1: file:///persist/psnc_backup1.xml Backup 2: fruid:///SYS/MB Element | Primary | Backup1 | Backup2 ------------------+-------------------+-------------------+------------------- PPN 33316362+1+1 33316362+1+1 33316362+1+1 PSN AK00327467 AK00327467 AK00327467 MACADDR 00:10:E0:70:74:EE 00:10:E0:70:74:EE 00:10:E0:70:74:EE HOSTID 867074ee 867074ee 867074ee Product Name SPARC T7-1 SPARC T7-1 SPARC T7-1 [(restricted_shell) t7-1-bur09-a-sp:~]# exit b. The above example shows a system with all three containers properly in sync. If the output from the system does not show all of the containers with matching values then you should reset the SP and then re-check the values again. An ILOM reset will attempt to auto-populate the matching values if one container is out of sync. -> reset /SP
Are you sure you want to reset /SP (y/n)? y Performing reset on /SP c. After an ILOM reset if the Primary and Backup1 containers match then proceed with the following steps to replace the motherboard. If these two containers do not match then DO NOT proceed with the replacement yet. If you are unfamiliar with this process and require assistance please reference the steps for using copypsnc to fix the serial number detailed in the "How to update product serial number on systems which implement TLI functionality (Doc ID 1280913.1)" and contact the TSC if needed. How to access service mode and escalation mode on ILOM 3.x and later platforms (Doc ID 1019946.1). 2. Prepare the server for service. a. Power off the server and disconnect the power cords from the power supplies. Caution - Components inside the chassis might be hot. Use caution when servicing components inside the chassis.
Note - When replacing the motherboard, you will need to remove the SPM, SCC PROM and eUSB drive from the old motherboard and install these components on the new motherboard. The SP contains the Oracle ILOM system configuration data, and the SCC PROM contains the system host ID and MAC address. Transferring these components preserves the system-specific information stored on these modules. Whenever you replace the motherboard or the SPM, you must update the firmware so the portions of firmware in the SPM and on the motherboard are consistent.
3. Remove the top cover and open the fan door to remove all of the fan modules. 4. Open the clear plastic air duct assembly cover by lifting the edge of the cover closest to the rear of the server. The cover can stand in its upright position but perform the next steps if you need to remove the cover: Note - Always remove transceivers from a PCIe card(s) before removing the card from the server.
Note - Keep track of which slot each PCIe card was in so you can return them to their original positions.
6. If you are replacing the motherboard, remove the following components and place them on an ESD mat: a. Both memory risers (if installed) and All DIMMs on motherboard. Note - See "How to Replace a SPARC T7-1 Memory Mezzanine (Riser) (Doc ID 1964501.1)" on how to remove and install memory risers.
Note - Keep track of which slot each DIMM came from and return them to their original position.
Note - Keep track of which side each memory riser came from and return them to their original position.
b. SCC PROM d. eUSB Note - The eUSB is below the MRs. Remove the eUSB drive from old motherboard. You will reinstall the eUSB drive on the new motherboard
7. Disconnect all the cables from the motherboard. a. Disconnect the ribbon cables from the motherboard that go to the left and right LED front indicator modules (The left and right LED indicator modules do not need to be removed to replace the motherboard). Caution - Some of these data cables are delicate. Please use caution while disconnecting them and ensure they are safely out of the way when removing and installing the motherboard.
8. Disconnect the mid-wall from the chassis. 9. Lift the mid-wall out of the chassis. 10. Release the power supplies and pull them slightly out of the server (The power supplies do not need to be removed from the chassis to lift out the motherboard). 11. Lift the motherboard out of the chassis (You can use the bar by the rear I/O panel and the metal handle in front of the cable channel as handles to lift the motherboard). Note - Angle the motherboard up as you lift it out of the chassis.
12. Place the motherboard assembly on an ESD mat. 13. Install new motherboard into the server chassis. Caution: Ensure that the remaining cables do not get caught on edges of the motherboard or under it.
b. Tilt the motherboard to the right side so it gets under the power supply. 14. Insert the mid-wall into the chassis. 15. Fasten the mid-wall to the chassis. a. Tighten the four green captive screws that secure the mid-wall to the bottom of the chassis (Use a No. 2 Phillips screwdriver to tighten the captive screws). 16. Insert the power supplies back into their connectors on the motherboard (Make sure they line up correctly prior to inserting to prevent damage to the connector). 17. Reconnect all the cables to the motherboard. a. Connect the cable from the server intrusion switch. Caution - Some of these data cables are delicate. Please use caution while connecting them and ensure they are safely out of the way when removing and installing the motherboard.
d. Connect the DVD drive cable to the motherboard (Thread the DVD cable through the chassis mid-wall to reach the motherboard connector). 18. Install all the fan modules. 19. If you replaced the motherboard, install the following components saved from original motherboard: a. Both memory risers (if applicable) and All DIMMs. Note - See "How to Replace a SPARC T7-1 Memory Mezzanine (Riser) (Doc ID 1964501.1)" on how to remove and install memory risers.
Note - Keep track of which slot each DIMM came from and return them to their original position.
Note - Keep track of which side each memory riser came from and return them to their original position.
b. SCC PROM. 20. Install all PCIe cards saved from original motherboard. Note - Always install transceivers for a PCIe card(s) after installing the card(s) in the server.
Note - Remember to return each PCIe card to their original positions.
21. Close the clear plastic air duct assembly cover by rotating it down over the motherboard / two memory risers and then slightly pressing in the two tabs to secure it. See following if you need to reattach the cover: a. Open the fan cover. 23. Return the Server to operation. a. Remove any anti-static measures that were used. Unrecognized Chassis: This module is installed in an unknown or unsupported chassis. You must upgrade the firmware to a newer version that supports this chassis.
If you see this message, go on to Step #25. Note - Whenever you replace the SPM or the motherboard, update the firmware on the server so the portions of firmware in the two components remain consistent.
25. Download the system firmware. a. If needed, configure the SP network port to enable the firmware image to be downloaded. Refer to the Oracle ILOM documentation for network configuration instructions. Note - You can load any supported system firmware version, including the firmware revision that had been installed prior to the replacement of the motherboard.
Note - When replacing a motherboard on T3, T4, T5, and T7 systems, certain procedures will need to be followed if on board hardware raid volumes are configured. Re-activate any RAID volumes that existed prior to replacing the motherboard. Only perform this task if your system had RAID volumes prior to replacing the motherboard. For details please consult doc 1387771.1.
Note - The LDOM configuration (if any) needs to be restored after motherboard replacement to avoid loss of LDOM configuration, refer to doc 1019720.1 for details.
26. Power on server. Verify that the Power/OK indicator led lights steady on. 27. Set the system serial number/fruid data if needed. -> set SESSION mode=restricted
WARNING: The "Restricted Shell" account is provided solely to allow Services to perform diagnostic tasks. [(restricted_shell) t7-1-bur09-a-sp:~]# showpsnc Primary: fruid:///SYS/DBP Backup 1: file:///persist/psnc_backup1.xml Backup 2: fruid:///SYS/MB Element | Primary | Backup1 | Backup2 ------------------+-------------------+-------------------+------------------- PPN 33316362+1+1 33316362+1+1 33316362+1+1 PSN AK00327467 AK00327467 0000000000 MACADDR 00:10:E0:70:74:EE 00:10:E0:70:74:EE 00:10:E0:70:74:EE HOSTID 867074ee 867074ee 867074ee Product Name SPARC T7-1 SPARC T7-1 SPARC T7-1 [(restricted_shell) t7-1-bur09-a-sp:~]# c. When the motherboard is replaced the Backup2 fruid container will likely not match the Primary entry. If it does not you must enter escalation or service mode to fix it (if all three entries match this step is done). -> cd /SP/users
/SP/users -> create escuser Creating user... Enter new password: ******** Enter new password again: ******** Created /SP/users/escuser -> set escuser role=aucros Set 'role' to 'aucros' -> show escuser /SP/users/escuser Targets: ssh Properties: role = aucros password = ***** g. Set the check_physical_presence to false and then exit from the ILOM so that you can login as the newly created user. -> set /SP check_physical_presence=false
Set 'check_physical_presence' to 'false' -> show /SP check_physical_presence /SP Properties: check_physical_presence = false -> exit h. Login using the escuser login and enter escalation mode using the password that was provided by the TSC. t7-1-bur09-a-sp login: escuser
Password: Oracle(R) Integrated Lights Out Manager Version 3.2.4.34 r95732 Copyright (c) 2014, Oracle and/or its affiliates. All rights reserved. Warning: The system appears to be in manufacturing test mode. Contact Service immediately. Hostname: t7-1-bur09-a-sp -> cd /SP/users/ecsuser/escalation -> set SESSION mode=escalation Password:**** **** **** **** **** *** *** **** **** **** **** **** **** **** **** **** *** *** **** *** **** **** **** *** **** **** *** **** *** * Short form password is: NOSE HAAG MED [(escalation_mode) t7-1-bur09-a-sp:~]# i. Use the showpsnc command to confirm the current container values. Confirm that the primary container has a serial number (the value on the PSN line) that matches the system serial number. The system serial number can be checked by comparing to the serial number RFID tag on the front left hand side of the server. After confirming that there is a valid fruid primary use the copypsnc command to write the good data from the primary to the backup2 container on the MB. The following example shows copying from primary to the backup2, but you could also copy from backup1 if needed. [(escalation mode) t7-1-bur09-a-sp:~]# showpsnc [(escalation mode) t7-1-bur09-a-sp:~]# copypsnc Primary Backup2 [(escalation mode) t7-1-bur09-a-sp:~]# showpsnc j. At this point if all of the fruid containers match and have the correct serial number data this step is done. If more than one of the fruid containers had non-valid entries then the copypsnc command should be used to copy over the valid data to the other container that is not valid. (ie. "copypsnc Primary Backup1") After confirming all fruid data is correct reset the ILOM to confirm that the fruid data persists through a reboot and remove the escalation user if needed. -> reset /SP
Are you sure you want to reset /SP (y/n)? y Performing reset on /SP .......... ***login as the root user again and check the fruid data*** -> set SESSION mode=restricted WARNING: The "Restricted Shell" account is provided solely to allow Services to perform diagnostic tasks. [(restricted_shell) t7-1-bur09-a-sp:~]# showpsnc Primary: fruid:///SYS/DBP Backup 1: file:///persist/psnc_backup1.xml Backup 2: fruid:///SYS/MB Element | Primary | Backup1 | Backup2 -------------------+------------------------+-------------------------+------------------- PPN 33316362+1+1 33316362+1+1 33316362+1+1 PSN AK00327467 AK00327467 AK00327467 MACADDR 00:10:E0:70:74:EE 00:10:E0:70:74:EE 00:10:E0:70:74:EE HOSTID 867074ee 867074ee 867074ee Product Name SPARC T7-1 SPARC T7-1 SPARC T7-1 [(restricted_shell) t7-1-bur09-a-sp:~]# exit -> cd /SP/users /SP/users -> delete escuser Are you sure you want to delete /SP/users/escuser (y/n)? y Deleted /SP/users/escuser k. If trouble is encountered during any of the steps of accessing escalation mode and fixing the fruid containers please contact the TSC for assistance.
How to verify the Motherboard is working properly 1. Log into ILOM to confirm if motherboard status is working properly. Sample: -> show /SYS/MB
/SYS/MB
Properties: type = Motherboard
Commands:
-> 2. Check ILOM event log to see if any error related motherboard. -> show /SP/faultmgmt
-> show /SP/logs/event/list
OBTAIN CUSTOMER ACCEPTANCE WHAT ACTION DOES THE FE/CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:
References<NOTE:1571054.1> - Performing an AC power cycle on the T3/T4/T5/S7/T7/T8 Servers<NOTE:1280913.1> - How to update System, Chassis, and Product level Key Identity Properties on ILOM based systems which implement Top Level Identifier (TLI) functionality <NOTE:1019946.1> - How to access service mode and escalation mode on ILOM 3.x and later platforms Attachments This solution has no attachment |
||||||||||||||||
|