Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1961423.1
Update Date:2018-05-14
Keywords:

Solution Type  Technical Instruction Sure

Solution  1961423.1 :   How to Replace an Oracle Server X5-2L, X6-2L 24-Slot Disk Backplane [VCAP]  


Related Items
  • Oracle Server X6-2L
  •  
  • Oracle Server X5-2L
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: x64-CAP VCAP
  •  
  • Microlearning>Video>ML-VID-VCAP
  •  




Oracle Confidential PARTNER - Available to partners (SUN).
Reason: fru cap

Applies to:

Oracle Server X5-2L - Version All Versions to All Versions [Release All Releases]
Oracle Server X6-2L - Version All Versions to All Versions [Release All Releases]
x86_64

Goal

How to Replace an Oracle Server X5-2L, X6-2L 24-Slot Disk Backplane.

Solution

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED?:
Oracle Server X5-2L, X6-2L Training

TIME ESTIMATE: 60 minutes

TASK COMPLEXITY: 1

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:

PROBLEM OVERVIEW: An Oracle Server X5-2L, X6-2L 24-Slot Disk Backplane needs replacement

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

If the system is still up and functioning, customer should perform an orderly and graceful shutdown of applications and OS.  Then power off the server and remove the AC power cords from the system.

A data backup is not a prerequisite but is a wise precaution.

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:

X5-2L Remove the Storage Drive Backplane for Twenty-Four Drive Systems:
http://docs.oracle.com/cd/E41033_01/html/E48325/cnpsm.baeigbce.html#scrolltoc

X6-2L Remove the Storage Drive Backplane for Twenty-Four Drive Systems:
http://docs.oracle.com/cd/E62172_01/html/E62184/baeigbce.html#scrolltoc

1. Log into the ILOM check the fruid container values and sync them if needed.

  1. To avoid mismatched fruid values causing a failure after a disk backplane replacement the fruid data should be confirmed to have matching data in at least the Backup1 (MB) and Backup2 (PS0) containers so that the disk backplane will have it's container updated automatically after replacement. Go into restricted mode and use the showpsnc command to check this.  
    -> set SESSION mode=restricted

    WARNING: The "Restricted Shell" account is provided solely
    to allow Services to perform diagnostic tasks.

    [(restricted_shell) x5-2l]# showpsnc
    Primary: fruid:///SYS/DBP
    Backup 1: fruid:///SYS/MB
    Backup 2: fruid:///SYS/PS0

    Element           | Primary
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 1449NM7002
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup1
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 1449NM7002
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup2 (7)        | Backup2 (8)
    ------------------+-------------------+-------------------
    PPN                 33154888+1+1        33154888+1+1
    PSN                 1449NM7002          1449NM7002
    Product Name        ORACLE SERVER X5-2L ORACLE SERVER X5-2L
    [(restricted_shell) x5-2l]# exit
     
  2. The above example shows a system with all three containers properly in sync. If the output from the system does not show all of the containers with matching values then you should reset the SP and then re-check the values again. An ILOM reset will attempt to auto-populate the matching values if one container is out of sync.  
    -> reset /SP
    Are you sure you want to reset /SP (y/n)? y
    Performing reset on /SP
     
  3. After an ILOM reset if the Backup1 and Backup2 containers match then proceed with the following steps to replace the disk backplane. If these two containers do not match then DO NOT proceed with the replacement yet.
  4. If the containers do not match you will need to use the copypsnc command from service or escalation mode to copy the data from the good container so that the Backup1 and Backup2 containers match (Primary is the DBP and we are about to replace this so it is not as important at this step). If you are unfamiliar with this process and require assistance please reference the steps for using copypsnc to fix the serial number detailed in the "How to update product serial number on systems which implement TLI functionality (Doc ID 1280913.1)" and contact the TSC if needed. How to access service mode and escalation mode on ILOM 3.x and later platforms (Doc ID 1019946.1)
  5. After the fruid data in the Backup1 and Backup2 containers have been confirmed to match proceed with the following steps.

2. Prepare the server for service.

  1. Power off the server and disconnect the power cords from the power supplies.
  2. Extend the server to the maintenance position in the rack.
  3. Attach an anti-static wrist strap.

3. Remove the Fan Assembly Door and all of the Fan Modules.

  1. Using a No. 2 Phillips screwdriver, remove the two screws on each side of the chassis that secure the fan assembly door.
  2. Slide the fan assembly door toward the rear of the server, then lift and remove the door from the chassis.
  3. Using your thumb and forefinger, loosen the captive screw that secures the fan module in the chassis (use a No. 2 Phillips screwdriver if it is too tight to loosen with your fingers).
  4. Grasp both the captive screw and the opposite end of the module and lift the fan module straight up and out of the chassis (do not rock it back and forth), and set it aside on an antistatic mat.

4. Remove all of the storage drives from the storage drive cage.

  1. On each drive, push the latch release button to open the latch.
  2. Grasp the latch and pull the drives far enough out so that they disengage from the backplane.
  3. The drives do not need to be fully removed from the disk enclosure, but if they are then place the drives on an antistatic mat making sure to note the slot locations that the drives were removed from, so that they can be re-installed in their proper slots.

5. Disconnect the cables and remove the storage drive backplane.

  1. Disconnect the two power cables and the auxiliary signal cable from the storage drive backplane.
  2. If present, disconnect the two optional NVMe cables from the storage drive backplane.
  3. Disconnect the SAS cable from the storage drive backplane to the rear-mounted storage drives, and the two SAS cables from the storage drive backplane to the Oracle Storage 12 Gb/s SAS PCIe RAID HBA.  Note the cable connection locations in order to ease proper reconnection of the cables.
  4. Using a No. 2 Phillips screwdriver, loosen the two spring-mounted screws that secure the storage drive backplane to the chassis.
  5. Lift the storage drive backplane up to release it from the standoff hooks.
  6. Pull the storage drive backplane away from the standoff hooks and out of the chassis.
  7. Place the storage drive backplane on an antistatic mat.

6. Install the new storage drive backplane and connect the cables.

  1. Lower the storage drive backplane into the server, and position it to engage the standoff hooks.
  2. Using a No. 2 Phillips screwdriver, install and tighten the two spring-mounted screws to secure the storage drive backplane to the chassis.
  3. Reconnect the SAS cable to the storage drive backplane from the rear-mounted storage drives, and the two SAS cables to the storage drive backplane from the Oracle Storage 12 Gb/s SAS PCIe RAID HBA.
  4. If present, reconnect the two optional NVMe cables to the storage drive backplane.
  5. Reconnect the two power cables and the auxiliary signal cable to the storage drive backplane.

7. Re-install the storage drives into the storage drive cage.

  1. Making sure to install the drives back into the same slots from which they were removed, align the drive to the drive slot.
  2. Slide the drive into the bay until the drive is fully seated.
  3. Close the drive latch to lock the drive in place.

8. Re-install the Fans and the Fan Assembly Door.

  1. Install the fan modules into the server.
  2. For each module press down on the fan module and apply firm pressure to fully seat the fan module.
  3. Using your thumb and forefinger, tighten the captive screw to secure each of the fan modules to the chassis. Then use a No. 2 Phillips screwdriver to tighten the screw an additional 1/4 turn to secure the fan module to the chassis.
  4. Place the fan assembly door on the chassis and slightly over the fan assembly.
  5. Slide the fan assembly door forward and under the lip of the forward top cover until it latches into place.
  6. Using a No. 2 Phillips screwdriver, install and tighten the two screws on each side of the chassis.

9. Return the Server to operation

  1. Remove any anti-static measures that were used.
  2. Return the server to it's normal operating position within the rack.
  3. Re-install the AC power cords and any data cables that were removed.
  4. Power on server. Verify that the Power/OK indicator led lights steady on.

10. Set the system serial number/fruid data if needed.

  1. The disk backplane is the primary fruid container in this server so when it is replaced you will normally need to fix the serial number information.
  2. login to the ILOM as root and then enter the restricted shell to check the fruid values. Follow the example below to enter restricted shell and use the showpsnc command   
    -> set SESSION mode=restricted

    WARNING: The "Restricted Shell" account is provided solely
    to allow Services to perform diagnostic tasks.

    [(restricted_shell) x5-2l:~]# showpsnc
    Primary: fruid:///SYS/DBP
    Backup 1: fruid:///SYS/MB
    Backup 2: fruid:///SYS/PS0

    Element           | Primary
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 0000000000
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup1
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 1449NM7002
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup2 (7)        | Backup2 (8)
    ------------------+-------------------+-------------------
    PPN                 33154888+1+1        33154888+1+1
    PSN                 1449NM7002          1449NM7002
    Product Name        ORACLE SERVER X5-2L ORACLE SERVER X5-2L
    [(restricted_shell) x5-2l:~]#
  3. When the disk backplane is replaced the Primary fruid container will likely not match the Backup entries. If it does not you must enter escalation or service mode to fix it (if all three entries match this step is done).
  4. Contact the TSC to request an escalation password (service mode will work also if just the copypsnc command ends up needing to be used, if the setpsnc command is needed escalation mode is required. setpsnc is not covered in this procedure).
  5. Provide your TSC contact the output from the following ILOM commands- "version", "show /SYS product_serial_number", and "show /SP/clock". If the product_serial_number information does not give good output then provide the showpsnc output that was seen in step b above as well.
  6. The TSC will provide an escalation password that is made up of 32 short words. Follow the example below to create a new user with the 'Service' role assigned. The Service role is required to access service or escalation modes. In the following example we will create a user named 'escuser' with the service role.
    -> cd /SP/users
    /SP/users
    -> create escuser
    Creating user...
    Enter new password: ********
    Enter new password again: ********
    Created /SP/users/escuser
    -> set escuser role=aucros
    Set 'role' to 'aucros'
    -> show escuser
    /SP/users/escuser
    Targets:
    ssh
    Properties:
    role = aucros
    password = *****
  7. Set the check_physical_presence to false and then exit from the ILOM so that you can login as the newly created user.
    -> set /SP check_physical_presence=false
    Set 'check_physical_presence' to 'false'
    -> show /SP check_physical_presence
    /SP
    Properties:
    check_physical_presence = false

    -> exit
  8. Login using the escuser login and enter escalation mode using the password that was provided by the TSC.
    X5-2l login: escuser
    Password:

    Oracle(R) Integrated Lights Out Manager

    Version 3.2.4.36 r95733

    Copyright (c) 2014, Oracle and/or its affiliates. All rights reserved.

    Warning: The system appears to be in manufacturing test mode.
    Contact Service immediately.

    Hostname: x5-2l

    -> cd /SP/users/ecsuser/escalation
    -> set SESSION mode=escalation                            
    Password:**** **** **** **** **** *** *** **** **** **** **** **** **** **** **** **** *** *** **** *** **** **** **** *** **** **** *** **** *** *
    Short form password is:  NOSE HAAG MED

    [(escalation_mode) X5-2l:~]#
  9. Use the showpsnc command to confirm the current container values. Confirm that one of the backup containers has a serial number (the value on the PSN line) that matches the system serial number. The system serial number can be checked by comparing to the serial number RFID tag on the front left hand side of the server. After confirming that there is a valid fruid backup use the copypsnc command to write the good data from the backup to the primary container on the DBP. The following example shows copying from backup1 to the primary but you could also copy from backup2 if needed.
    [(escalation_mode) X5-2l:~]# showpsnc
    Primary: fruid:///SYS/DBP
    Backup 1: fruid:///SYS/MB
    Backup 2: fruid:///SYS/PS0

    Element           | Primary
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 0000000000
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup1
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 1449NM7002
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup2 (7)        | Backup2 (8)
    ------------------+-------------------+-------------------
    PPN                 33154888+1+1        33154888+1+1
    PSN                 1449NM7002          1449NM7002
    Product Name        ORACLE SERVER X5-2L ORACLE SERVER X5-2L
    [(escalation_mode) X5-2l:~]# copypsnc Backup1 Primary
    [(escalation_mode) X5-2l:~]# showpsnc
    Primary: fruid:///SYS/DBP
    Backup 1: fruid:///SYS/MB
    Backup 2: fruid:///SYS/PS0

    Element           | Primary
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 1449NM7002
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup1
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 1449NM7002
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup2 (7)        | Backup2 (8)
    ------------------+-------------------+-------------------
    PPN                 33154888+1+1        33154888+1+1
    PSN                 1449NM7002          1449NM7002
    Product Name        ORACLE SERVER X5-2L ORACLE SERVER X5-2L
    [(escalation_mode) X5-2l:~]# exit

  10. At this point if all of the fruid containers match and have the correct serial number data this step is done. If more than one of the fruid containers had non-valid entries then the copypsnc command should be used to copy over the valid data to the other container that is not valid. (ie. "copypsnc Backup1 Backup2" to copy backup1 to backup2) After confirming all fruid data is correct reset the ILOM to confirm that the fruid data persists through a reboot and remove the escalation user if needed.
    -> reset /SP
    Are you sure you want to reset /SP (y/n)? y
    Performing reset on /SP
    ..........

    ***login as the root user again and check the fruid data***

    -> set SESSION mode=restricted

    WARNING: The "Restricted Shell" account is provided solely
    to allow Services to perform diagnostic tasks.

    [(restricted_shell) x5-2l]# showpsnc
    Primary: fruid:///SYS/DBP
    Backup 1: fruid:///SYS/MB
    Backup 2: fruid:///SYS/PS0

    Element           | Primary
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 1449NM7002
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup1
    ------------------+-------------------
    PPN                 33154888+1+1
    PSN                 1449NM7002
    Product Name        ORACLE SERVER X5-2L

    Element           | Backup2 (7)        | Backup2 (8)
    ------------------+-------------------+-------------------
    PPN                 33154888+1+1        33154888+1+1
    PSN                 1449NM7002          1449NM7002
    Product Name        ORACLE SERVER X5-2L ORACLE SERVER X5-2L
    [(restricted_shell) x5-2l]#
    exit

    -> cd /SP/users
    /SP/users
    -> delete escuser
    Are you sure you want to delete /SP/users/escuser (y/n)? y
    Deleted /SP/users/escuser
  11. If trouble is encountered during any of the steps of accessing escalation mode and fixing the fruid containers please contact the TSC for assistance.

 

How to verify the Disk Backplane is working properly.

     1.  Log into ILOM to confirm if disk backplane status is working properly.

Sample

-> show /SYS/DBP

 /SYS/DBP
    Targets:
        HDD0
        HDD1
        HDD2
        HDD3
        HDD4
        HDD5
        HDD6
        HDD7
        HDD8
        HDD9
        HDD10
        HDD11
        HDD12
        HDD13
        HDD14
        HDD15
        HDD16
        HDD17
        HDD18
        HDD19
        HDD20
        HDD21
        NVME0
        NVME1
        NVME2
        NVME3
        SASEXP

    Properties:
        type = Disk Backplane
        ipmi_name = DBP
        fru_description = ASSY,24DBP,2U
        fru_manufacturer = MiTAC International Corporation
        fru_part_number = 7069338
        fru_rev_level = 01
        fru_serial_number = 489089M+14368B001X
        fault_state = OK
        clear_fault_action = (none)

    Commands:
        cd
        set
        show

->



    2.  Check ILOM event log to see if any error related backplane.

-> show /SP/faultmgmt
-> show /SP/logs/event/list

 

OBTAIN CUSTOMER ACCEPTANCE

WHAT ACTION DOES THE CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:

Boot up system and verify full functionality

REFERENCE INFORMATION:

Oracle Server X5-2L Documentation:
http://docs.oracle.com/cd/E41033_01/index.html

Oracle Server X6-2L Documentation:
http://docs.oracle.com/cd/E62172_01/index.html

Oracle Integrated Lights Out Manager (ILOM) 3.2 Documentation:
http://docs.oracle.com/cd/E37444_01/index.html


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback