Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-2085272.1
Update Date:2016-12-21
Keywords:

Solution Type  Technical Instruction Sure

Solution  2085272.1 :   MaxRep: How to Remove and Replace a Motherboard in a MaxRep3 Engine  


Related Items
  • Pillar Axiom Replication Engine (MaxRep)
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: DISK-CAP VCAP
  •  


Instructions on how to replace motherboard in a MaxRep3 Engine.

In this Document
Goal
Solution
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: FRU

Applies to:

Pillar Axiom Replication Engine (MaxRep) - Version 3.0 to 3.0 [Release 3.0]
Information in this document applies to any platform.

Goal

Outline the steps required to replace a MaxRep3 Engine motherboard.

 

Solution

DISPATCH INSTRUCTIONS

- WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED:

Product knowledge, MaxRep3 Engine

TIME ESTIMATE: 120 minutes

TASK COMPLEXITY: 2

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:

This Action Plan requires the use of 3 items that should be obtained and kept by the FE:

  • CPU Installation/Removal Tool - part # 7026168
  • Thermal Grease Kit - part # 350-1271 and contains:
    • Thermal Grease Syringe - 310-0065
    • Alcohol wipes - 250-1802

If you are not very familiar with servicing the Netra Server X3-2/Sun Netra X4270 M3 Server upon which the MaxRep3 Engine is based, it is highly recommended that you look at the animation videos that detail the replace procedures covered in this CAP.  They are available at the Oracle's Sun Server X3-2 Animations.

QRC for this procedure: 

 MaxRep3 Motherboard


PROBLEM OVERVIEW: 

MaxRep3 Engine motherboard.

What: A motherboard in a MaxRep3 Engine has failed and needs to be replaced. 

Where: Assuming that the MaxRep3 Engine is still functioning, there will be a failed Hardware Status in the MaxRep3 User Interface. 

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY?

The the motherboard failure will likely have a warning status but depending on how severe the damage is, the entire MaxRep3 Engine itself may be in a missing state.

 

NOTE: Please review <Document 1942676.1> FS System: How to Disable Call-Home to Prevent Automatic Service Request (ASR) Generation before proceeding with the procedure below. The steps contained therein are provided to allow an administrator to deactivate a particular ASR-enabled array while performing maintenance or troubleshooting. This will prevent any additional Service Requests from being created unnecessarily.

 

NOTE: The MaxRep3 Engine uses a quorum mechanism for Key Identity Properties (KIP).  The quorum is comprised of the motherboard, disk backplane and power distribution bus which are all encoded with the Product Serial Number (PSN) of the MaxRep3 Engine.  At least two of these must agree on the correct PSN or the MaxRep3 Engine will NOT boot.  So as to avoid this problem, this process has the user confirm the PSNs are in sync before attempting the replacement.  NEVER replace one of these quorum devices if the PSNs are not in sync and NEVER replace two of these items at the same time.

  

WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE:

  1. Confirm Product Serial Number Containers (PSNCs) are currently synchronized.
    1. Use ssh to directly access the MaxRep3 Engine's ILOM (default login/password is root/changeme).
    2. Enter restricted session mode and run the showpsnc command.

      -> set SESSION mode=restricted

      WARNING: The "Restricted Shell" account is provided solely
      to allow Services to perform diagnostic tasks.

      [(restricted_shell) MAXREPHOST:~]# showpsnc
      Primary: fruid:///SYS/DBP
      Backup 1: fruid:///SYS/MB
      Backup 2: fruid:///SYS/PDB

      Element           | Primary           | Backup1           | Backup2
      ------------------+-------------------+-------------------+-------------------
      PPN                 7056355             7056355             7056355
      PSN                 1312FMXXXX          1312FMXXXX          1312FMXXXX  <=== Product Serial Numbers must match
      Product Name        Sun Netra X4270 M3  Sun Netra X4270 M3  Sun Netra X4270 M3
      [(restricted_shell) MAXREPHOST:~]#
       

    3. If all 3 PSNs match, exit back to the ILOM prompt (->) and proceed to step 2.
    4. If the Disk BackPlane 0 (DBP0) and Power Distribution Board (BPD) are the same but MotherBoard (MB) is different, it is safe to proceed to step 2 since the motherboard will be replaced.
    5. If any other condition exists, STOP!! and re-engage the TSC for steps to correct before proceeding to replace the failed motherboard.

  2. Prepare MaxRep3 Engine for service procedure.
    1. Using the ILOM interface, confirm the failed part and location:
      • Browser: point a browser at the ILOM IP address and select Open Problems in the navigation tree on the left.
      • CLI: ssh to the ILOM IP address and execute "show /System/Open_Problems"
    2. Disable Call-Home to prevent spurious alerts (see <Document 1942676.1> FS System: How to Disable Call-Home to Prevent Automatic Service Request (ASR) Generation).
    3. Shutdown the MaxRep3 Engine (see <Document 2066803.1> MaxRep: How to Perform a Graceful Shutdown of the Oracle MaxRep3 Engine.
    4. Use ESD precautions.

  3. Access the MaxRep3 Engine motherboard.
    1. Deploy the anti-tip legs in the front of the rack.
    2. Slide the MaxRep3 Engine into the service position.
    3. Unplug both power cords.
    4. Loosen the two captive screws in the rear of the top cover.
    5. Slide the top cover back and remove.
    6. Disengage both power supplies from motherboard.
    7. Lift Drive Compartment into service position.
      1. Unplug the 2 SAS cables connected to the middle of the Drive Compartment - suggest they be labeled to make reconnections easier.
      2. Unplug the Drive Compartment power cable (left side) from the Drive Compartment.
      3. Unplug the USB cable from it's connector (right side)
      4. Loosen the four captive screws that secure the Drive Compartment chassis and tilt it up into the service position.
    8. Remove air duct.
    9. Remove fan tray.
      1. Remove all 5 fan modules.
      2. Remove fan tray.
    10. Remove risers.
      1. Unplug and mark any external cables.  Internal SAS cable may be left installed.
      2. Loosen 2 captive screws in each riser.
      3. Lift the riser up and clear of the chassis.

      NOTE: it is not necessary to remove the PCIe cards from the risers.  This will make reassembly easier.


    11. Disconnect PDB.
      1. Loosen the single captive screw that secures the PDB duct to the chassis and lift it free. 
      2. Remove the PDB signal cable connection from the motherboard.
      3. Remove the 4 screws that secure the PDB GND and +12V buses to the motherboard.

    12. Remove remaining cable connections from the motherboard
      1. Drive backplane power and monitoring cable (left side).
      2. LED board/Alarm cable (left side).
      3. Drive signal cable (right side).
      4. Remove and label external network and fibre cabling.

  4. Remove failed motherboard.
    1. Loosen the single captive screw at the front of the motherboard.
    2. Using the two green plastic handles, gently push the motherboard towards the rear until it is free of the chassis.
    3. Lift the motherboard out of the server's chassis and place it on an anti-static mat.  

  5. Install replacement motherboard.
    1. Using the 2 green plastic handles, insert the replacement motherboard from the rear and slide it forward to it's original position.  Take care to align the four holes that connect it to the PDB as well as the alignment tabs in the rear of the chassis.
    2. Secure motherboard by tightening captive screw in the front of the motherboard.
    3. Secure the  PDB buses to the mother board using the 4 screws previously removed.

  6. Transfer DIMMs and CPUs with their heatsinks from old motherboard to new.
    1. CPUs and heatsinks.
      1. While gently pushing down on the Heatsink, loosen 4 screws 1.5 turns each using a crossing pattern until all the screws are free.
      2. Gently rotate the heatsink back and forth slightly while pulling up to free it from the CPU.
      3. Use a supplied alcohol pad, clean the heatsink bottom and CPU top.  Be VERY careful not to damage the CPU pins or socket by applying too much pressure.  Avoid spreading the thermal grease to other surfaces .
      4. You must use a CPU Installation/Removal tool 7026168 to extract the CPU
      5. Viewing from the front, disengage the right CPU Cover retaining levers by pushing down on them then pushing them away from the CPU.  Repeat for left lever.  Be sure to move the levers all the way back to allow better access to the CPU cover and CPU itself.
      6. Open the CPU cover towards the right side to expose the CPU.
      7. Press the round button at the top center of the tool to unlock it.  Then place it over the CPU using the green arrow to align it properly.
      8. Lock the tool to the CPU by pushing the tab next to the center button on the top of the tool.  Once CPU is locked to the tool lift the tool straight up.
      9. Position the CPU in the replacement motherboard in the same location it was removed from and carefully align it with the CPU socket on the motherboard.
      10. Press the center round button on top of the tool to unlock the CPU.  Do NOT press on the CPU itself.
      11. Lift the CPU tool free and clear of the Controller.
      12. Lower the CPU Cover back into place.
      13. Re-engage the left lever back into place and then the right lever.
      14. Using the syringe it comes in, apply ~0.1 ml of thermal grease in the center of the top of the CPU.  Do NOT spread it around.
      15. Verify the underside of the heatsink is clean and if not clean it with an alcohol pad.
      16. Carefully position the heatsink over the CPU by aligning the captive mounting screws to their holes in the motherboard.
      17. Once the heatsink has made contact with the thermal grease, keep any sideways movement to a minimum.
      18. Using a crossing pattern, tighten each screw 0.5 turns until all four are securely fastened.
      19. Repeat for second CPU.

    2. DIMMs
      1. One at a time, remove a DIMM from the failed motherboard and install it in the same slot as the replacement motherboard.
      2. To ensure proper cooling, also transfer the plastic fillers in the unused DIMM slots from the failed motherboard to it's replacement.

  7. Reconnect cables.
    1. From left side to right side, reconnect the Drive Container power, monitor  PDB signal, USB cables to the motherboard.
    2. Install the PDB duct in place and secure to chassis with captive screw.
    3. Reattach all external network and fibre cables.

  8. Reassemble remaining MaxRep3 Engine components.
    1. Reconnect all the remaining cables to their motherboard connections that were removed in step 3-l.
    2. Reinstall the risers.
      1. Each riser has 1 alignment hole, 2 screws and a PCIe connector.  Align the PCIe connector to the motherboard first and then adjust for the others before pushing the PCIe connector into the PCIe slot.
      2. Secure in place using the two captive screws.
      3. Reconnect any external cables and the internal SAS cables if they were disconnected.
    3. Reinstall fan tray and Fans.  Be not to pinch the LED board monitor and Drive Contrainer power & monitoring cables (left side) and PDB cables (right side).
    4. Reinstall the air duct
    5. Return Drive compartment to it's normal position.
      1. Tilt the Drive Compartment back into it's normal position being careful not to pinch any cables.
      2. Secure in place using the 4 captive screws.
      3. Reconnect (left to right) the drive backplane power & monitoring cable from the motherboard, and the SAS cables.
    6. Reseat both power supplies.
    7. Close the top cover.
    8. Tighten the two captive screws in the rear of the top cover.

  9. Return MaxRep3 Engine back into the rack.
    1. Plug in both power cords.
    2. Return the MaxRep3 Engine to the rack position.
    3. Return the anti-tip legs to their normal position.

  10. Once the ILOM has booted (SP LED in front is solid green) configure MaxRep3 Engine network settings.
    1. Access the ILOM Serial Management port - see <Document 2070735.1> FS System: How to Access FS1-2 ILOMs Using a Serial Connection.
    2. Configure the pending ILOM network interface information (address, gateway & netmask) and then commit changes:
      -> set /SP/network pendingipaddress=XX.XX.XX.XX
      Set 'pendingipaddress' to 'XX.XX.XX.XX'

      -> set /SP/network pendingipgateway=XX.XX.XX.XX
      Set 'pendingipgateway' to 'XX.XX.XX.XX'

      -> set /SP/network pendingipnetmask=XX.XX.XX.XX
      Set 'pendingipnetmask' to 'XX.XX.XX.XX'

      -> set /SP/network commitpending=true
      Set 'commitpending' to 'true'

      ->

    3. Confirm the settings:
      -> show /SP/network
      /SP/network
         Targets:
             interconnect
             ipv6
             test

         Properties:
             commitpending = (Cannot show property)
             dhcp_clientid = none
             dhcp_server_ip = none
             ipaddress = XX.XX.XX.XX
             ipdiscovery = static
             ipgateway = XX.XX.XX.XX
             ipnetmask = XX.XX.XX.XX
             macaddress = 00:21:28:A2:XX.XX
             managementport = MGMT
             outofbandmacaddress = 00:21:28:A2:XX.XX
             pendingipaddress = XX.XX.XX.XX
             pendingipdiscovery = static
             pendingipgateway = XX.XX.XX.XX
             pendingipnetmask = XX.XX.XX.XX
             pendingmanagementport = MGMT
             sidebandmacaddress = 00:21:28:A2:XX.XX
             state = enabled

         Commands:
             cd
             set
             show

      ->
       

  11. Once the MaxRep3 Engine has completed it's reboot, repeat step 1 to verify that the PSN of the replacement motherboard is synchronized to the other two quorum devices.
  12. When finished, re-enable Call-Home. Please refer to <Document 1942676.1> FS System: How to Disable Call-Home to Prevent Automatic Service Request (ASR) Generation for details.


OBTAIN CUSTOMER ACCEPTANCE


WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:

 Confirm the System Alert associated previously is gone and the status of the MaxRep3 Engine and FS1-2 is normal/green.

NOTE: Because the MaxRep3 Engine must cold start, it may take as long as 20 minutes for the boot process to complete and the MaxRep3 Engine to return to a normal status.


REFERENCE INFORMATION:

From the Oracle Help Center: http://docs.oracle.com/en/storage/#fla select the Oracle Flash System Utilities, Oracle MaxRep for SAN Documentation Library Release 3.0  for more information.


Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback