Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1565362.1
Update Date:2018-05-16
Keywords:

Solution Type  Technical Instruction Sure

Solution  1565362.1 :   How to Replace a Sun Server X2-8 (Sun Fire X4800 M2), Sun Fire X4800, Exadata X2-8, X3-8 NEM (Network Express Module)  


Related Items
  • Exadata Database Machine X2-8
  •  
  • Sun Server X2-8
  •  
  • Exadata X3-8 Hardware
  •  
  • Sun Fire X4800 Server
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: x64-CAP VCAP
  •  




In this Document
Goal
Solution
References


Applies to:

Exadata Database Machine X2-8 - Version All Versions and later
Sun Server X2-8 - Version Not Applicable and later
Exadata X3-8 Hardware - Version All Versions and later
Sun Fire X4800 Server - Version Not Applicable and later
Information in this document applies to any platform.

Goal

How to Replace a Sun Server X2-8 (Sun Fire X4800 M2), Sun Fire X4800, Exadata X2-8, X3-8 NEM (Network Express Module).

Solution

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED:
No special skills required, Customer Replaceable Unit (CRU) procedure

TIME ESTIMATE: 30 minutes

TASK COMPLEXITY: 0

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:

PROBLEM OVERVIEW: A Sun Server X2-8 (Sun Fire X4800 M2), Sun Fire X4800, Exadata X2-8, X3-8 NEM (Network Express Module) needs replacement

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

The Network Express Module can be replaced with system in powered on running state.

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:

Reference Doc:
Sun Fire X4800 Server Service Manual
http://docs.oracle.com/cd/E19140-01/html/821-0282/index.html

Sun Fire X4800 M2 Server Service Manual
http://docs.oracle.com/cd/E20815_01/html/E20840/sfmsm.html#scrolltoc

Network Express Module Designations
http://docs.oracle.com/cd/E20815_01/html/E20840/sfmsm.gjfsa.html#scrolltoc
The server contains two Network Express Modules. The Network Express Modules are designated NEM0, and NEM1, with NEM0 residing in the left slot and NEM1 residing in the right slot.

How to shutdown the Exadata database nodes and storage cells in a rolling fashion so certain hardware tasks can be performed. (Doc ID 1539451.1)


How to Remove a Network Express Module


Network Express Modules (NEM) are supposed to be hot-pluggable, but depending on the OS installed (Linux) can cause the host to reset. That is, the server does not need to be powered off to remove or replace a NEM as long as IO resources accessing the NEM are disabled in the OS.  However, if the system is running Linux, like in the case of Exadata DB nodes, the system must be put into standby power mode as hot-plugging a NEM causes a reset of the HOST.

Note: If this server is part of an Exadata, please follow shutdown instructions in Section 1 of DOC ID: 1539451.1


1. Locate the NEM that you need to remove.
If the NEM is in a fault state, the Amber Service Action Required LED is on.

2. Label and disconnect any cables attached to the NEM.

3. To release the NEM, squeeze and extend the release handles outward in opposite directions.

4. Pull the lower handle downward and lift the upper handle until both are in their fully open position.
This action removes the pawl from the locking slot.

5. Use the handles to pull the unit partially out of the slot.
Pull the unit out far enough to grab it with your hands.

6. Close the release handles by moving them inward again.

7. To remove the NEM, slowly slide it out of the slot with one hand while supporting it with the other hand.


Caution - Airflow and temperature-related issues. NEM slots must have a NEM filler when removed for extended periods and can lower temperatures when a NEM is removed from its slot due to airflow.



How to Install a Network Express Module

1. If necessary remove the existing NEM or NEM filler panel.

2. Ensure that the release handles on the replacement NEM are in the fully open position.
Extend the release handles outward in opposite directions to their fully open position. Pull the lower handle downward and lift the upper handle.

3. Position the NEM at the opening with the LEDs to the bottom and the release handles on the right hand side.
Correct positioning ensures that the connectors on the NEM is aligned with the connectors on the internal midplane of the server.

4. To install the NEM, use the handles to slide the NEM into the opening until it stops.

Note - In this position, the NEM is not flush with the face of the server, and the connectors on the NEM are not engaged with the connectors on the internal midplane. Do not attempt to force the NEM into the server any farther.

5. Ensure that the pawls on the levers are aligned with the slot in the side wall.

6. To engage and set the NEM connectors with the connectors on the internal midplane, push the end of the release handles inward toward the NEM.


Caution - Pinch point. Keep your fingers clear of the back side of the lever, the lever hinges, and the edges of the module.


 This action draws the NEM into the slot and engages the connectors on the NEM with the connectors on the internal midplane and the NEM face will be flush with the rear of the server.

7. Continue to push the release handles until the handles lock into place on the right side edge of the NEM.  In this position, the NEM cannot be slid out of the server unless the release handles are unlocked and lifted first.

8. If necessary, install or transfer from removed NEM any 10Gbe transceivers.

9. Attach the necessary cables to the NEM.

10. Once installed the replacement NEM should be checked for firmware level to ensure it matches the other existing NEM installed.  If the replacement NEM is down revved or at a different firmware revision than the other NEM, they must be upgraded to the same level for proper operation.

Note - For component firmware update instructions, refer to the Oracle Integrated Lights Out Manager (ILOM) 3.0 Supplement for the Sun Server X2-8 (formerly Sun Fire X4800 M2).

11. If the SAS expander is down revved and needs to be updated then power on the server and boot into BIOS so that the NEM is powered on but there is no traffic on the NEMs at all.

Note – You do not need to configure anything in the BIOS setup utility. This action ensures that the NEMs are powered-on but the OS does not boot.

12. In the ILOM BUI go to the Components tab and highlight the NEM needing upgrade by clicking the radio button on the left.

13. From the Actions Drop-down menu, select Update Firmware.  A window will appear asking for download details.

14. Perform the following Actions:

    A. Specify the image location by performing:

       1. Click Browse to select the location of the firmware image you want to install (Example: x4800-sas2exp-5.7.0.0-rom.pkg).

    B. Select a transfer method from the drop-down list.

    C. Click the Update button to upload the file and update the firmware.
        The Update Status display appears, providing details about the update progress. When the update indicates 100%, the firmware upload is complete.  When the update is finished, it displays the message Firmware Update Successful.  Auto-closing windows in 2 seconds, then it closes the display.

15. Repeat above steps 10 through 13 for second NEM if needed.

16. Press ESC in the BIOS setup utility to boot the system.

 

If you find the SAS expander firmware is down revved below 5.7.0.0 then you should also assume the FPGA is down revved as well.  The NEM FPGA v17 and SAS expander 5.7.0.0 were implemented at the same time in manufacturing so if SAS is down revved then so is FPGA.  You need to acquire an escalation password to access escalation mode.

FPGA v17 comes in SW1.3 release for X4800 and SW1.1 for X4800M2.

Exadata X2-8 images with the proper ILOM containing v17 FPGA are:

image 11.2.2.4.0 contains SW1.3 for X4800

image 11.2.3.1.0 contains SW1.1 for X4800M2

Exadata X3-8 released with correct image at RR so all images contain v17 FPGA.

 

It's a manual process to update the NEM FPGA:

1. First upgrade the ILOM/BIOS with at least SW1.3 for X4800 or SW1.1 for X4800M2 or later using normal firmware update process.  

2. Then boot the server into BIOS so that the server/NEM is powered on but there is no traffic on the NEMs at all.  The server must be powered on to update the NEM FPGA, it cannot be done with server powered off.

3. Then login with escalation mode to the CMM.  

4. After logging in use the below command to update the NEM FPGA and the PHYROM.  The script updates both automatically.

# /usr/local/bin/g5nemfpgaupdate.sh

No need to AC power cycle the machine.  (As host is turned off at the end of upgrade script, NEM is powered by chassis main 12V, new FPGA/PHYROM will be effective next time host is powered on.)

 

Note: If this server is part of an Exadata, please follow restart instructions in Section 1, step 10 of DOC ID: 1539451.1


How to verify the Network Express Module is working properly

Login to CMM to Check the NEM status.

-> show /SYS/NEMx -l all

Note: The "x" represents the NEM number replaced

Sample:

-> show /SYS/ -l all

/SYS/NEM0
    Targets:
        MB
        PRSNT
        STATE
        ERR
        OK
        SERVICE
        OK2RM
        LOCATE

    Properties:
        type = Network Express Module
        ipmi_name = NEM0
        fru_manufacturer = Mitac International
        fru_version = FW 5.7.0.0
        fru_part_number = 511-1056-05
        fru_serial_number = 0328MSL-1049MC007Y
        fault_state = OK
        load_uri = (none)
        clear_fault_action = (none)


 /SYS/NEM0/MB
    Targets:

    Properties:
        type = Motherboard
        fru_name = ASSY,BRD,NEM,C4
        fru_manufacturer = 7696 MITAC COMPUTER LTD GUANGDONG CN
        fru_version = FW 5.7.0.0
        fru_part_number = 511-1056-05
        fru_serial_number = 0328MSL-1049MC007Y


 /SYS/NEM0/PRSNT
    Targets:

    Properties:
        type = Entity Presence
        ipmi_name = NEM0/PRSNT
        class = Discrete Sensor
        value = Present
        alarm_status = cleared


 /SYS/NEM0/STATE
    Targets:

    Properties:
        type = Module
        ipmi_name = NEM0/STATE
        class = Discrete Sensor
        value = Running
        alarm_status = cleared


 /SYS/NEM0/ERR
    Targets:

    Properties:
        type = OEM
        ipmi_name = NEM0/ERR
        class = Discrete Sensor
        value = Predictive Failure Deasserted
        alarm_status = cleared


 /SYS/NEM0/OK
    Targets:

    Properties:
        type = Indicator
        ipmi_name = NEM0/OK
        value = On


 /SYS/NEM0/SERVICE
    Targets:

    Properties:
        type = Indicator
        ipmi_name = NEM0/SERVICE
        value = Off


 /SYS/NEM0/OK2RM
    Targets:

    Properties:
        type = Indicator
        ipmi_name = NEM0/OK2RM
        value = Off


 /SYS/NEM0/LOCATE
    Targets:

    Properties:
        type = Indicator
        ipmi_name = NEM0/LOCATE
        value = Off


Check ILOM event log to see if any error existed.

-> show /SP/faultmgmt


Example:


/SP/faultmgmt
Properties:

-> show faulty
Target | Property | Value
-----------------+-----------------------+-------------------------------------


-> show /SP/logs/event/list

Check OS FMA info to see if any errors existed.

Example:

fmadm faulty -a
STATE RESOURCE / UUID

-------- ----------------------------------------------------------------------


Clear any faults found.

References

<NOTE:1381773.1> - How to clear FMA logs on the ILOM or Solaris on x86 platforms

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback