Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-71-1383901.1
Update Date: 2018-05-16

Solution Type  Technical Instruction Sure

Solution  1383901.1 :   How to Replace a Sun Fire X4800, Sun Server X2-8 (Sun Fire X4800 M2) Disk Backplane  


Related Items
  • Sun Fire X4800 Server
  • Sun Server X2-8
  • Exadata Database Machine X2-8
  • Exadata X3-8 Hardware
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: x64-CAP VCAP




In this Document
Goal
Solution
References


Oracle Confidential PARTNER - Available to partners (SUN).
Reason: This doc is for ORACLE internal reference

Applies to:

Exadata X3-8 Hardware - Version All Versions and later
Sun Fire X4800 Server - Version All Versions and later
Sun Server X2-8 - Version All Versions and later
Exadata Database Machine X2-8 - Version All Versions and later
Information in this document applies to any platform.

Goal

How to Replace a Sun Fire X4800, Sun Server X2-8 (Sun Fire X4800 M2) Disk Backplane.

Solution

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED?:
Sun Fire X4800, Sun Server X2-8 (Sun Fire X4800 M2) Training

TIME ESTIMATE: 60 minutes

TASK COMPLEXITY: 1

FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:

PROBLEM OVERVIEW: A Sun Fire X4800, Sun Server X2-8 (Sun Fire X4800 M2) Disk Backplane needs replacement.

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

If the system is still up and functioning, the customer should perform an orderly, graceful shutdown of the applications and the OS, then power off the server and remove the AC power cords from the system. A data backup is not a prerequisite, but it is a wise precaution.

Note: If this server is part of an Exadata, please follow shutdown instructions in Section 1 of DOC ID: 1539451.1


WHAT ACTION DOES THE ENGINEER NEED TO TAKE:

Reference Doc:
Sun Fire X4800 Server Service Manual
http://docs.oracle.com/cd/E19140-01/html/821-0282/index.html

Sun Fire X4800 M2 Server Service Manual
http://docs.oracle.com/cd/E20815_01/html/E20840/sfmsm.html#scrolltoc

Sun Fire X4800 Replacing the Hard Drive Backplane (FRU)
http://docs.oracle.com/cd/E19140-01/html/821-0282/gjgqc.html#scrolltoc

Sun Fire X4800 M2 Replacing the Hard Drive Backplane (FRU)
http://docs.oracle.com/cd/E20815_01/html/E20840/sfmsm.gjgqc.html#scrolltoc

Sun Fire X4800 Server Chassis Overview
http://docs.oracle.com/cd/E19140-01/html/821-0282/gjkwa.html#scrolltoc

Sun Fire X4800 M2 Server Chassis Overview
http://docs.oracle.com/cd/E20815_01/html/E20840/sfmsm.gjkwa.html#scrolltoc

Sun Fire X4800 Drive Backplane Overview
http://docs.oracle.com/cd/E19140-01/html/821-0282/gjmst.html#scrolltoc

Sun Fire X4800 M2 Drive Backplane Overview
http://docs.oracle.com/cd/E20815_01/html/E20840/sfmsm.gjmst.html#scrolltoc

Sun Fire X4800 Hard Drive Backplane Cable Routing and Designations
http://docs.oracle.com/cd/E19140-01/html/821-0282/gjkuz.html#scrolltoc

Sun Fire X4800 M2 Hard Drive Backplane Cable Routing and Designations
http://docs.oracle.com/cd/E20815_01/html/E20840/sfmsm.gjkuz.html#scrolltoc

How to shutdown the Exadata database nodes and storage cells in a rolling fashion so certain hardware tasks can be performed. (Doc ID 1539451.1)

How to Remove the Hard Drive Backplane

1. Shut down the OS and power off the server.
2. Disconnect the AC power cables from the AC power block.
3. Label the disk drives, CMODs, and module fillers with their respective slot designations, then remove them.
4. Label and remove the hard drives and any HD filler carriers.
5. To remove the three SAS connectors from the hard drive backplane, squeeze the connector lock clips and pull the connectors toward the back of the server.

Note: As you face the server, the lock clips are on the left side of the connector. From left to right, the cables are labeled SAS 1, SAS Power, and SAS 0. The rightmost cable, SAS 0, is color-coded.


6. From the front of the machine, use a No. 1 Phillips screwdriver to loosen the three captive hard drive backplane retaining screws.
7. To remove the hard drive backplane and frame assembly from the server, slide it toward the back of the server, tilt the right side upward, and pull it out, leading with the right edge.
8. Separate the HD backplane from the frame.


How to Install the Hard Drive Backplane

9. Join the backplane and the frame together.
10. Orient the hard drive backplane assembly with the connectors on the backplane facing toward the back of the server.
11. Insert the hard drive backplane into the server and position it against the back wall of the drive bay.
12. Hold the hard drive backplane assembly flat against the back wall of the drive bay and align the screw holes in the assembly with the captive screws.

Note: For ease of installation, align the center screw first.

13. From the front of the server, use a No. 1 Phillips screwdriver to tighten the three captive screws and secure the hard drive backplane assembly.
14. Connect the SAS cables and the one SAS power cable to the hard drive backplane.
15. Install the hard drives into their original slots.
16. Install the CMODs and filler modules into their original slots.
17. Install the AC power cables.
18. Prepare the server for operation.

How to Verify That the DBP Is Working Properly

Note: If this server is part of an Exadata, please follow restart instructions in Section 1, step 10 of DOC ID: 1539451.1

Power on the server and log in to ILOM to confirm that the DBP is working and that all HDDs report a normal status.

Sample output:

-> show /SYS/DBP -l all

/SYS/DBP
Properties:
type = Disk Backplane
ipmi_name = DBP
fru_name = ASSY,BACKPLANE HDD,C4
fru_manufacturer = 9615 HON HAI PRECISION CO. LTD SHENZHEN GUANGDONG CN
fru_part_number = 511-1062-06
fru_serial_number = 0226LHF-1039AB00VF

/SYS/DBP/HDD0
Properties:
type = Hard Disk
ipmi_name = DBP/HDD0

/SYS/DBP/HDD0/PRSNT
Properties:
type = Entity Presence
ipmi_name = DBP/HDD0/PRSNT
class = Discrete Sensor
value = Present
alarm_status = cleared

/SYS/DBP/HDD0/SERVICE
Properties:
type = Indicator
ipmi_name = DBP/HDD0/SVC
value = Off

/SYS/DBP/HDD0/OK2RM
Properties:
type = Indicator
ipmi_name = DBP/HDD0/OK2RM
value = Off
.
.
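
When the output above has been captured to a text file (for example from a saved terminal session), the per-drive checks can be scripted instead of read by eye. The following is a minimal sketch, not an Oracle-supplied tool: the function names are hypothetical, and it assumes that every listed slot is expected to hold a drive, so any PRSNT value other than Present, or any SERVICE indicator other than Off, is reported.

```python
def parse_ilom_show(text):
    """Parse captured `show <target> -l all` text into {target: {property: value}}."""
    targets = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("/"):
            # A line such as /SYS/DBP/HDD0/PRSNT starts a new target section.
            current = line
            targets[current] = {}
        elif current is not None and " = " in line:
            key, value = line.split(" = ", 1)
            targets[current][key.strip()] = value.strip()
    return targets


def drive_problems(targets):
    """Return findings for drives that look absent or flagged for service.

    Assumption: every slot listed is expected to be populated.
    """
    problems = []
    for target, props in sorted(targets.items()):
        value = props.get("value")
        if target.endswith("/PRSNT") and value != "Present":
            problems.append(f"{target}: value is {value}")
        if target.endswith("/SERVICE") and value != "Off":
            problems.append(f"{target}: service indicator is {value}")
    return problems


# Trimmed copy of the sample output shown in this document.
SAMPLE = """\
/SYS/DBP
Properties:
type = Disk Backplane
ipmi_name = DBP
fru_part_number = 511-1062-06

/SYS/DBP/HDD0/PRSNT
Properties:
type = Entity Presence
value = Present
alarm_status = cleared

/SYS/DBP/HDD0/SERVICE
Properties:
type = Indicator
value = Off
"""

if __name__ == "__main__":
    parsed = parse_ilom_show(SAMPLE)
    print("Backplane part number:", parsed["/SYS/DBP"]["fru_part_number"])
    for finding in drive_problems(parsed) or ["no drive problems found"]:
        print(finding)
```

An empty result from drive_problems() only means that the captured text shows no absent or service-flagged drives; the event log checks that follow are still required.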



Check the ILOM fault management output and the event log for any HDD-related errors.

-> show /SP/faultmgmt
-> show /SP/logs/event/list



Check from the LSI Niwot REM or Erie RAID controller that the HDDs are working normally.

For the LSI Niwot REM controller:

1. Power on the system.
During boot, a message is displayed that gives you the option to press CTRL+H to access the WebBIOS configuration utility.

2. Press CTRL+H to access the WebBIOS utility.
The Adapter Selection screen is displayed.

3. Use the Tab key to navigate to the adapter that you want, and press Enter.

4. Use the Tab key to navigate to the Start button, and press Enter.
The MegaRAID BIOS Config Utility Virtual Configuration screen is displayed.

5. In the navigational menu on the left, use the Tab key to navigate to the Virtual Drives menu option, and press Enter.
The MegaRAID BIOS Config Utility Virtual Drives screen is displayed.

6. Check that all drives show Optimal.

7. Exit the utility by using the Back button to return to the main menu, navigating to the Exit menu option, and pressing Enter.



For the LSI Erie RAID controller:

1. Power on the system.
During the boot process, the BIOS initialization banner lists information about the discovered SAS adapters and the devices that are attached to the discovered HBAs in the system.

2. When the prompt "Press Ctrl-C to start LSI Corp Configuration Utility..." appears, immediately press CTRL-C to access the LSI Corp Config Utility.
The LSI Corp Config Utility menu is displayed.

3. Use the arrow keys to navigate to the HBA that you want, and press Enter.
The Adapter Properties screen is displayed for the selected HBA.

4. To view the devices and logical volumes attached to the HBA, use the arrow keys to navigate to the SAS Topology field, and press Enter.
The SAS Topology screen is displayed.

5. Check that all drives show Optimal.

6. Exit the utility by pressing the Esc key, navigating to the Exit menu option, and pressing Enter.


Check that all HDDs are working normally in a Solaris environment.

1. Use the format command to check HDD status.

# format
Searching for disks...done


AVAILABLE DISK SELECTIONS:
0. c0t0d0 <DEFAULT cyl 36348 alt 2 hd 255 sec 63>
/pci@0,0/pci8086,3408@1/pci1000,9262@0/sd@0,0



2. Use the iostat command to check HDD status.

# iostat -E
sd1 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: LSI Product: MR9262-8i Revision: 2.90 Serial No: Size: 299.00GB <298999348736 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
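
On a system with many devices, the error counters in `iostat -E` output are easier to review programmatically. The sketch below is illustrative only: it parses the "Soft Errors / Hard Errors / Transport Errors" line in the layout shown above and lists devices whose hard or transport counters are nonzero. The function names are hypothetical, and treating only those two counters as significant is an assumption, not an Oracle threshold.

```python
import re

# Matches lines such as:
#   sd1 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
ERROR_LINE = re.compile(
    r"^(?P<dev>\S+)\s+Soft Errors:\s*(?P<soft>\d+)\s+"
    r"Hard Errors:\s*(?P<hard>\d+)\s+Transport Errors:\s*(?P<transport>\d+)"
)


def error_counts(text):
    """Return {device: {"soft": n, "hard": n, "transport": n}} from `iostat -E` text."""
    counts = {}
    for line in text.splitlines():
        match = ERROR_LINE.match(line.strip())
        if match:
            counts[match.group("dev")] = {
                name: int(match.group(name)) for name in ("soft", "hard", "transport")
            }
    return counts


def devices_with_errors(counts):
    """List devices whose hard or transport error counters are nonzero (assumed criterion)."""
    return sorted(
        dev for dev, c in counts.items() if c["hard"] > 0 or c["transport"] > 0
    )


# Sample taken from the output shown in this document.
SAMPLE = """\
sd1 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: LSI Product: MR9262-8i Revision: 2.90 Serial No: Size: 299.00GB <298999348736 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 2 Predictive Failure Analysis: 0
"""

if __name__ == "__main__":
    print(devices_with_errors(error_counts(SAMPLE)) or "no hard or transport errors")
```

The counters are cumulative since boot, so a nonzero value after the replacement is a prompt to investigate rather than proof of a fault.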



3. Use the cfgadm command to check the HDD connection.

# cfgadm -alv
Ap_Id Receptacle Occupant Condition Information
When Type Busy Phys_Id
Slot3 empty unconfigured unknown Location: Slot3
Jan 1 1970 unknown n /devices/pci@79,0/pci8086,340a@3:Slot3
Slot5 connected configured ok Location: Slot5
Jan 1 1970 pci-pci/hp n /devices/pci@79,0/pci8086,340c@5:Slot5
c0 connected configured unknown
unavailable scsi-bus n /devices/pci@0,0/pci8086,3408@1/pci1000,9262@0:scsi
c0::dsk/c0t0d0 connected configured unknown LSI MR9262-8i



4. Use the raidctl -l command to check that hardware RAID is working normally.

Platforms using Solaris 10 Update 3 or lower produce output similar to the following when configured under LSI hardware RAID management:

Good Volume:

# /usr/sbin/raidctl -l

RAID Volume RAID RAID Disk
Volume Type Status Disk Status
------------------------------------------------------
c1t0d0 IM OK c1t0d0 OK
c1t1d0 OK


Platforms using Solaris 10 Update 4 or higher produce output similar to the following when configured under LSI hardware RAID management:

Good Volume:

# /usr/sbin/raidctl -l

Controller: 0
Volume:c1t0d0
Disk: 0.0.0
Disk: 0.1.0
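
For the older tabular output (Solaris 10 Update 3 or lower), the volume and disk status columns can be checked with a short script. This is a sketch under stated assumptions: it relies on the whitespace-separated layout shown above (five fields on a volume row, two fields on a continuation disk row) and treats any status other than OK as worth reporting. The function names are hypothetical. The Update 4 and higher listing shown above carries no status column, so this sketch does not apply to it.

```python
def parse_raidctl_table(text):
    """Return (volumes, disks) status dicts from old-style `raidctl -l` table text."""
    volumes, disks = {}, {}
    in_body = False
    for line in text.splitlines():
        fields = line.split()
        if not in_body:
            # The table body starts after the row made entirely of dashes.
            in_body = bool(fields) and set(fields[0]) == {"-"}
            continue
        if len(fields) == 5:
            # volume, RAID type, volume status, disk, disk status
            volume, _raid_type, volume_status, disk, disk_status = fields
            volumes[volume] = volume_status
            disks[disk] = disk_status
        elif len(fields) == 2:
            # Continuation row for a further member disk: disk, disk status
            disk, disk_status = fields
            disks[disk] = disk_status
    return volumes, disks


def not_ok(statuses):
    """List names whose status is anything other than OK."""
    return sorted(name for name, status in statuses.items() if status != "OK")


# Sample taken from the "Good Volume" output shown in this document.
SAMPLE = """\
RAID Volume RAID RAID Disk
Volume Type Status Disk Status
------------------------------------------------------
c1t0d0 IM OK c1t0d0 OK
c1t1d0 OK
"""

if __name__ == "__main__":
    volumes, disks = parse_raidctl_table(SAMPLE)
    print("volumes needing attention:", not_ok(volumes))
    print("disks needing attention:", not_ok(disks))
```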



5. Check the /var/adm/messages file for any HDD-related warnings or errors.
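
Reviewing /var/adm/messages can be narrowed to likely disk entries with a small filter. The following is a rough sketch rather than a definitive check: the function name and the keyword patterns (sdN device names, "disk", "scsi") are assumptions chosen for illustration, and they will both miss some relevant entries and match some unrelated ones, so the hits still need to be read by an engineer.

```python
import re

# Assumed severity words and disk-related hints; adjust for the site.
SEVERITY = re.compile(r"\b(warning|error)\b", re.IGNORECASE)
DISK_HINT = re.compile(r"(\bsd\d+\b|disk|scsi)", re.IGNORECASE)


def suspect_lines(lines):
    """Return log lines that mention a warning or error and look disk related."""
    hits = []
    for line in lines:
        if SEVERITY.search(line) and DISK_HINT.search(line):
            hits.append(line.rstrip("\n"))
    return hits
```

On the server itself this could be called as `suspect_lines(open("/var/adm/messages", errors="replace"))`, printing each returned line for review.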

For other operating systems, follow the related documents to check that the HDDs are working normally.

Linux:
How to Check for Linux Platform Disk Errors and Online/Offline Status (Doc ID 1002936.1)

Windows:
How to Check for Windows platform disk errors and online/offline status (Doc ID 1011590.1)

For the Niwot controller, see:

Sun Storage 6Gb SAS REM RAID HBA Installation Guide, 820-7888.
https://support.us.oracle.com/handbook_internal/data/820/820-7888/pdf/820-7888-11.pdf

For the Erie RAID controller, see:

Sun Storage 6Gb SAS REM HBA Installation Guide, 820-7892
https://support.us.oracle.com/handbook_internal/data/820/820-7892/pdf/820-7892-10.pdf

For more information regarding the LSI RAID controller, see:

http://www.lsi.com/sep/Pages/oracle/sas_6gbs_support.asp

References

<NOTE:1002936.1> - How To Check for Linux Platform Disk Errors and Online/Offline Status
<NOTE:1005530.1> - How To Check for Disk Errors and Online/Offline Status on Oracle Platforms Running Solaris x86
<NOTE:1011590.1> - How to check for Windows platform disk errors and online/offline status
<NOTE:1013107.1> - How to Identify BIOS and Solaris[TM] Hardware RAID Status

  Copyright © 2018 Oracle, Inc.  All rights reserved.