Asset ID: |
1-71-1945614.1 |
Update Date: | 2018-04-09 |
Keywords: | |
Solution Type
Technical Instruction Sure
Solution
1945614.1
:
FS System: How to Remove and Replace a Hard Disk Drive in an FS1-2 Pilot
Related Items |
- Oracle FS1-2 Flash Storage System
- Oracle FS1-2 Cloud System
|
Related Categories |
- PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: DISK-CAP VCAP
|
In addition to the physical replacement of the Hard Disk Drive in the Pilot, the Pilot image will also need to be copied from the surviving Pilot.
In this Document
Oracle Confidential PARTNER - Available to partners (SUN).
Reason: Part being replaced is a FRU and image copy is complex.
Applies to:
Oracle FS1-2 Flash Storage System - Version All Versions to All Versions [Release All Releases]
Oracle FS1-2 Cloud System - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.
This process of copying the Pilot image can take as long as 30 minutes to complete during which time the FE will have only the Pilot LEDs to monitor the progress. As such, LED changes on the front of the Pilot have been included with an approximate timeline so that the progress can be monitored.
Goal
The purpose of this document is to outline the steps required to replace a Hard Disk Drive in an FS1-2 Pilot using Guided Maintenance
Solution
DISPATCH INSTRUCTIONS
WHAT SKILLS DOES THE FIELD ENGINEER/ADMINISTRATOR NEED?:
Product knowledge, FS1-2 Flash Storage System
TIME ESTIMATE: 90 minutes
TASK COMPLEXITY: 2
FIELD ENGINEER/ADMINISTRATOR INSTRUCTIONS:
PROBLEM OVERVIEW:
WHAT: A Hard Disk Drive (HDD) in an FS1-2 Pilot needs to be replaced
WHERE: The FS1-2 problem Pilot will be listed in the System Alerts.
WHY: The HDD has failed or is in danger of failing.
WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY?:
The Pilot with the problem HDD will need to be powered off during the physical replacement. The surviving Pilot must be functioning normally so that the pilot image can be copied from it to the new HDD.
NOTE: You will need to enable SSH on the FS1-2 in order to access the surviving Pilot.
QRC for this procedure:

WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE?:
- Prepare FS1-2 for service procedure.
- Disable Call-Home to prevent spurious alerts (see Document 1942676.1 FS System: How to Disable Call-Home to Prevent Automatic Service Request (ASR) Generation).
- Use ESD precautions
- Log into Oracle FS System Manager to access Guided Maintenance:
- Select System tab.
- In the navigation tree, expand Hardware and select Pilots.
- In the main window, right click on the Pilot with failed or failing HDD and select Repair Pilot option.
- In the pop-up Modify window, select the SAS HDD from the pull-down menu and then click on Next.
- Click Next again to turn on the White Locate LED for the problem Pilot.
- Instructions for HDD replacement can be downloaded from the next screen but it is recommended that the FE use this CAP as it also contains a timeline.
- Clicking Next will offline the problem Pilot. If it was still the active Pilot, a failover to it's partner will now happen and you will lose connection to the GUI.
- Wait for the green OK LED of the problem Pilot to to slow-blink (every 3 seconds).
- Extend the anti-tip legs in front of the rack.
- Slide the Pilot into the service position.
- Unplug both power cables.
NOTE: If the Pilot is an X5-2 version, remove the top cover and check to see if an Oracle System Assistant (OSA) USB drive is installed internally on the Pilot motherboard. If one is installed, proactively remove it. While having the OSA USB drive installed does not affect day to day operations of the FS1-2, it will create problems if the Pilot needs to be re-imaged in the future.
- Replace Pilot HDD.
- Use the square button on the front of the Pilot HDD to eject it.
- Remove the Pilot HDD.
- Install the replacement HDD.
- Re-image the replacement Pilot HDD.
NOTE: for Systems running R6.2.5 or higher, please use the Pilot Rescue Image to re-image the replaced Pilot HDD if possible. See Document
2148252.1 FS System: How to Use the Pilot Rescue Image.
- Unplug the ethernet cable from Net-2 (see cover panel for location).
- Unplug the ethernet cable from Net Mgr and plug it into the Net-2 location.
- Enable SSH on the FS1-2 for 60 minutes (see Document 2029847.1 FS System: How to Enable ssh on the Pilot.
- ssh into the working pilot (see Document 2046703.1 FS System: Passwords Associated with the Oracle FS1-2 Flash Storage System) and flush the current network tables on the active Pilot:
For FS1-2 systems running R6.1.x:
# echo "" > /var/lib/dnsmasq_svc.leases
#service dnsmasq_svc restart
For FS1-2 systems running R6.2.x:
# echo "" > /var/lib/axiom/dnsmasq_pmi.leases
# service dnsmasq_pmi restart
The first command will not produce any output, but the second will display two lines with green [OK]s on the right side of the screen.
- Reattach both power cords and observe the LEDs on the front left side of the Pilot (all times are approximate from power on):
- The SP LED on the Pilot should fast-blink (every second).
- After about two minutes, it will turn solid and the OK LED will slow-blink.
- After about two and a half minutes the OK LED will fast-blink.
- After about four and a half minutes the OK LED will go solid.
- At about twenty-two minutes, the image copy will complete and the OK LED will again slow-blink indicating it is rebooting with the new image.
- At about twenty-two and a half minutes, the OK LED will fast-blink.
- At about twenty-four and a half minutes the OK LED will go solid.
- At about twenty-seven and a half minutes, the Pilot should finish it's reboot.
- After about twenty-five minutes, use a command tool or other terminal to ping the IP address assigned to the problem Pilot.
- When ping responds, remove the ethernet cable from the problem Pilot's Net-2 and plug it into Net Mgt.
NOTE: if ping does not respond after 30 minutes, connect up a monitor and keyboard. If you see an "Unsupported Hardware Detected" error message, select OK and process should resume. See <Bug 21564611> for additional details.
- Restore the original Pilot inter-connections.
- Disconnect the cable in Net-2 and return it to it's original location, Net Mgr.
- Reconnect the cable originally in Net-2.
- Return the Pilot to the FS1-2 system.
- Return the Pilot to it's racked position.
- Re-flush the Network Tables again
For FS1-2 systems running R6.1.x:
#echo "" > /var/lib/dnsmasq_svc.leases
#service dnsmasq_svc restart
For FS1-2 systems running R6.2.x:
# echo "" > /var/lib/axiom/dnsmasq_pmi.leases
# service dnsmasq_pmi restart
- Check the GUI to confirm that both Pilots are normal and functioning. If not, check the following issues and recommended remedies:
- Network connections in the Pilot may be in an inconsistent state. This can be cleared by rebooting the Pilot again.
- Pilots will be normal but Hardware Status Icon in the lower left corner of the GUI may still be yellow. This can be cleared by a logout/login of the GUI.
- Confirm that the following files and directories contain the same information on both Pilots:
/var/lib/pillar/pcp/Compatibility.xml
/rpms/AxiomONE-SW-installed/*
/rpms/AxiomONE-SW-previous/*
/rpms/AxiomONE-SW-staged/*
/rpms/firmware/*
/rpms/pillar-pilot-os/*
If any or all of these files are missing, use scp to copy them from the good Pilot image.
NOTE: If this system has never gone through a software upgrade or had one staged, the contents of AxiomONE-SW-previous and AxiomONE-SW-staged may be empty.
Example of copying the files from Pilot 1 to Pilot 2:
[root@pilot1 /]# scp /var/lib/pillar/pcp/Compatibility.xml pilot2:/var/lib/pillar/pcp/Compatibility.xml
[root@pilot1 /]# scp -r /rpms/AxiomONE-SW-installed pilot2:/rpms/AxiomONE-SW-installed
NOTE: Do not copy over the /rpms directory as it is also a filesystem and doing so will also copy over the lost+found directory.
- Check the timezone setting to confirm it is set to UTC:
[root@pilot1 ~]# ls -l /etc/localtime
lrwxrwxrwx 1 root root 23 Nov 9 22:08 /etc/localtime -> /usr/share/zoneinfo/UTC
[root@pilot1 ~]#
If the timezone is NOT set to UTC, contact Oracle support for additional help resolving <Bug 22241151>.
- If any other issue are observed, please contact the owner of the Service Request for additional help.
- When finished, re-enable Call-Home.
OBTAIN CUSTOMER ACCEPTANCE
WHAT ACTION DOES THE FIELD ENGINEER/ADMINISTRATOR NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE?:
Confirm that replacement Pilot HDD is seen by the FS1-2 Flash Storage System and the system has reverted to a normal/green state.
REFERENCE INFORMATION:
From the Oracle Help Center: http://docs.oracle.com/en/storage/#fla select the Oracle Flash System Documentation Library for more information.
References
<NOTE:1431693.1> - Pillar Axiom: How to Enable SSH Access on the Pilot(s)
<NOTE:1942676.1> - FS System: How to Disable Call-Home to Prevent Automatic Service Request (ASR) Generation
Attachments
This solution has no attachment