![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||||
Solution Type Problem Resolution Sure Solution 1948040.1 : FS System: Pilots in FS1-2 May Reboot If System Date and Time Are Changed
In this Document
Created from <SR 3-9903511042> Applies to:Oracle FS1-2 Flash Storage System - Version All Versions to All Versions [Release All Releases]Information in this document applies to any platform. SymptomsChanging the sources of where Pilots get their time from can confuse the Pilots and cause them to reboot. In this example, a customer disabled the NTP (Network Time Protocol) setting, manually updated the system time so that it was an hour ahead of where it had been and then re-enabled NTP. NOTE: some of the logs depicted below require processing by Oracle Support to be seen in the manner indicated below. Contact Oracle support if you are not able to see the details as shown below.
Event Log entries to look for: 2014-11-20T20:15:33.503 NTP_SETTINGS_MODIFIED INFORMATIONAL ComponentName=/NtpSettings
2014-11-20T20:16:00.486 PCP_EVT_NTP_SERVER_NOT_RESPONDING WARNING 2014-11-20T20:16:00.490 PCP_EVT_NTP_SYNCHRONIZATION_FAILED WARNING 2014-11-20T20:16:10.503 NTP_SETTINGS_MODIFIED INFORMATIONAL ComponentName=/NtpSettings 2014-11-20T20:19:43.996 PCP_EVT_ALL_NTP_SERVERS_RESPONDING INFORMATIONAL 2014-11-20T20:19:44.007 PCP_EVT_NTP_SYNCHRONIZATION_RESTORED INFORMATIONAL 2014-11-20T20:19:44.208 PCP_EVT_PILOT_FAILED_OVER WARNING ComponentName=/PILOT-2 2014-11-20T20:19:46.055 PCP_EVT_SYSTEM_STATE_CHANGED INFORMATIONAL currentSystemState=NORMAL, previousSystemState=PILOT_FAILBACK_IN_PROGRESS 2014-11-20T20:20:00.216 PCP_EVT_FOUND_PILOT_CORE_FILE INFORMATIONAL
Pilots Logs will also contain important indications of the issue:
CauseThis is known issue where setting the clock back in time causes a pilot core dump. Since the pilots are not part of the data path, this does not affect host access to the data stored on the FS1-2 system. The issue is documented in the Oracle FS1-2 Flash Storage System Customer Release Notes: [18006920] When the system time is changed and the new time is before the previously set time, the Pilot nodes might become unresponsive and then restart.
SolutionNo fix is currently available for this issue. The only way to avoid the problem is not to set the system time to an earlier value.
That bug 18006920 is fixed in 6.2.0, but that bug fix ONLY addresses one issue. The fix was a side effect of moving from Java 7 to Java 8, where 6.1 uses Java 7 and 6.2 uses Java 8. That bug fix is because the old Java determined when to wake up from a Java sleep() call by using the current time—what the Dev manager calls wall clock time. If you set the clock back [manually or from turning on NTP], Java would never wake up since it would be trying to wake up before it went to sleep. You can also trigger this by setting the clock forward on 6.1 since that also changes the time where Java wakes from a sleep. The heartbeats between PacMan [Java] and PCP [C++] are driven by that Java sleep() in PacMan. If PacMan does not send a heartbeat to PCP [within the same pilot] at least once every 30 seconds, PCP will kill that pilot with the pcp:warning Pacman has not heartbeat in 30 seconds!
References<BUG:18006920> - PILOTS RESTARTED WHEN SYSTEM DATE/TIME WAS CHANGED - BUILD 683<BUG:20079499> - FS1 PILOT CORES FOUND, SOFTWARE UPDATE 8 HOURS PRIOR Attachments This solution has no attachment |
||||||||||||||||||
|