![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||||
Solution Type Problem Resolution Sure Solution 1494369.1 : Sun Storage 7000 Unified Storage System: BUI unavailable and seeing errors like "failed to update kstat chain: Not enough space"
In this Document
Created from <SR 3-6245745571> Applies to:Sun ZFS Storage 7320 - Version All Versions and laterSun ZFS Storage 7120 - Version All Versions and later Sun ZFS Storage 7420 - Version All Versions and later Sun Storage 7410 Unified Storage System - Version All Versions and later Sun Storage 7310 Unified Storage System - Version All Versions and later 7000 Appliance OS (Fishworks) SymptomsTo discuss this information further with Oracle experts and industry peers, we encourage you to review, join or start a discussion in the My Oracle Support Community - Disk Storage ZFS Storage Appliance Community
BUI access hangs or generates log errors of the form: Thu Mar 22 23:27:37 2012: asynchronous error on statistics module 'mem': failed to update kstat chain: Not enough space
Fri Mar 23 02:05:53 2012: failed to update chassis data: failed to update kstat chain: Not enough space These errors will also be seen in the akd.ak error log CLI access may also be lost - but sometimes may still be available The management interface (akd) will usually have been running without having been restarted for months Take a core dump of akd and check vmem: # gcore -o akd.core `pgrep -ox akd` # mdb akd.core.<PID> >::vmem ADDR NAME INUSE TOTAL SUCCEED FAIL CauseThis is most likely a known problem where the akd process that controls the management interface runs out of memory because of memory fragmentation due to large number of oversize allocations. If unsure, please raise a call with Oracle Support who will be able to verify if you are hitting this issue. The likely cause for this is Bug 15781962 - Repeated analytics graphs and drilldowns fragment memory until akd runs out of memory SolutionThe workaround for this is to restart the management interface (akd) to alleviate the heap fragmentation. If the CLI is still available, it is possible to restart the management interface from there: S7000:> maintenance system restart
Please note that if you have a cluster then you should verify that the cluster is in a sane state before restarting the management interface on any one head to prevent a takeover happening. You can check this by checking the cluster configuration: S7000:> configuration cluster show Properties:
Valid states for the cluster head and peer are AKCS_CLUSTERED, AKCS_OWNER and AKCS_STRIPPED. In these states restarting the management interface will not cause any takeover by the other head. Restarting the management interface will not have any effect on access to the shares. If the CLI is not available, please raise a service request with Oracle Support to restart the menagement interface.
Some customers are still seeing memory fragmentation in akd after installing 2011.04.24.5.0 (2011.1.5.0), this is being tracked by Bug 16187433 - closed as a duplicate of Bug 15685321. The workaround of restarting akd remains the same. The total number of analytics datasets that must be continuously updated is a contributory reason to the fragmentation so destroying unnecessary datasets will help. 16187433 - datasets with lots of breakdowns causes memory fragmentation in akd For 2013.1.x releases: Fixed in 2013.1.1.1 (15685321-17531615 Backport 15685321 to ak-2013-rel)
Also, be aware of Bug 20751907 (datasets with lots of breakdowns caused akd to bloat) reported on 2013.1.2.12
***Checked for relevance on 24-MAY-2018*** References<BUG:15781962> - SUNBT7157268-AK-8 REPEATED ANALYTICS GRAPHS AND DRILLDOWNS FRAGMENT MEMORY UNTIL<BUG:16187433> - DATASETS WITH LOTS OF BREAKDOWNS CAUSES MEMORY FRAGMENTATION IN AKD <BUG:15685321> - SUNBT7004788 UMEM_MAXBUF TOO SMALL FOR MODERN APPLICATIONS Attachments This solution has no attachment |
||||||||||||||||||
|