![]() | Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Solution Type Predictive Self-Healing Sure Solution 2079645.1 : Health Check (ZCheck) for Oracle ZFS Storage Appliance on Oracle Enterprise Manager 12c and 13c
This document provides general information, installation guide and remediation information for ZCheck (Health Check) for Oracle ZFS Storage Appliance on Oracle Enterprise Manager 12c. In this Document
Applies to:Sun ZFS Storage 7420 - Version All Versions and laterOracle ZFS Storage Appliance Racked System ZS4-4 - Version All Versions and later Sun ZFS Storage 7120 - Version All Versions and later Oracle ZFS Storage ZS4-4 - Version All Versions and later Oracle ZFS Storage ZS3-4 - Version All Versions and later 7000 Appliance OS (Fishworks) PurposeThis document provides general information, installation guide and remediation information for Health Check for Oracle ZFS Storage Appliance on Oracle Enterprise Manager 12c or Oracle Enterprise Manager 13c.
ScopeThis document is intended for administrators and Oracle support engineers that monitors Oracle ZFS Storage Appliance on Oracle Enterprise Manager.
DetailsOverview
Health Check(ZCheck) is a monitoring metric on Oracle Enterprise Manager that is designed to assess the health of Oracle ZFS Storage Appliance. Health Check periodically monitors configuration settings in the ZFS Storage Appliance to check if they are within the Oracle recommendation best practices for performance and resilience. Health Check generates a stateful warning or alert if any setting doesn't meet the Oracle recommendation and automatically clear the warning or alert when it clears. Users can customize both the Health Check collection schedule and alert threshold type. In this release, Health Check includes seven individual categories, additional items will be added in future release. Note that Health Check is in synchronization with the ORAchk (Oracle Configuration Audit Tool) for ZFS Storage Appliance.
The following table summarizes the seven checks for Oracle ZFS Storage Appliance.
Table below shows the Benefit/Impact and Risk for each check.
Installation GuidePrerequisite1. OEMCC plugin version 2.1 is deployed and running on Oracle Enterprise Manager 12c, OEMCC plugin version 2.1.3 is deployed and running on Oracle Enterprise Manager 13c. 2. Grant the user role (if not "root") on all monitored ZFS targets the "shell" and "audit" privileges. Usually the user is "oracle_agent". To do that
Installation StepsThe attached deliverable in this note includes two metric extension .zip files and two Java .jar files. ME$HealthCheckWindows.zip is for the Enterprise Manager agent running on Windows platform. ME$HealthCheckNonWindows.zip is for Enterprise Manager agent running on all other platforms. Oracle_Grid.jar works for all OS platforms, but we have two versions of Oracle_Grid.jar for different OEMCC plugin versions. One is for version 2.1.3 only and the other is for version 2.1, please select the matched version of Oracle_Grid.jar for Enterprise Manager OEMCC plugin.
1. Login as the user role of the administrator of Enterprise Manager. 2. Replace the existing Oracle_Grid.jar with the new one Health Check Script is included in the Oracle_Grid.jar. Unlike common installation procedure, the user doesn't need to redeploy opar file or restart Enterprise Manager Agent/OMS. The user can replace this Oracle_Grid.jar file when EM is running. To Replace Oracle_Grid.jar:
Note: If the user has multiple agents monitoring ZFS targets, depending on the configuration and need, the user may need to repeat the steps above to replace all Oracle_Grid.jar under all running agents.
2. Import the correct ME$HealthCheck.zip on Enterprise Manager Browser UI.
The user should be able to see the metric extension from ZFS Target - Oracle ZFS Storage Appliance - Monitoring - All Metrics. For more details, refer to Oracle Enterprise Manager - Using Metric Extensions TroubleshootingQ: The status of added appliance shows error or unknown status and click "*Health Check" shows "Can not find ME$HealthCheck" error message. A: It typically means the Oracle_Grid.jar is not matched with the OEMCC plugin version correctly, make sure you download the matched version Oracle_Grid.jar from the attachment link. If your OEMCC plugin is version 2.1.3, please download Oracle_Grid.jar for v2.1.3, if your OEMCC plugin is version 2.1, please download Oracle_Grid.jar for v2.1.
Q: "*Health Check" on some targets don't show up in the metric list. A: Go to "Enterprise" - "Monitoring" - "Metric Extensions" - "Actions" - "Manage Target Deployments", see if the target is deployed. If not, select "Action" - "Deploy to Targets" and deploy Health Check on the target.
Q: Click "*Health Check" shows "ClassNotFound" error message. A: It typically means the Oracle_Grid.jar is not replaced correctly, the script cannot be found. Contact the administrator, go to cd <agent_homt>/plugins/oracle.sun.oss7.agent.plugin_12.1.0.7.0/scripts/emx/sun_storage_7000
Run the command below (<zfs_target_name> is the target hostname, username/password is the pair to log into ZFS target). Note that this command is for non-windows, for windows, replace the : with ;. java -classpath ./Oracle_Grid.jar:./ws-commons-java5-1.0.1.jar:./xmlrpc-common-3.1.2.jar:./xmlrpc-client-3.1.2.jar:./ws-commons-util-1.0.2.jar:./commons-logging-1.1.jar:./commons-codec-1.7.jar:./jsch-0.1.49.jar com.sun.s7000.client.RetrieveSSHResult <zfs_target_name> "HealthCheck.aksh" InstanceUser=<username> InstancePassword=<password> For Non-windows: CTRL + D (end of the text stream) For Windows: CTRL + Z (end of the text stream) Check if the result is printed out correctly. If not, contact the administrator. Otherwise, delete the metric extension and import the ME$HealthCheck.zip again.
Q: DNS Configuration check shows result: "INFO: Error running nslookup command, check failed" or "Error in getting result from appliance.Please contact the administrator. oracle.sysman.emSDK.emd.comm.MetricGetException: Error in getting result from appliance.Please contact the administrator." A: Grant the user role of ZFS target the "shell" and "audit" privileges, the steps are:
Uninstallation Steps
To uninstall Health Check metric extension: 1. Undeploy all targets:
2. Delete the metric extension:
For detail of delete the metric extension, please refer to Oracle Enterprise Manager - Using Metric Extensions Note: The user doesn't have to switch the Oracle_Grid.jar back, but if the user want, the user can replace the new jar back to the original Oracle_Grid.jar, steps to do that are the same with installation steps on swapping Oracle_Grid.jar.
Upgrade StepsHealth Check is updated version of ZCheck. If you previously installed ZCheck, the best way to upgrade to Health Check is to uninstall ZCheck first and install Health Check metric extension followed by the steps above from the beginning. Health Check Monitoring Features and Customization
Collection Schedule and CustomizationBy default, collection schedule is set to collect every 4 hours and upload to database on alert only. The user can customize this setting by changing the collection schedule. Alert threshold and CustomizationBy default, "CRITICAL" result is set under the critical threshold, "UNKNOWN" or "WARNING" results are set under the warning threshold. The user can customize this setting by changing the keywords under menu "Oracle ZFS Storage Appliance" - "Metric and Collection Settings" - metric "Health Check". For example, the user can set the result keyword "CRITICAL" to be a warning alert by using "UNKNOWN|WARNING|CRITICAL" under the column of warning threshold and delete "CRITICAL" under column critical threshold. Stateful AlertsAlerts are stateful, which get cleared automatically as ZFS configuration settings meet the Health Check recommendations. For details of remediation, please refer to "Remediation" section.
Remediation For Health Check
If the user encounters alerts on Enterprise Manager stating Health Check issue, please contact the administrator. This section is intended for Oracle ZFS Storage Appliance Administrator. Repair Action for Cluster Links
Check on the cable setting, contacting the administrator and reconfigure the cluster configuration on the owner node. For more information, please refer to Sun ZFS Storage 7000 System Administration Guide Repair Action for Analytics Retention Policy1. Log into ZFS Storage Appliance and set <property_type>=<property_value_in_hours> and commit the change. For example (1 month = 672 hours): analytics settings set retain_second_data=672
analytics settings commit
2. Verify the change: analytics settings show
Repair Action for DNS ConfigurationPlease consult with the DNS Network administrator for the specific domain or servers. 1. Log into ZFS Storage Appliance. 2(a). If set domain property, run: set domain=<domain_value> and commit the change, for example: configuration services dns set domain=my.example.com
configuration services dns commit
2(b). If set servers property, run: set servers=<server_value> and commit the change, for example: configuration services dns set servers=0.0.0.0
configuration services dns commit
3. Verify the changes: configuration services dns show
Repair Action for Datasets Check1. Log into ZFS Storage Appliance. 2. Show all active datasets, select the dataset to prune and choose: prune <hour/minute/second> to discard hour/minute/second data . For example: analytics dataset show
select dataset-000
confirm prune second
Now, if the user run command: show, the user should be able to see activity: pruning (x % completed) Note: Command: destroy <dataset> will discard the dataset and disable it, the user needs to create it again, run: create <dataset_name>
Repair Action for Backend Check
Repair Action for L2Arc Header Size CheckEnsure that the Oracle ZFS Storage Appliance containing this warning is up to date with the latest Oracle ZFS Storage Appliance software. Repair Action for Locked Server CheckUpgrade Oracle ZFS Storage Appliance software to current version.
Check for Currency 06-MAR-2018
Attachments This solution has no attachment |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|