Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
Solution Type: Technical Instruction Sure Solution

1533993.1: Collect XSCF snapshot(s) by running STB7.3 (or newer) domain Explorer on SPARC Enterprise M3000/M4000/M5000/M8000/M9000 (OPL) Servers
Applies to:

Sun SPARC Enterprise M3000 Server - Version All Versions and later
Sun SPARC Enterprise M9000-32 Server - Version All Versions and later
Sun SPARC Enterprise M4000 Server - Version All Versions and later
Sun SPARC Enterprise M9000-64 Server - Version All Versions and later
Sun SPARC Enterprise M5000 Server - Version All Versions and later
All Platforms

Goal

This document describes XSCF snapshot collection by running a domain Explorer. The Explorer module xscfextended is completely rewritten, starting with Services Tool Bundle (STB) 7.3.

The various ways to gather diagnostic data from either the XSCF, or the domain and XSCF combined, manually or in an automated fashion using STB 7.2 or older, are described in "Gathering diagnostic data for SPARC Enterprise M3000/M4000/M5000/M8000/M9000 (OPL) Servers (Doc ID 1008229.1)". How to upload the gathered data to Oracle is described in How to Upload Files to Oracle Support (Doc ID 1547088.2).

This document applies to all Mx000 servers, and describes the usage of the xscfextended module of Explorer 7.3 (as part of STB7.3) or newer, on both Solaris 10 and Solaris 11. Where needed, some basic Explorer facts are listed, to give a complete picture of how to use the reworked module. For reference purposes, the available manual options for collecting the XSCF snapshot are listed too.

To discuss this information further with Oracle experts and industry peers, we encourage you to review, join, or start a discussion in the My Oracle Support Community - M Series Servers.
Solution

Manually running snapshot on SPARC Enterprise Mx000 (OPL) eXtended System Control Facility (XSCF)

Snapshot stores the collected data on a remote network host or on an external media device, based upon the use of the -T, -t or -d option. When storing data on a remote network host, snapshot opens a network connection using SSH to act as a data pipe to the remote file.

Example of collecting a snapshot using the value for user@host:directory previously set using the setarchiving command:

    XSCF> snapshot -L F -T

Example of collecting a snapshot to host m9000-x-a.uk, as user joe, writing the snapshot file to directory /tmp/joe:

    XSCF> snapshot -L F -t joe@m9000-x-a.uk:/tmp/joe

Example of collecting a snapshot to an external USB stick:

    XSCF> snapshot -L F -d usb0

Bear in mind that the external media device connected to the XSCF's USB port is expected to have a partition 1, formatted with the FAT32 filesystem. The external USB device can have multiple partitions, as long as partition 1 is FAT32. That partition will then be used by the snapshot command. For more details on the snapshot command, please visit the manual page.
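The remote-host form of the command can be assembled from its three parts (user, host, directory). The following is a minimal sketch that only prints the command line to type at the XSCF> prompt; the helper name build_snapshot_cmd is hypothetical and is not part of any Oracle tool.

```shell
#!/bin/sh
# build_snapshot_cmd USER HOST DIRECTORY
# Hypothetical helper: prints the "snapshot -L F -t user@host:directory"
# command described above. It does not contact the XSCF.
build_snapshot_cmd() {
    user=$1
    host=$2
    dir=$3
    if [ -z "$user" ] || [ -z "$host" ] || [ -z "$dir" ]; then
        echo "usage: build_snapshot_cmd USER HOST DIRECTORY" >&2
        return 1
    fi
    printf 'snapshot -L F -t %s@%s:%s\n' "$user" "$host" "$dir"
}

# Example values taken from this document.
build_snapshot_cmd joe m9000-x-a.uk /tmp/joe
```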
Running Explorer on SPARC Enterprise Mx000 (OPL) Servers

Like on any other Oracle SPARC Server, you can run Explorer on the Solaris instance(s) on that server. When you run Explorer with the option -w xscfextended, it is capable of collecting a Snapshot on the specified XSCF(s). The Snapshot data will be in the xscf sub-directory of the host's Explorer.

When running an Explorer on an Mx000 Server domain, it makes sense to run it with -w xscfextended; that way a single Explorer data file contains information about the domain, its main XSCF and, where applicable, its spare XSCF. The xscfextended module can be run manually or in an automated fashion. This document describes both options in detail.

Be aware that to collect the XSCF snapshot via Explorer, Explorer does not need to run on the platform containing that XSCF. As long as Explorer can reach the XSCF over the network, it is capable of collecting the snapshot data via the -w xscfextended option. For simplicity, this document describes running Explorer on an Mx000 server domain.

NOTE: By choosing the Mx000 Solaris domain as the host to run the Explorer command on, it is assumed that the domain is healthy enough to stay up long enough for the customer to run the Explorer, retrieve the Explorer output and send it to Oracle for analysis. If the domain will not boot or stay up, consider using another machine as the Explorer host, or simply gather just the XSCF snapshot manually as described in the first section of this document.
The new Explorer 7.3 xscfextended module explained

This section describes the new features that come with the Explorer 7.3 xscfextended module and how to properly set them up. You can run the xscfextended module in interactive mode and in non-interactive mode. Furthermore, there are now two methods of collecting one or more XSCF snapshots:
The differences between the two methods, and how to properly set them up, are described further in How to setup the archival host. After installing STB7.3, you first need to run "explorer -g" to generate a defaults file that is usable by Explorer 7.3. Starting with Explorer 7.0, the package is distributed both as an SVR4 package and as an IPS package. The standard location for the output and configuration files is as follows:
For more details on Explorer, please visit the explorer manual pages.

As always, on the Solaris instance where you want to collect an Explorer, you run the command explorer as user root. If you want the Explorer program to collect one or more XSCF snapshots, you specify the xscfextended module to be run. The host on which you run the Explorer program is called the Explorer host; in this document we will refer to it as EXPLO_HOST.

The xscfextended module needs to be told for which XSCF units to collect the snapshot. This can be done by creating the file xscfinput.txt in ${XPL_ETC}, which will only be used in non-interactive mode. If running in interactive mode, the user will be prompted to enter the appropriate information. The xscfinput.txt layout and file attributes are described in How to setup the Explorer host.

For each XSCF snapshot we tell the xscfextended module to collect, it will contact what we call the archival host; in this document we will refer to it as ARCH_HOST. To contact ARCH_HOST, we use a special user called the archival user, to which we will refer in this document as ARCH_USER. In the home directory of ARCH_USER, there must be a file called .explorer, which contains information describing how to contact the XSCF and which method to use, LOG or SNP. The content of the file .explorer is described in How to setup the archival host.

With the information found in the .explorer file, the Explorer program on EXPLO_HOST will connect to the XSCF and start the snapshot command. The snapshot will be stored on ARCH_HOST, and after the complete snapshot file has arrived, the Explorer program will pull the zip file over to EXPLO_HOST.

In a nutshell, this is how it works. Run explorer -w default,xscfextended on EXPLO_HOST. The Explorer program will log in to ARCH_HOST as user ARCH_USER, for each specified XSCF. It will find all necessary details for each XSCF in the .explorer file, and start a snapshot collection accordingly. When the snapshot zip file has arrived on ARCH_HOST, EXPLO_HOST will pull that file over and put it in the ./xscf subdirectory of the Explorer output directory.

Please read on for a detailed description of the various pieces that make this work. Although the procedure turns out not to be too complex, at first the various variables, files, names and locations can easily make you lose track of how your setup looks. Please use the variables table that is attached to this document, and fill in the values you are going to use while going through the procedure.
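To keep track of the names introduced above, the values can also be written down as shell variables. The following is a minimal sketch, filled in with the example values used later in this document; XSCF_SECTION and the helper print_worksheet are hypothetical names chosen for illustration.

```shell
#!/bin/sh
# Worksheet values. These are the example values from this document;
# replace them with your own.
EXPLO_HOST="m3000-x-m"         # host on which explorer is run
ARCH_HOST="m3000-x-m"          # host that receives the snapshot zip file
ARCH_USER="rfxscf"             # user on ARCH_HOST that owns .explorer
XSCF_SECTION="m3000-x-m-xscf"  # section name in the .explorer file

# print_worksheet (hypothetical helper): prints the values as a small
# table and returns non-zero if any of them is empty.
print_worksheet() {
    rc=0
    for name in EXPLO_HOST ARCH_HOST ARCH_USER XSCF_SECTION; do
        eval "value=\${$name}"
        [ -n "$value" ] || { value="(not set)"; rc=1; }
        printf '%-13s %s\n' "$name" "$value"
    done
    return $rc
}

print_worksheet
```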
How to setup the archival host

The archival host must be a host that can be reached over the network by both the EXPLO_HOST and the XSCF. On the ARCH_HOST, you must have user ARCH_USER with a home directory to hold one or more snapshots. The xscfextended module connects to ARCH_HOST through su or ssh, depending on the content of the xscfinput.txt file. The Explorer program uses user ARCH_USER, and will look in the .explorer file for information on how to contact the XSCF.

ARCH_USER must be configured with a standard shell; setting up the user with a restricted shell will cause the xscfextended module to fail.

To allow non-interactive mode to run without prompting for any passwords, the password of user ARCH_USER should be recorded in the xscfinput.txt file. The password of user ARCH_USER is also needed by the snapshot command, to be able to write the snapshot file back to ARCH_HOST. The file xscfinput.txt is further explained in How to setup the Explorer host.

User ARCH_USER on host ARCH_HOST should have a file ${HOME}/.explorer, and its permissions should be 0400 or 0600. The .explorer file will have one or more sections, one section for each XSCF we want to be able to collect a snapshot on. A typical section looks like this:

    [XSCF:m3000-x-m-xscf]
    HST="m3000-x-m-xscf"
    USR="explo_dom0"
    TYP="SNP"
    DIR="snapshots"
    CLN="NONE"

This is what the usage of each line is:
Finally, it must be understood that the xscfextended module always works with an archival host. This section has described how to set up such an environment. The archival host environment is used regardless of whether the xscfextended module is run in interactive or non-interactive mode. Note that you can only run explorer -w default,xscfextended in an automated fashion if the setup is done in such a way that Explorer does not prompt for passwords. Now on to running in interactive mode. To check if you have properly setup the archival host, you should be able to:
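The two file-level requirements stated in this section (permissions of 0400 or 0600, and one [XSCF:name] section per XSCF) can be sanity-checked with a short script. The following is a minimal sketch, not an Oracle-supplied check; the helper names check_explorer_perms and show_xscf_section are hypothetical, and the parsing assumes the layout shown in the sample section. On Solaris, nawk or /usr/xpg4/bin/awk may be needed in place of awk for the -v option.

```shell
#!/bin/sh
# check_explorer_perms FILE
# Succeeds when FILE has mode 0400 or 0600, judged from the mode string
# printed by ls -l (a trailing ACL marker such as "." or "+" is tolerated).
check_explorer_perms() {
    mode=$(ls -ld "$1" 2>/dev/null | awk '{print $1}')
    case $mode in
        -r--------*|-rw-------*) return 0 ;;
        *) echo "unexpected permissions on $1: ${mode:-file not found}" >&2
           return 1 ;;
    esac
}

# show_xscf_section FILE SECTION
# Prints the KEY="value" lines of the [XSCF:SECTION] block in FILE.
# Returns non-zero when the section is missing or empty.
show_xscf_section() {
    awk -v want="[XSCF:$2]" '
        /^\[/                    { in_section = ($0 == want); next }
        in_section && /^[A-Z]+=/ { print; found = 1 }
        END                      { exit(found ? 0 : 1) }
    ' "$1"
}
```

Run as ARCH_USER on ARCH_HOST, a call such as check_explorer_perms "$HOME/.explorer" && show_xscf_section "$HOME/.explorer" m3000-x-m-xscf would print the five settings of that section if both conditions hold.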
How to run the Explorer 7.3 xscfextended module in interactive mode

To run the xscfextended module in interactive mode, first of all you need to run explorer -w default,xscfextended,interactive. Then, one or more of the following criteria must be met:
If you are only going to run in interactive mode, there is no need to setup the Explorer host. An interactive session of the module looks like this (non-relevant output omitted):

    .
    .
    Defined XSCF archive contexts:

    Select an action:
    D Define new XSCF archive context
    R Return
    Enter the selection
    Hit "Return" to accept the default (R)
    > D
    Enter the XSCF section name to use from .explorer file
    > m3000-x-m-xscf
    Enter the user for connecting to the XSCF archive host
    > rfxscf
    Enter the host name or IP address of an XSCF archive host, or '-' for localhost
    > -
    Defined XSCF archive contexts:
    1 m3000-x-m-xscf from rfxscf on local host
    Select an action:
    D Define new XSCF archive context
    S Suppress an existing XSCF archive context
    R Return
    Enter the selection
    Hit "Return" to accept the default (R)
    > R
    .
    .
    .
    Mar 09 15:17:03 m3000-x-m[21529] xscfextended:RUNNING
    Mar 09 15:17:04 m3000-x-m[21529] xscfextended:Collecting snapshot from m3000-x-m-xscf (can take time)
    Enter rfxscf password for archive host m3000-x-m.uk.oracle.com:
    Please re-enter it to confirm:
    Mar 09 15:18:01 m3000-x-m[21529] xscfextended:Data gathering complete for m3000-x-m-xscf
    .
    .

When running the xscfextended module in interactive mode, you are presented with a menu to enter one or more XSCFs from which you want to have a snapshot collected. To add an XSCF, type D and hit return:
Please be aware of the effect of the value of ARCH_HOST. If you enter "-", the Explorer program will use the su command to become user ARCH_USER on your EXPLO_HOST. All other input, including "localhost", will be considered an IP address or hostname to which the Explorer program will connect using ssh. In the process, you will be asked to provide the ARCH_USER password for accessing ARCH_HOST, both for use by explorer on EXPLO_HOST and by snapshot on the XSCF.

In the above example, we collected a single snapshot for the XSCF as specified in the m3000-x-m-xscf section (see How to setup the archival host). To automate all of the above, and to run in non-interactive mode, you first need to setup the Explorer host. Please read on.
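The rule for the ARCH_HOST value can be expressed as a small decision function. The following is a minimal sketch of that rule only; connection_method is a hypothetical name, the function does not call su or ssh, and rejecting an empty value is an assumption added here rather than documented Explorer behaviour.

```shell
#!/bin/sh
# connection_method ARCH_HOST
# Prints "su" when ARCH_HOST is "-", and "ssh" for any other value,
# including "localhost", following the rule described above.
connection_method() {
    case $1 in
        -)  echo su ;;
        "") echo "ARCH_HOST must not be empty" >&2   # assumption, see lead-in
            return 1 ;;
        *)  echo ssh ;;
    esac
}

for h in - localhost m3000-x-m 192.168.131.1; do
    printf '%-15s -> %s\n' "$h" "$(connection_method "$h")"
done
```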
How to setup the Explorer host

To run the xscfextended module in non-interactive mode, you need to setup the Explorer host. You setup an Explorer host by creating the input file xscfinput.txt in the ${XPL_ETC} directory. The file should be owned by user root and its permissions should be 0400 or 0600. A line in the xscfinput.txt file can be a type T, type X or type A:
A type A line describes how to connect to ARCH_HOST, and what section to use:
If you now start explorer -w default,xscfextended, the module will process the A lines in xscfinput.txt one by one:
One thing has not been covered yet. To use the non-interactive xscfextended module in an automated fashion, user root will need to be able to log in to the XSCF without being prompted for a password. How to make that work is described in How to run the Explorer 7.3 xscfextended module in non-interactive, automated mode. To check if you have properly setup the Explorer host, you should be able to:
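The type A lines and the file attributes described in this section can be inspected with a short script. The following is a minimal sketch under one stated assumption: the field order of a type A line (A, ARCH_HOST, ARCH_USER, section name, password) is inferred from the example xscfinput.txt at the end of this document, not from a format specification. The helper names list_a_lines and check_input_perms are hypothetical.

```shell
#!/bin/sh
# list_a_lines FILE
# Prints "ARCH_HOST ARCH_USER section" for every type A line in FILE,
# leaving out the password field. Field order is an inference from the
# example file in this document.
list_a_lines() {
    awk '$1 == "A" && NF >= 5 { print $2, $3, $4 }' "$1"
}

# check_input_perms FILE
# Succeeds when FILE is owned by root and has mode 0400 or 0600,
# judged from the output of ls -l.
check_input_perms() {
    info=$(ls -ld "$1" 2>/dev/null | awk '{print $1, $3}')
    case $info in
        "-r--------"*" root"|"-rw-------"*" root") return 0 ;;
        *) return 1 ;;
    esac
}
```

With the example file from this document, list_a_lines /etc/opt/SUNWexplo/xscfinput.txt would print one line per XSCF, with "-" in the first column for the entry that uses su.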
How to run the Explorer 7.3 xscfextended module in non-interactive, automated mode

In the previous section we learned how to setup EXPLO_HOST, and what happens if you run it in non-interactive mode. The missing step is to eliminate the XSCF asking for the password of the user defined in the XSCF section (lines HST and USR, see How to setup the archival host). To allow non-interactive mode to run without prompting for the XSCF password, we need to ensure that user root on EXPLO_HOST is able to run ssh -l USR HST without being prompted for a password:
At this point, you will be able to run Explorer on an M-series server domain and, in an automated fashion, collect a snapshot of the XSCF(s) as part of the Explorer data, without the Explorer program prompting you for any information. You are now ready to run the Explorer program in an automated fashion, for example through cron. To check if you have properly eliminated the XSCF asking for a password, you should be able to:
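Whether ssh -l USR HST still prompts for a password can be tested without hanging on the prompt by using the ssh client's BatchMode option, which makes ssh fail instead of asking. The following is a minimal sketch with two labeled assumptions: that the XSCF runs a command passed on the ssh command line, and that showhostname -a is an acceptable harmless command for the chosen XSCF user. Neither is confirmed by this document, so the remote command is left overridable. The names can_login_without_password, SSH_BIN and XSCF_TEST_CMD are hypothetical.

```shell
#!/bin/sh
# SSH_BIN can be overridden for dry runs; defaults to the ssh in PATH.
SSH_BIN=${SSH_BIN:-ssh}
# Remote command used for the test. Assumption: the XSCF accepts a
# command on the ssh command line; change this if it does not.
XSCF_TEST_CMD=${XSCF_TEST_CMD:-"showhostname -a"}

# can_login_without_password USR HST
# Runs ssh in batch mode, so a password prompt becomes a failure.
# Succeeds only if the login and the remote command both succeed.
can_login_without_password() {
    "$SSH_BIN" -o BatchMode=yes -l "$1" "$2" "$XSCF_TEST_CMD" \
        </dev/null >/dev/null 2>&1
}
```

Run as root on EXPLO_HOST, a call such as can_login_without_password explo_dom0 m3000-x-m-xscf && echo "no password prompt" uses the USR and HST values of the example section in this document.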
Final observations

If you don't get this to work, make the Explorer module more verbose by running explorer -verbose -w default,xscfextended. If that does not clear up the confusion, you can turn on "tracing" in two ways:

This will cause a very, very verbose run with a large amount of output, but it will tell you exactly what happens. After activating tracing, you can run /opt/SUNWexplo/bin/explorer -verbose -w default,xscfextended 2>&1 | tee -a /tmp/log.txt. All screen output will be in the specified output file; there might be a clue in the trace output just before any issue you experience.
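Because a traced run produces a lot of output, it can help to first pull out only the xscfextended lines that report a problem and then read the trace just before them. The following is a minimal sketch; the patterns are taken from the example messages in this document (WARNING, ERROR and "Unable to obtain information") and may not cover every message the module can print. The helper name show_xscf_problems is hypothetical. On Solaris, /usr/xpg4/bin/grep may be needed for the -E option.

```shell
#!/bin/sh
# show_xscf_problems LOGFILE
# Prints the xscfextended lines that report a warning or an error.
# Returns non-zero when no such line is found.
show_xscf_problems() {
    grep -E 'xscfextended:(WARNING|ERROR|Unable)' "$1"
}
```

For the log file written by the tee command above, this would be called as show_xscf_problems /tmp/log.txt.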
What can go wrong?

Some example output of common mistakes (there are likely more), using the input files as listed at the end of this document:

    .
    .
    Mar 18 19:24:05 m3000-x-m[26853] xscfextended:RUNNING
    Inside XSCF collection, obtaining information from m3000-x-m.uk.oracle.com
    Inside XSCF collection, validating required setup at m3000-x-m.uk.oracle.com
    Inside XSCF collection, validating m3000-x-m-xscf access
    Inside XSCF collection, gathering m3000-x-m-xscf information (can take time)
    Mar 18 19:24:05 m3000-x-m[26853] xscfextended:Collecting snapshot from m3000-x-m-xscf (can take time)
    Mar 18 19:24:09 m3000-x-m[26853] xscfextended:WARNING Auth cancel
    .
    .
If ARCH_USER is configured with a restricted shell, the XSCF snapshot cannot be received, but an explorer with xscf directory is created:

    .
    .
    Mar 18 22:35:13 m3000-x-m[15060] xscfextended:RUNNING
    Inside XSCF collection, obtaining information from m3000-x-m.uk.oracle.com
    Inside XSCF collection, validating required setup at m3000-x-m.uk.oracle.com
    Inside XSCF collection, validating m3000-x-m-xscf access
    Inside XSCF collection, gathering m3000-x-m-xscf information (can take time)
    Mar 18 22:35:13 m3000-x-m[15060] xscfextended:Collecting snapshot from m3000-x-m-xscf (can take time)
    Inside XSCF collection, obtaining snapshot file from m3000-x-m.uk.oracle.com
    Mar 18 22:35:22 m3000-x-m[15060] xscfextended:ERROR Snapshot not received from m3000-x-m-xscf
    .
    .
    Mar 18 22:35:28 m3000-x-m[15060] explorer: = = = stderr output from explorer = = =
    Mar 18 22:35:28 m3000-x-m[15060] explorer: cannot access parent directories
    Mar 18 22:35:28 m3000-x-m[15060] explorer: Removing previous explorers from /opt/SUNWexplo/output ...
    Mar 18 22:35:30 m3000-x-m[15060] explorer: Explorer finished
    #
    # ls -la /opt/SUNWexplo/output/explorer.8099577b.m3000-x-m-2013.03.18.21.34/xscf/m3000-x-m-xscf/
    total 6
    drwx------ 2 root root 512 Mar 18 22:35 .
    drwx------ 3 root root 512 Mar 18 22:35 ..
    -rw------- 1 root root 700 Mar 18 22:35 snapshot.out
    # cat /opt/SUNWexplo/output/explorer.8099577b.m3000-x-m-2013.03.18.21.34/xscf/m3000-x-m-xscf/snapshot.out
    Setting up ssh connection to rfxscf@m3000-x-m.uk.oracle.com...
    Failed to setup ssh connection.
    Verify password for 'rfxscf' on 'm3000-x-m.uk.oracle.com'.
    Verify this directory exists on 'm3000-x-m.uk.oracle.com' and can be accessed by rfxscf: /export/home/rfxscf/snapshots
    Error encountered connecting to remote host:
    - Failed to start secure shell session on m3000-x-m.uk.oracle.com
    - Program exited unexpectedly: /usr/bin/ssh
    - Output: "Warning: Permanently added the RSA host key for IP address '10.163.209.76' to the list of known hosts."
    - Output: "rfxscf@m3000-x-m.uk.oracle.com's password: "
    - Output: "rksh: /bin/sh: restricted"
    Error opening SSH target
    #
If ARCH_USER is configured with a restricted shell, section information cannot be found, and the explorer does not have the xscf directory:

    .
    .
    Mar 18 21:52:49 m3000-x-m[5905] xscfextended:RUNNING
    Inside XSCF collection, obtaining information from m3000-x-m.uk.oracle.com
    Inside XSCF collection, validating m3000-x-m access
    Inside XSCF collection, obtaining information from m3000-x-m
    Mar 18 21:53:02 m3000-x-m[5905] xscfextended:Unable to obtain information on 192.168.131.1 from /export/home/rfxscf/.explorer
    .
    .
This is how the output of a successful run looks:

    .
    .
    Mar 18 22:24:27 m3000-x-m[12769] xscfextended:RUNNING
    Inside XSCF collection, obtaining information from m3000-x-m.uk.oracle.com
    Inside XSCF collection, validating required setup at m3000-x-m.uk.oracle.com
    Inside XSCF collection, validating m3000-x-m-xscf access
    Inside XSCF collection, gathering m3000-x-m-xscf information (can take time)
    Mar 18 22:24:27 m3000-x-m[12769] xscfextended:Collecting snapshot from m3000-x-m-xscf (can take time)
    Inside XSCF collection, obtaining snapshot file from m3000-x-m.uk.oracle.com
    Inside XSCF collection, cleaning files from m3000-x-m.uk.oracle.com
    Mar 18 22:25:02 m3000-x-m[12769] xscfextended:Data gathering complete for m3000-x-m-xscf
    Inside XSCF collection, validating m3000-x-m access
    Inside XSCF collection, obtaining information from m3000-x-m
    Inside XSCF collection, validating required setup at m3000-x-m
    Inside XSCF collection, validating 192.168.131.1 access
    Inside XSCF collection, gathering 192.168.131.1 information (can take time)
    Mar 18 22:25:06 m3000-x-m[12769] xscfextended:Collecting snapshot from 192.168.131.1 (can take time)
    Inside XSCF collection, obtaining snapshot file from m3000-x-m
    Inside XSCF collection, cleaning files from m3000-x-m
    Mar 18 22:25:41 m3000-x-m[12769] xscfextended:Data gathering complete for 192.168.131.1
    .
    .
    Mar 18 22:25:44 m3000-x-m[12769] explorer: data collection complete
    Mar 18 22:25:46 m3000-x-m[12769] explorer: explorer.8099577b.m3000-x-m-2013.03.18.21.23.tar.gz created
    Mar 18 22:25:46 m3000-x-m[12769] explorer: Removing previous explorers from /opt/SUNWexplo/output ...
    Mar 18 22:25:48 m3000-x-m[12769] explorer: Explorer finished
    .
    .
    # ls -la /opt/SUNWexplo/output/explorer.8099577b.m3000-x-m-2013.03.18.21.23/xscf/
    total 8
    drwx------ 4 root root 512 Mar 18 22:25 .
    drwx------ 6 root root 512 Mar 18 22:25 ..
    drwx------ 2 root root 512 Mar 18 22:25 192.168.131.1
    drwx------ 2 root root 512 Mar 18 22:25 m3000-x-m-xscf
    #
    # more /opt/SUNWexplo/output/explorer.8099577b.m3000-x-m-2013.03.18.21.23/xscf/*/snapshot.out
    ::::::::::::::
    /opt/SUNWexplo/output/explorer.8099577b.m3000-x-m-2013.03.18.21.23/xscf/192.168.131.1/snapshot.out
    ::::::::::::::
    Setting up ssh connection to rfxscf@m3000-x-m...
    Collecting data into rfxscf@m3000-x-m:/export/home/rfxscf/snapshots/m3000-x-m-xscf_10.163.226.103_2013-03-18T21-25-41.zip
    Data collection complete
    ::::::::::::::
    /opt/SUNWexplo/output/explorer.8099577b.m3000-x-m-2013.03.18.21.23/xscf/m3000-x-m-xscf/snapshot.out
    ::::::::::::::
    Setting up ssh connection to rfxscf@m3000-x-m.uk.oracle.com...
    Collecting data into rfxscf@m3000-x-m.uk.oracle.com:/export/home/rfxscf/snapshots/m3000-x-m-xscf_10.163.226.103_2013-03-18T21-25-04.zip
    Data collection complete
    #
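The manual inspection shown above (one snapshot.out per XSCF, each ending in "Data collection complete") can be repeated for any Explorer output directory with a small loop. The following is a minimal sketch; check_snapshots is a hypothetical helper, and treating the phrase "Data collection complete" as the success marker is based only on the example outputs in this document.

```shell
#!/bin/sh
# check_snapshots XSCF_DIR
# For every XSCF_DIR/<name>/snapshot.out, reports whether the file
# contains "Data collection complete". Returns non-zero when a file
# lacks that line or when no snapshot.out file is found at all.
check_snapshots() {
    found=0
    failed=0
    for f in "$1"/*/snapshot.out; do
        [ -f "$f" ] || continue
        found=1
        if grep 'Data collection complete' "$f" >/dev/null; then
            echo "OK      $f"
        else
            echo "FAILED  $f"
            failed=1
        fi
    done
    [ "$found" -eq 1 ] && [ "$failed" -eq 0 ]
}
```

For the successful run above, the argument would be the xscf subdirectory of the Explorer output directory, for example /opt/SUNWexplo/output/explorer.8099577b.m3000-x-m-2013.03.18.21.23/xscf.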
What files are used in the examples in this document

On the EXPLO_HOST (m3000-x-m), we used the following xscfinput.txt file:

    # pwd
    /etc/opt/SUNWexplo
    # ls -la xscfinput.txt
    -rw------- 1 root root 82 Mar 9 19:14 xscfinput.txt
    # cat xscfinput.txt
    A - rfxscf m3000-x-m-xscf Gehe1mpjev
    A m3000-x-m rfxscf 192.168.131.1 Gehe1mpje
    #

On the ARCH_HOST (m3000-x-m), ARCH_USER (rfxscf) had the following .explorer file:

    $ pwd
    /export/home/rfxscf
    $ ls -al .explorer
    -rw------- 1 rfxscf other 198 Mar 1 19:00 .explorer
    $ cat .explorer
    [XSCF:m3000-x-m-xscf]
    CLN="NONE"
    DIR="snapshots"
    TYP="SNP"
    HST="m3000-x-m-xscf"
    USR="explo_dom0"
    [XSCF:192.168.131.1]
    CLN="NONE"
    DIR="snapshots"
    TYP="SNP"
    HST="192.168.131.1"
    USR="explo_dom1"
    $
Attachments

This solution has no attachment