Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1473515.1
Update Date:2018-01-08
Keywords:

Solution Type  Technical Instruction Sure

Solution  1473515.1 :   Pillar Axiom: How to Run Pitman on AxiomOne R5.X Systems  


Related Items
  • Pillar Axiom 600 Storage System
  •  
  • Pillar Axiom 500 Storage System
  •  
Related Categories
  • PLA-Support>Sun Systems>DISK>Axiom>SN-DK: Ax600
  •  




In this Document
Goal
Solution
References


Applies to:

Pillar Axiom 500 Storage System - Version All Versions and later
Pillar Axiom 600 Storage System - Version All Versions and later
Information in this document applies to any platform.

Goal

The intent of this document is to provide the instructions to run the internal Private Interconnect Fabric diagnostic utility (PITMan) on a field system as part of:

  • An installation,
  • Significant hardware reconfiguration,
  • Or health check.

This diagnostic is superior to PI Stats because traffic is sent on all paths simultaneously at levels higher than can typically be achieved by the host based generated traffic used with PI Stats. The Axiom can be online and does not interfere with data traffic during the run of this diagnostic utility. The diagnostic does not use user file systems or LUNs to function, so it has no effect on user data.

PITMan Diagnostics can only be run on AxiomONE release 05.00.xx and higher to provide this specific data collection to assist with troubleshooting.

The commands can be run in either:

If you want to understand how to interpret the data collection, please refer to <Document 1906880.1> Pillar Axiom: How to Interpret a Pitman Output on AxiomOne R5.X Systems.

 

IMPORTANT: If you move or replace cables on an Axiom with Optical PI cabling, always be sure to remove and reseat the SFP. If only the optical cable is removed and replaced or re-inserted, the connection may not come back online. There is a scan of all SFPs by the PIM internal fabric switch every 4 seconds, and changes are detected by that scan. The link status change caused by just removing and reseating or replacing the optical cable is not enough to invoke the Private Interconnect port database topology scan. As a matter of practice, you should always remove and re-insert the SFP rather than just the cable.

 

Should you still have questions after reading this document, do not hesitate to use My Oracle Support Communities. Communities put you in touch with industry professionals like yourself. They are monitored by Oracle support engineers, so you can expect reliable and correct answers. Ask questions and see what others are asking about in the Disk Storage Pillar Axiom System Community.

 

Solution

IMPORTANT: The attached utility has been tested to work under certain conditions but may have negative effects if improperly used. Customers should only used it under the direct supervision of Oracle Support.

 

Summary

Overview

This procedure invokes commands that are part of the Axiom software to generate traffic on the Private Interconnect Fibre channel fabric connections. This includes the connections on the slammer control unit PIM to the buddy control unit PIM and all the bricks that are attached. Pitman diagnostics are to be executed as part of a post installation validation, as part of a post maintenance validation when significant changes are made to the PI network, or as part of a system health check.

    • The initial run of the commands should run for 5 minutes.  This will identify any immediate PI issues.  If the test is free of errors, then proceed with running a 15 minute test.
    • If the initial 5 minute run produces errors, reseat the PI cables identified in the path producing errors and run another 5 minute test. If that test runs clean, a full 30 minute test needs to run with no errors reported.
    • If the second 5 minute test fails after reseating the cables in the chain, contact the Support Center for assistance in identifying hardware replacement. These will usually be Cables, RAID Controllers, or Private Interconnect Modules. You may be asked to disconnect a given subset of the hardware to isolate difficult problems.
    • After replacing any hardware, run another 5 minute test. If that test fails, contact the Support Center for assistance.  If the 5 minute test succeeds, run a 30 minute test which must complete without errors.

Pitman Diagnostic Options

These may be used in any combination. Some combinations are highly disruptive to host traffic. As of Patch 05.04.05, the diagstate option is inhibited on Axioms with more than two Slammers.

    • maxtraffic=on    Disruptive                Minimum 5.2.4, 5.3.6 or higher recommended.  Increases traffic to levels which will disrupt any host traffic.
    • diagstate=on     Disruptive                 Minimum 5.3.10.  Places the PIM FC switches in hub mode.  Also known as PortBlaster Mode.  All hosts should be turned off before using this mode.
    • pattern=on        Non-disruptive          Minimum 5.4.6.  Transmits rotating worst case patterns to isolate pattern sensitive components.

IMPORTANT: The use of either maxtraffic or diagstate mode is disruptive to data access. A maintenance window must be scheduled when using maxtraffic or diagstate mode.

 

Special Handling Instructions for Disruptive Pitman, using the maxtraffic and diagstate parameters

The maxtraffic mode requires Axiom Release 05.02.04 and higher. The maxtraffic mode sends traffic to a single destination port at a time, but changes that port every two seconds.
The diagstate mode is available in Axiom Release 05.03.10 and higher. The diagstate mode puts the PIM FC switches in hub mode to produce more internal traffic.

Releases 05.02.04 up to 05.03.05 have a software issue in the switching of destination ports. As a result, maxtraffic mode will only send traffic to a single, non-selectable port on these versions Do not use maxtraffic on releases from 05.02.04 up to and including 05.03.05. The fix for this issue is in 05.03.06. (Refer to <Bug 14346013>)

 


 

PITMan Diagnostics Testing Runs in the GUI

Note: This mode is non-disruptive to user traffic, however on a very heavily loaded system, the Traffic Generator may cause interference with user data access. On such heavily loaded systems, the normal user data may be enough traffic to diagnose most Private Interconnect errors.

 

  1. Open the Axiom Storage Service Manager to view the GUI of the Axiom. The hardware should be green and normal before proceeding.
  2. Select the Support Tab.
  3. Select System Trouble in the list.
  4. Right mouse click in the right hand window and choose Run Pitman Diagnostics. Additionally this can be invoked from the Action menu.
  5. In the command parameter window type SetAutoModeOff and click Execute. This sets the operational state to manual mode.
  6. Next type:

    • For non-disruptive:

      TrafficGenOn mode=auto peer=all pattern=on and click Execute.

      This will start the traffic generator in auto mode. This mode automatically selects the initiator and target ports to equally distribute the generated traffic through a particular FC network.
      The 5.4.x option pattern=on sends worst case rotating patterns, but is non-disruptive to user data. The Axiom must be in 5.4.6 or higher to use this option

    • For disruptive:

      TrafficGenOn mode=auto peer=all maxtraffic=on diagstate=on pattern=on and click Execute.

      maxtraffic mode requires 05.03.06, diagstate mode requires 05.04.06 or higher, and pattern mode requires 5.4.0 or higher.
      This will start the traffic generator in auto mode. This mode automatically selects the initiator and target ports to equally distribute the generated traffic through a particular FC network.
      The maxtraffic mode removes the limits on data traffic.
      The diagstate mode puts the FCIM switches in hub mode. Either maxtraffic or diagstate are disruptive to host traffic.
      Pattern mode sends rotating bit patterns to isolate pattern-sensitive components.

  7. Start the statistics monitor by typing StartRecordingStats interval=x duration=x.  The interval and duration can be changed for each test being run.  The time is listed in seconds.
    1. For the 5, 15, and 30 minute test the commands would look like the following. The interval timing represents how often the system records the system stats for that time frame.

      StartRecordingStats interval=60 duration=300
      StartRecordingStats interval=180 duration=900
      StartRecordingStats interval=360 duration=1800


  8. Once the time has expired on each test, run StopRecordingStats.
  9. Then it's time to collect the data. Type GetMarksDb RecordCount=all and execute.  It will work for a few moments and put the output in the window. Then select all the text and copy/paste to a text editor where it can be saved, zipped, and emailed to Oracle Support for evaluation.
  10. Once the file is safely captured run TrafficGenOff to stop the extra traffic on the PI network.
  11. Repeat the steps 7-9 if additional tests are needed.
  12. Please refer to the <Document 1547088.2> How to Upload Files to Oracle Support to transfer the results to the Oracle support for an existing SR.

 


 

PITMan Diagnostics Testing Runs in the CLI

Download the CLI from the Axiom GUI in the Support tab Utilities or from the http page of the Axiom IP address. Once installed here are the commands to run to obtain the same information as above.

  1. Set the operation state to manual.

    pcli.exe submit -H <AxiomIP> PerformPitmanCommand CommandParameter="setAutoModeOff"


  2. Start the traffic generator in auto mode.

    • For non-disruptive:

      pcli.exe submit -H <AxiomIP> PerformPitmanCommand CommandParameter="TrafficGenOn mode=auto peer=all pattern=on"


    • For disruptive:

      pcli.exe submit -H <AxiomIP> PerformPitmanCommand CommandParameter="TrafficGenOn mode=auto peer=all pattern=on maxtraffic=on diagstate=on"


      maxtraffic mode requires 05.03.06, diagstate mode requires 05.04.06 or higher, and pattern mode requires 5.4.0 or higher.
      This mode automatically selects the initiator and target ports to equally distribute the generated traffic through a particular FC network.
      The maxtraffic mode removes the limits on data traffic.
      The diagstate mode puts the FCIM switches in hub mode. Either maxtraffic or diagstate are disruptive to host traffic.
      Pattern mode sends rotating bit patterns to isolate pattern-sensitive components.

  3. Start the statistics monitor. The interval and duration can be changed for each test being run. The time is listed in seconds.

    pcli.exe submit -H <AxiomIP> PerformPitmanCommand CommandParameter="StartRecordingStats interval=x duration=x"


    1. For the 5,15,and 30 minute test the commands would look like the following. The interval timing represents how often the system records the system stats for that time frame.

      StartRecordingStats interval=60 duration=300
      StartRecordingStats interval=180 duration=900
      StartRecordingStats interval=360 duration=1800


  4. Once the time has expired on each test, you can stop the recording.

    pcli.exe submit -H <AxiomIP> PerformPitmanCommand CommandParameter="StopRecordingStats"


  5. To collect the statistics file for review. Run the follwing command and then redirect the output to a text file.

    pcli.exe submit -H <AxiomIP> -u <username> -p <password> PerformPitmanCommand CommandParameter="GetMarksDb RecordCount=all" > <output filename>
     

  6. Once the text file has been generated the traffic can then be turned off.

    pcli.exe submit -H <AxiomIP> PerformPitmanCommand CommandParameter="TrafficGenOff"


  7. Please refer to the <Document 1547088.2> How to Upload Files to Oracle Support to transfer the results to the Oracle support for an existing SR.


 

 

Using getPITMAN

The tool getPITMAN has been tested on the following platforms (this list will be updated when tests on other platforms will be completed):
  • Oracle Linux 6.
  • Ubuntu 12.04.
  • Solaris 11 x86.
  • Solaris 11 SPARC
  • Solaris 10 x86
  • Solaris 10 SPARC

The script has been tested with Bash 3.4.51 and above.

Oracle Support has made available a tool called getPITMAN for Windows/Solaris/Linux. The tool automates all the above commands and rules depending on the code version. The tool will also warn you if you choose to use the PortBlaster option.

A compressed archive is generated at the end and the console will advise if the Pitman was successful or if it needs further analysis by Support.

Syntax usage:

GetPITMAN [-t Session_Length] [-u user] [-p password] [-b] Pilot_IP_or_hostname
    Ex:
        GetPITMAN 10.1.1.1
        GetPITMAN -b 10.1.1.1
        GetPITMAN -u administrator -p mypassword -t 120 10.1.1.1

           Options must be placed before the ip/hostname
        -t : specify the session length in minute; 120 min by default
        -u : specify the user to collect the logs (interactive if omitted)
        -p : specify the user's password (interactive if omitted)
        -b : portblaster will use all the backend bandwidth and WILL impact the production
            It is recommended to power off all your hosts when using this option.
            You cannot use the portblaster option before 5.3.6

 

There is no pattern option because the script tests if the Axiom is in 5.4.6 or above and activates the option automatically.

 

Example:

bash-3.2$ ./getPITMAN.sh -t 3 -u administrator -p password 123.123.123.123
Starting PITMan for R5_AXIOM_SOURCE (<axiom SN#>)

Waiting 4 min ...
1 min... 2 min... 3 min... 4 min...


**No error has been found in the statistics**

Compressing <axiom SN#>_R5_AXIOM_SOURCE_PITMAN_2014-07-12_1647_3min.txt ...
Pitman Collection completed please attach <axiom SN#>_R5_AXIOM_SOURCE_PITMAN_2014-07-12_1647_3min.txt.gz to your SR
bash-3.2$

 

The getPITMAN file is attached to this Knowledge article (getPITMAN.zip for Windows and getPITMAN.sh for Linux/Solaris).

Please refer to the <Document 1547088.2> How to Upload Files to Oracle Support to transfer the results to the Oracle support for an existing SR.

 

References

<NOTE:1906880.1> - Pillar Axiom: How to Interpret a Pitman Output on AxiomOne R5.X Systems
<NOTE:1473492.1> - Pillar Axiom: How to interpret a PITMAN output from a PSG_PITMAN_EVENT_DIAGNOSTIC_RESULTS_AVAILABLE Callhome event
<NOTE:1609365.1> - Pillar Axiom: How to generate Axiom topology

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback