Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-71-1670078.1
Update Date:2018-04-05
Keywords:

Solution Type  Technical Instruction Sure

Solution  1670078.1 :   VSM6 how to map to FICON port for MVS HOST IOS051I INTERFACE TIMEOUT & IOS050I CHANNEL DETECTED ERROR  


Related Items
  • StorageTek Virtual Storage Manager System 6 (VSM6)
Related Categories
  • PLA-Support>Sun Systems>TAPE>Virtual Tape>SN-TP: VSM6


The customer sees messages like the following on the MVS (mainframe) host console:
N 4040000 SB90     2014118 16:26:26.17 JOB97304 00000010  IOS051I INTERFACE
TIMEOUT DETECTED ON 5648,BF,**,**02,PCHID=04A2
Within the VSM6 product, we need to map the above message to a FICON port to isolate the cause of the messages.

In this Document
Goal
Solution
References


Created from <SR 3-8965923147>

Applies to:

StorageTek Virtual Storage Manager System 6 (VSM6) - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.

Goal

 Map the host message to a FICON port on the VSM6. Doing so should lead to a quicker resolution if you need to engage engineering.

N 4040000 SB90     2014118 16:26:26.17 JOB97304 00000010  IOS051I INTERFACE
TIMEOUT DETECTED ON 5648,BF,**,**02,PCHID=04A2

Solution

 Below are the messages that the customer will see on the host console and in the host syslog.

  N 4040000 SB90     2014118 16:26:26.17 JOB97304 00000010  IOS051I INTERFACE
  TIMEOUT DETECTED ON 5648,BF,**,**02,PCHID=04A2
  N 4040000 SB90     2014118 16:26:26.17 JOB99867 00000010  IOS051I INTERFACE
  TIMEOUT DETECTED ON 5622,BF,DB,**02,PCHID=04A2
 
  N 4040000 SBCC     2014118 16:26:26.19 JOB00221 00000010  IOS051I INTERFACE
  TIMEOUT DETECTED ON 5684,BF,DB,**02,PCHID=05B8
 
  N 4040000 SCCC     2014118 16:26:26.17 JOB00491 00000010  IOS051I INTERFACE
  TIMEOUT DETECTED ON 5610,97,DB,**02,PCHID=0370
 
  N 4040000 SFCC     2014118 16:26:26.14 JOB01689 00000010  IOS051I INTERFACE
  TIMEOUT DETECTED ON 5626,9A,DB,**02,PCHID=0598
  N 4040000 SFCC     2014118 16:26:26.14 JOB09288 00000010  IOS051I INTERFACE
  TIMEOUT DETECTED ON 568F,9A,DB,**02,PCHID=0598
 
  N 4040000 SGCC     2014118 16:26:26.18 JOB48713 00000010  IOS051I INTERFACE
  TIMEOUT DETECTED ON 564A,99,DB,**02,PCHID=01C


The above host message IOS051I is broken down as follows:
IOS051I  - INTERFACE TIMEOUT DETECTED ON
dev,chp,cmd,stat,PCHID=pppp,EXTSS=ss

Explanation: The channel subsystem detected a timeout condition during the operation of device dev.
In the message text:
dev - The device number
chp - The channel path identifier (CHPID) if known; otherwise, this field is set to asterisks.
cmd - The failing command code, if known; otherwise, this field is set to asterisks.
stat - The device and subchannel status, if known; otherwise, this field is set to asterisks.
PCHID=pppp - The physical channel ID, if known.
EXTSS=ss - The subchannel extended status, if known.
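
The field breakdown above can be sketched as a small parser. This is only an illustration, not a tool shipped with VSM6: the function name parse_ios051i is hypothetical, and the pattern assumes a 4-digit hexadecimal device number and the comma-separated layout shown in the sample messages.

```python
import re

# Field layout taken from the message breakdown above:
# dev,chp,cmd,stat,PCHID=pppp[,EXTSS=ss]
# Assumption: device numbers are 4 hex digits, as in the sample messages.
IOS051I_RE = re.compile(
    r"IOS051I\s+INTERFACE\s+TIMEOUT\s+DETECTED\s+ON\s+"
    r"(?P<dev>[0-9A-F]{4}),(?P<chp>[0-9A-F*]{2}),(?P<cmd>[0-9A-F*]{2}),"
    r"(?P<stat>[0-9A-F*]{4})(?:,PCHID=(?P<pchid>[0-9A-F]+))?"
    r"(?:,EXTSS=(?P<extss>[0-9A-F]+))?"
)


def parse_ios051i(text):
    """Return the IOS051I fields as a dict, or None if no message is found."""
    # The console wraps the message over two lines, so collapse whitespace first.
    flat = " ".join(text.split())
    match = IOS051I_RE.search(flat)
    if match is None:
        return None
    fields = match.groupdict()
    # A field made up entirely of asterisks is unknown; report it as None.
    return {
        name: (None if value is not None and set(value) == {"*"} else value)
        for name, value in fields.items()
    }
```

For the first sample message, this returns dev 5648 and chp BF, with cmd reported as unknown because the host printed asterisks.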

 

To decode the above MVS host message, follow the steps below.
The example uses the first IOS051I message above:
IOS051I INTERFACE TIMEOUT DETECTED ON 5648,BF,**,**02,PCHID=04A2
Using the field breakdown above, 5648 is the customer device number and BF is the CHPID.  Assuming the customer's device number range for this VTSS is 5600-56FF, the last two hex digits of the device number are the "control unit" and the "device", in that order.  In VSM, these are coded as Vc0d, where c = control unit and d = device.  For 5648, c = 4 and d = 8, which gives V408.
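
As a minimal sketch of the Vc0d rule, the mapping can be written as below. The helper name vtd_name is hypothetical, and the code assumes, as the text does, that the VTSS device range starts on a 256-address boundary such as 5600-56FF, so that the last two hex digits map directly to control unit and device.

```python
def vtd_name(device_number):
    """Map a host device number such as '5648' to the VSM name 'V408'."""
    dev = device_number.strip().upper()
    if len(dev) < 2:
        raise ValueError("device number needs at least two hex digits")
    int(dev, 16)  # raises ValueError if the input is not hexadecimal
    control_unit, device = dev[-2], dev[-1]
    # Vc0d: c = control unit, d = device
    return f"V{control_unit}0{device}"
```

Applied to the other device numbers in the host messages, 568F becomes V80F and 564A becomes V40A.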


Now, looking at a current state save from the above system, we can map the message as follows:
1. Take the CHPID decoded above and identify which fd_user.####.log file it appears in.
    The grep command below finds the correct log file and process ID.

The fd_user.####.log file name has the format fd_user.<Process_ID>.log. There are 4 FICON process logs, one for each port on the VSM6.
The command below uses egrep to match each of the CHPIDs from the MVS host messages above.
--> grep chpid fd_user.* | egrep "0xBF|0x97|0x9A|0x99"
fd_user.6460.log:2014/04/12 00:54:24.445948  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M66 chpid=0xBF loop_id=01 s_id=681200
fd_user.6460.log:2014/04/12 00:54:24.447464  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M66 chpid=0x99 loop_id=02 s_id=681400
fd_user.6460.log:2014/04/12 00:54:24.451615  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M80 chpid=0x9A loop_id=03 s_id=683400
fd_user.6460.log:2014/04/12 00:54:24.461323  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M80 chpid=0x97 loop_id=06 s_id=689200
fd_user.6460.log:2014/04/12 00:54:24.484015  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002827 model=HA1 chpid=0xBF loop_id=07 s_id=68A700
fd_user.6460.log:2014/04/12 01:46:19.495809  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M66 chpid=0xBF loop_id=01 s_id=681200
fd_user.6460.log:2014/04/12 01:46:19.496953  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M66 chpid=0x99 loop_id=02 s_id=681400
fd_user.6460.log:2014/04/12 01:46:19.501954  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M80 chpid=0x9A loop_id=03 s_id=683400
fd_user.6460.log:2014/04/12 01:46:19.509149  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M80 chpid=0x97 loop_id=06 s_id=689200
fd_user.6460.log:2014/04/12 01:46:19.518657  6460:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002827 model=HA1 chpid=0xBF loop_id=07 s_id=68A700
fd_user.6471.log:2014/04/12 00:54:24.972100  6471:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M66 chpid=0x97 loop_id=02 s_id=671400
fd_user.6471.log:2014/04/12 00:54:24.975567  6471:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M80 chpid=0x97 loop_id=04 s_id=673400
fd_user.6471.log:2014/04/12 01:47:40.096880  6471:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M66 chpid=0x97 loop_id=02 s_id=671400
fd_user.6471.log:2014/04/12 01:47:40.100980  6471:I: FD:013f:fd_ae_thread.c:3467:                NF_RNID_MANUF_TYPE_MODEL:channel: manufacturer=IBM type=002817 model=M80 chpid=0x97 loop_id=04 s_id=673400
It is not typical to find the same CHPID on multiple ports, but it does happen (as shown above with process IDs 6460 and 6471).
However, the fact that all of the CHPIDs you are looking for appear on one port (process ID 6460) strongly suggests that this is the port you care about.
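
The reasoning above (pick the process whose log contains every CHPID of interest) can be sketched as follows. The function names are hypothetical, and the pattern assumes grep output in the form shown above, with the fd_user.<Process_ID>.log file name at the start of each line.

```python
import re
from collections import defaultdict

# Matches grep output such as:
# fd_user.6460.log:2014/04/12 ... chpid=0xBF loop_id=01 s_id=681200
LINE_RE = re.compile(
    r"fd_user\.(?P<pid>\d+)\.log:.*?chpid=0x(?P<chpid>[0-9A-Fa-f]{2})"
)


def chpids_by_process(grep_lines):
    """Return {process_id: set of CHPIDs seen in that process log}."""
    seen = defaultdict(set)
    for line in grep_lines:
        match = LINE_RE.search(line)
        if match:
            seen[match.group("pid")].add(match.group("chpid").upper())
    return dict(seen)


def processes_covering(grep_lines, wanted):
    """Return the process IDs whose logs contain every CHPID in wanted."""
    wanted = {chpid.upper() for chpid in wanted}
    return sorted(
        pid
        for pid, found in chpids_by_process(grep_lines).items()
        if wanted <= found
    )
```

With the grep output above and the CHPIDs BF, 97, 9A and 99, only process 6460 covers all four; process 6471 shows only 97.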

  

2. Use the command below to map the Process_ID found above to the FICON port number.

--> grep "[0-9]" Ficon*.pid
FiconPortProcess.p0.pid:6454
FiconPortProcess.p1.pid:6460
FiconPortProcess.p2.pid:6466
FiconPortProcess.p3.pid:6471
From the above output we find our process 6460 is on port 1 (FiconPortProcess.p1.pid:6460).
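
The same lookup can be sketched in a few lines. The function name port_for_pid is hypothetical, and the pattern assumes the grep output format shown above (FiconPortProcess.p<port>.pid:<Process_ID>).

```python
import re

# Matches grep output such as: FiconPortProcess.p1.pid:6460
PID_FILE_RE = re.compile(r"FiconPortProcess\.p(?P<port>\d+)\.pid:(?P<pid>\d+)")


def port_for_pid(grep_lines, pid):
    """Return the FICON port number that owns the process ID, or None."""
    for line in grep_lines:
        match = PID_FILE_RE.search(line)
        if match and match.group("pid") == str(pid):
            return int(match.group("port"))
    return None
```

For the output above, port_for_pid(lines, 6460) gives port 1.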

 

3. Look for abort (ABTS) messages in the tikka.out.p# port logs and match up the times and device numbers to confirm the port.

You will need to know the time difference between the MVS host and the VSM6 to correlate the messages.
The command below greps through the tikka.out.p1* files for ABTS messages, then greps for 16:, the hour in which the above IOS051I messages occurred.
Pay attention to the date of the IOS051I message as well; you may want to include the date in the grep command. The host prints the date as year plus day of year (2014118 is day 118 of 2014, that is, 2014-04-28).
--> grep -i abts tikka.out.p1.* | grep 16:
tikka.out.p1.12:2014-04-28 16:28:07.048988 p1>SBCU - SB Busy Task Error Recovery - Exchange Error(0x10) ABTS Received - OXID:026d V305 on Port 1 - thread:35.
tikka.out.p1.12:2014-04-28 16:28:07.049874 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:0230 V408 on Port 1 - thread:19, state SB_CMDSTATE_WAIT_FOR_STATUS_ACK, early eoc 0, chain flags 8209, reset flag 0, original reset_wait_flag 0, status_host_response_ptr ffffffff4b1ba9d4.
tikka.out.p1.12:2014-04-28 16:28:07.050523 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:023e V80F on Port 1 - thread:2, state SB_CMDSTATE_WAIT_FOR_STATUS_ACK, early eoc 1, chain flags 101, reset flag 0, original reset_wait_flag 0, status_host_response_ptr 0.
tikka.out.p1.12:2014-04-28 16:28:07.051163 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:0291 V206 on Port 1 - thread:6, state SB_CMDSTATE_WAIT_FOR_STATUS_ACK, early eoc 1, chain flags 103, reset flag 0, original reset_wait_flag 0, status_host_response_ptr 0.
tikka.out.p1.12:2014-04-28 16:28:07.051768 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:01ea V100 on Port 1 - thread:24, state SB_CMDSTATE_WAIT_CMD_STATUS, early eoc 0, chain flags 101, reset flag 90, original reset_wait_flag 80, status_host_response_ptr 0.
tikka.out.p1.12:2014-04-28 16:28:07.052386 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:014a V10C on Port 1 - thread:20, state SB_CMDSTATE_DO_READ_XFER, early eoc 0, chain flags 03, reset flag 90, original reset_wait_flag 80, status_host_response_ptr 0.
tikka.out.p1.12:2014-04-28 16:28:07.052986 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:0019 V804 on Port 1 - thread:12, state SB_CMDSTATE_FIRST_CMD, early eoc 0, chain flags 00, reset flag 90, original reset_wait_flag 80, status_host_response_ptr 0.
tikka.out.p1.12:2014-04-28 16:28:07.053593 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:0301 V307 on Port 1 - thread:8, state SB_CMDSTATE_DO_READ_XFER, early eoc 0, chain flags 03, reset flag 90, original reset_wait_flag 80, status_host_response_ptr 0.
tikka.out.p1.12:2014-04-28 16:28:07.054211 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:0374 V202 on Port 1 - thread:13, state SB_CMDSTATE_WAIT_FOR_STATUS_ACK, early eoc 1, chain flags 101, reset flag 0, original reset_wait_flag 0, status_host_response_ptr 0.
tikka.out.p1.12:2014-04-28 16:28:07.054843 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:03b5 V40A on Port 1 - thread:7, state SB_CMDSTATE_WAIT_FOR_STATUS_ACK, early eoc 0, chain flags 8c08, reset flag 0, original reset_wait_flag 0, status_host_response_ptr ffffffff4b1d29d4.
tikka.out.p1.12:2014-04-28 16:28:07.055530 p1>SBCU - CMD Error Recovery - Exchange Error(0x10) ABTS Received - OXID:0217 V209 on Port 1 - thread:0, state SB_CMDSTATE_WAIT_FOR_STATUS_ACK, early eoc 1, chain flags 101, reset flag 0, original reset_wait_flag 0, status_host_response_ptr 0.
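
As a rough sketch of the correlation step, the code below converts the host date and time to a calendar timestamp and keeps only the ABTS entries for one device that fall within a time window. The function names and the 300-second default window are assumptions for illustration; set the window from the known clock difference between the host and the VSM6. The pattern assumes the tikka.out line format shown above.

```python
import re
from datetime import datetime


def host_timestamp(julian_date, time_of_day):
    """Convert host fields such as '2014118' and '16:26:26.17' to a datetime."""
    # %j is the day of the year, so 2014118 is day 118 of 2014.
    return datetime.strptime(f"{julian_date} {time_of_day}", "%Y%j %H:%M:%S.%f")


# Matches lines such as:
# tikka.out.p1.12:2014-04-28 16:28:07.049874 p1>SBCU - ... ABTS Received -
#   OXID:0230 V408 on Port 1 - thread:19, ...
ABTS_RE = re.compile(
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{6}).*?ABTS Received - "
    r"OXID:(?P<oxid>[0-9a-fA-F]+) (?P<vtd>V[0-9A-F]{3}) on Port (?P<port>\d+)"
)


def abts_near(log_lines, vtd, host_time, window_seconds=300):
    """Return (timestamp, port, oxid) for ABTS entries on vtd near host_time."""
    hits = []
    for line in log_lines:
        match = ABTS_RE.search(line)
        if not match or match.group("vtd") != vtd:
            continue
        ts = datetime.strptime(match.group("ts"), "%Y-%m-%d %H:%M:%S.%f")
        # Assumed window; it must cover the host-to-VSM6 clock difference.
        if abs((ts - host_time).total_seconds()) <= window_seconds:
            hits.append((ts, int(match.group("port")), match.group("oxid")))
    return hits
```

For the first IOS051I message (device 5648, so V408), the host time 2014118 16:26:26.17 lines up with the V408 ABTS entry logged on Port 1 at 16:28:07, roughly 101 seconds later by the VSM6 clock.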

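 From the above we can see that we need to focus on port 1 (p1) in the tikka.out.p1.12 file. V408, the device from the first IOS051I message, appears among the ABTS entries. The port in question is the bottom port on the left-hand FICON card when looking at the back of the system.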



References

<NOTE:1510362.1> - VSM6 - How to initiate a manual ASR to collect Support File Bundle (SFB)

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.