Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-2197335.1
Update Date:2016-10-27
Keywords:

Solution Type  Problem Resolution Sure

Solution  2197335.1 :   After Upgrade to DSR R7.1.2 Inetmerge Process Stuck with Audit Status on Standby DR NO  


Related Items
  • Oracle Communications Diameter Signaling Router (DSR)
  •  
Related Categories
  • PLA-Support>Sun Systems>CommsGBU>Global Signaling Solutions>SN-SND: Tekelec DSR
  •  




In this Document
Symptoms
Changes
Cause
Solution
References


Created from <SR 3-13496436721>

Applies to:

Oracle Communications Diameter Signaling Router (DSR) - Version DSR 7.1.2 and later
Information in this document applies to any platform.

Symptoms

The inetmerge process is stuck with Audit status on the standby DR-NO Server after the upgrade to Release 7.1.2:

[admusr@xxdsrno02 ~]$ inetmstat -w
nodeId InetMerge State dir dSeq dTime updTime info
xxdsrno01 Standby To 0 0.00 09:39:49
xxdsrno01 Standby From 0 0.00 09:39:49
xxsrno02 Active To 0 0.00 09:39:49
xxdsrno02 Audit From 0 0.00 09:39:49
xxdsrno01 Standby To 0 0.00 09:39:49
xxdsrno01 Standby From 0 0.00 09:39:49
xxdsrso01 Standby From 0 0.00 09:39:49
xxdsrso02 Standby From 0 0.00 09:39:49
xxdsrso01 Standby From 0 0.00 09:39:49
xxdsrso02 Standby From 0 0.00 09:39:49

inetmerge log:

[root@xxdsrno02 ~]# tt inetmerge
+ cf.follow -10 /var/TKLC/rundb/run/proc/inetmerge/trace.dat
1018:184916.278 TR-V AUDIT [Receiver] chadsrno02: Audit Started. [12507/MergeReceiver.cxx:417]
1018:184916.292 TR-V AUDIT [Receiver] chadsrno02: Deleting fragment in table SNMPTrapLog [12507/MergeReceiver.cxx:559]
1018:184916.502 TR-V AUDIT [Receiver] chadsrno02: Audit Started. [12507/MergeReceiver.cxx:417]
1018:184916.516 TR-V AUDIT [Receiver] chadsrno02: Deleting fragment in table SNMPTrapLog [12507/MergeReceiver.cxx:559]
1018:184916.729 TR-V AUDIT [Receiver] chadsrno02: Audit Started. [12507/MergeReceiver.cxx:417]
1018:184916.742 TR-V AUDIT [Receiver] chadsrno02: Deleting fragment in table SNMPTrapLog [12507/MergeReceiver.cxx:559]
1018:184916.957 TR-V AUDIT [Receiver] chadsrno02: Audit Started. [12507/MergeReceiver.cxx:417]
1018:184916.970 TR-V AUDIT [Receiver] chadsrno02: Deleting fragment in table SNMPTrapLog [12507/MergeReceiver.cxx:559]
1018:184917.188 TR-V AUDIT [Receiver] chadsrno02: Audit Started. [12507/MergeReceiver.cxx:417]
1018:184917.202 TR-V AUDIT [Receiver] chadsrno02: Deleting fragment in table SNMPTrapLog [12507/MergeReceiver.cxx:559]

Changes

  • The upgrade of NOAM servers (2 NOAM servers and 2 DR-NOAM) completed from DSR 6.0.2 to DSR 7.1.2
  • SOAM and MP servers were still in DSR 6.0.2 and schedule for upgrade in next MTC.
  • The inetmerge process was seen stuck only on the standby DR NO Server

Cause

This appears to be a case of BUG 23743443. The part file for the SNMPTrapLog table is not being properly deleted from the file system, to make room for newer data to come in during the audit.

The issue can be identified by inetmerge trace of the form "Deleting fragment in table ..." for the same table repeated numerous times in quick succession.

Solution

Open a request to Oracle Support.

In this specific case for this table, the recovery/workaround would be the following procedure:

On the afflicted node:

  1. Put the standby NOAM into Forced standby from the GUI
  2. Bring COMCOL to DBdown state
    prod.dbdown
  3. Delete SNMPTraplog Part files from disk.
    $ cd $DB/AppworksSNMPlog.0001
    $ rm -f *
  4. Verify the files are gone:
    $ ls
  5. Start COMCOL:
    $ prod.start
  6. Restore NOAM state from the GUI

 

References

<BUG:23743443> - INETMERGE: ENDLESS AUDIT LOOP BETWEEN ACTIVE & STANDBY NO

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback