Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-71-2214774.1
Update Date: 2017-06-06
Keywords:

Solution Type  Technical Instruction Sure

Solution  2214774.1 :   How to Troubleshoot an SPU Switchover  


Related Items
  • Net-Net 9200
Related Categories
  • PLA-Support>Sun Systems>CommsGBU>Session Delivery Network>SN-SND: Acme Service Provider




In this Document
Goal
Solution
References


Created from <SR 3-13623887651>

Applies to:

Net-Net 9200 - Version S-D7.1.0 to S-D7.2.3 [Release S-D7.0]
Net-Net 9200
SPU switchover

Goal

An SPU switchover occurred and the reason for it needs to be identified.
 

Solution

The logs show the following messages:

Nov 11 07:35:57.337 [MINOR] Alarm ID:20010202 Task ID:tSM@0.0.0 Severity:MINOR Time:2016-11-11 07:35:57.337 [Health Check Timeout on Card SPU0]
Nov 11 07:36:04.339 [MINOR] Alarm ID:20010202 Task ID:tSM@0.0.0 Severity:MINOR Time:2016-11-11 07:35:57.337 [Health Check Timeout on Card SPU0]
Nov 11 07:36:11.342 [MINOR] Alarm ID:20010202 Task ID:tSM@0.0.0 Severity:MINOR Time:2016-11-11 07:35:57.337 [Health Check Timeout on Card SPU0]
Nov 11 07:36:11.343 [PROC] Alarm ID:21e10210 Task ID:tSM@0.0.0 Severity:NOTICE Time:2016-11-11 07:36:11.341 [Health Score Dropped to 40 on Card SPU0]

Nov 11 07:36:11.404 [PROC] Alarm ID:22610224 Task ID:tSM@0.0.0 Severity:NOTICE Time:2016-11-11 07:36:11.361 [SPU0 Switchover to Recovery Role]
Nov 11 07:36:11.404 [PROC] Alarm ID:22611222 Task ID:tSM@1.0.0 Severity:NOTICE Time:2016-11-11 07:36:11.361 [SPU1 Switchover to Active Role]
Nov 11 07:36:11.404 [PROC] Alarm ID:22610224 Task ID:tSM@0.0.0 Severity:NOTICE Time:2016-11-11 07:36:11.404 [SPU0 Switchover to Recovery Role]
Nov 11 07:36:11.404 [PROC] Alarm ID:21e10210 Task ID:tSM@0.0.0 Severity:NOTICE Time:2016-11-11 07:36:11.404 [Health Score Dropped to 40 on Card SPU0]
Nov 11 07:36:11.404 [MINOR] Alarm ID:20010202 Task ID:tSM@0.0.0 Severity:MINOR Time:2016-11-11 07:36:11.404 [Health Check Timeout on Card SPU0]


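The standby SPU sends health check messages to the active SPU. If the active SPU fails to respond to three consecutive messages, the health check times out and the health score of the active SPU is reduced. In the log above, three Health Check Timeout alarms are logged for SPU0 about 7 seconds apart, its health score drops to 40, and SPU1 takes over the active role. This health score drop caused the switchover.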
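
The sequence described above can be checked against a captured log. The following is a minimal Python sketch, not an Oracle tool: it assumes the alarm line layout seen in the sample messages in this note (the regular expression and the function name timeouts_before_score_drop are illustrative) and English month abbreviations. It collects the health check timeouts logged for a card before its first health score drop and reports the gaps between them.

```python
import re
from datetime import datetime

# Field layout inferred from the sample alarm lines in this note only.
LINE_RE = re.compile(
    r"^(?P<logged>\w{3} +\d+ \d{2}:\d{2}:\d{2}\.\d{3}) \[\w+\] "
    r"Alarm ID:(?P<alarm>\w+) Task ID:(?P<task>\S+) "
    r"Severity:(?P<severity>\w+) "
    r"Time:(?P<year>\d{4})-\S+ \S+ \[(?P<message>[^\]]+)\]"
)

# The first four alarm lines quoted in this note.
SAMPLE_LOG = """\
Nov 11 07:35:57.337 [MINOR] Alarm ID:20010202 Task ID:tSM@0.0.0 Severity:MINOR Time:2016-11-11 07:35:57.337 [Health Check Timeout on Card SPU0]
Nov 11 07:36:04.339 [MINOR] Alarm ID:20010202 Task ID:tSM@0.0.0 Severity:MINOR Time:2016-11-11 07:35:57.337 [Health Check Timeout on Card SPU0]
Nov 11 07:36:11.342 [MINOR] Alarm ID:20010202 Task ID:tSM@0.0.0 Severity:MINOR Time:2016-11-11 07:35:57.337 [Health Check Timeout on Card SPU0]
Nov 11 07:36:11.343 [PROC] Alarm ID:21e10210 Task ID:tSM@0.0.0 Severity:NOTICE Time:2016-11-11 07:36:11.341 [Health Score Dropped to 40 on Card SPU0]
"""


def timeouts_before_score_drop(lines, card="SPU0"):
    """Return the logged times of health check timeouts on `card`
    that precede its first health score drop."""
    stamps = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m is None:
            continue  # not an alarm line in the assumed layout
        message = m["message"]
        if message == f"Health Check Timeout on Card {card}":
            # The line prefix has no year, so borrow it from the Time: field.
            stamps.append(datetime.strptime(
                f"{m['year']} {m['logged']}", "%Y %b %d %H:%M:%S.%f"))
        elif message.startswith("Health Score Dropped") and message.endswith(card):
            break
    return stamps


stamps = timeouts_before_score_drop(SAMPLE_LOG.splitlines())
gaps = [(b - a).total_seconds() for a, b in zip(stamps, stamps[1:])]
print(len(stamps), "timeouts before the score drop; gaps in seconds:", gaps)
```

The sketch uses the timestamp at the start of each line rather than the Time: field, because in the sample the Time: field of the repeated timeout alarm stays at the time the alarm was first raised.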

Monitor the system for further switchovers.

If switchovers recur, the SPU may need to be replaced (see NOTE:1998679.1 under References).
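
To help with this monitoring, the switchover alarms in a saved log can be counted per card. The following is a minimal Python sketch under stated assumptions: it matches only the "Switchover to Active Role" message as it appears in the sample log, and it treats lines with the same card and Time: value as one event. The function name count_switchovers is illustrative, and how many switchovers justify a replacement is left to the engineer.

```python
import re
from collections import Counter

# Matches the switchover message format seen in the sample log in this note.
SWITCHOVER_RE = re.compile(
    r"Time:(?P<time>\S+ \S+) \[(?P<card>SPU\d+) Switchover to Active Role\]"
)

# Switchover lines quoted in this note.
SAMPLE_LOG = """\
Nov 11 07:36:11.404 [PROC] Alarm ID:22610224 Task ID:tSM@0.0.0 Severity:NOTICE Time:2016-11-11 07:36:11.361 [SPU0 Switchover to Recovery Role]
Nov 11 07:36:11.404 [PROC] Alarm ID:22611222 Task ID:tSM@1.0.0 Severity:NOTICE Time:2016-11-11 07:36:11.361 [SPU1 Switchover to Active Role]
Nov 11 07:36:11.404 [PROC] Alarm ID:22610224 Task ID:tSM@0.0.0 Severity:NOTICE Time:2016-11-11 07:36:11.404 [SPU0 Switchover to Recovery Role]
"""


def count_switchovers(lines):
    """Count distinct 'Switchover to Active Role' events per card.

    Events are de-duplicated on (card, Time: field), assuming a re-logged
    alarm keeps the same Time: value.
    """
    seen = set()
    for line in lines:
        m = SWITCHOVER_RE.search(line)
        if m:
            seen.add((m["card"], m["time"]))
    return Counter(card for card, _ in seen)


counts = count_switchovers(SAMPLE_LOG.splitlines())
for card, n in sorted(counts.items()):
    print(f"{card} became active {n} time(s)")
```

Running the same function over logs collected on later days gives a simple way to see whether the switchover was a one-off or is repeating.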

Upgrade the system to at least version nnSD720m3p10, which includes the fix for sending zero-byte CDRs.
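
Whether a running image meets this minimum can be checked with a small comparison. The following Python sketch is illustrative only: the version layout ("nn", product letters, release digits, optional m<N> maintenance level, optional p<N> patch level) is an assumption inferred from the single name nnSD720m3p10, and the helper names are hypothetical. Names that do not fit the assumed layout are rejected rather than guessed at.

```python
import re

# Assumed layout, inferred only from the example "nnSD720m3p10".
VERSION_RE = re.compile(
    r"^nn(?P<product>[A-Z]+)(?P<release>\d+)(?:m(?P<m>\d+))?(?:p(?P<p>\d+))?$"
)


def parse_version(name):
    """Split a version name into (release, maintenance, patch) integers."""
    match = VERSION_RE.match(name)
    if match is None:
        raise ValueError(f"unrecognized version string: {name!r}")
    # Missing m/p parts are treated as level 0 (an assumption).
    return (int(match["release"]), int(match["m"] or 0), int(match["p"] or 0))


def meets_minimum(running, minimum="nnSD720m3p10"):
    """Return True if `running` is at or above `minimum`."""
    return parse_version(running) >= parse_version(minimum)


# "nnSD720m3p9" is a made-up name used only to exercise the comparison.
print(meets_minimum("nnSD720m3p10"))
print(meets_minimum("nnSD720m3p9"))
```

The comparison is a plain tuple comparison, so the release number is compared first, then the maintenance level, then the patch level.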
 

Note: Refer to the Troubleshooting guide for the software upgrade procedure.

http://docs.oracle.com/cd/E50397_01/doc/sd_sd720_troubleshooting.pdf

References

<NOTE:166650.1> - Working Effectively With Oracle Support - Best Practices
<NOTE:1998679.1> - How to replace SPU card on Acme packet 9200

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.