Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-72-2116961.1
Update Date:2016-03-16

Solution Type  Problem Resolution Sure

Solution  2116961.1 :   Oracle ZFS Storage Appliance: Replication fails with "stage 'submit_job' failed: failed to invoke receive() XDR: ssl connection closed"  


Related Items
  • Sun ZFS Storage 7420
  • Sun Storage 7110 Unified Storage System
  • Oracle ZFS Storage ZS3-2
  • Sun Storage 7210 Unified Storage System
  • Oracle ZFS Storage ZS4-4
  • Sun Storage 7410 Unified Storage System
  • Sun ZFS Storage 7120
  • Sun Storage 7310 Unified Storage System
  • Oracle ZFS Storage ZS3-4
  • Sun ZFS Storage 7320
  • Oracle ZFS Storage Appliance Racked System ZS4-4
  • Oracle ZFS Storage ZS3-BA
Related Categories
  • PLA-Support>Sun Systems>DISK>ZFS Storage>SN-DK: 7xxx NAS


Replicating from both peer cluster nodes to a single target IP address causes replication problems, including this error on one head: "stage 'submit_job' failed: failed to invoke receive() XDR: ssl connection closed"

In this Document
Symptoms
Cause
Solution


Created from <SR 3-12285111941>

Applies to:

Sun ZFS Storage 7420 - Version All Versions and later
Sun ZFS Storage 7320 - Version All Versions and later
Sun ZFS Storage 7120 - Version All Versions and later
Sun Storage 7410 Unified Storage System - Version All Versions and later
Sun Storage 7310 Unified Storage System - Version All Versions and later
7000 Appliance OS (Fishworks)

Symptoms

A clustered ZFS Storage Appliance (ZFSSA) that has replication configured on both heads to the same target IP address may fail to replicate, especially if replication is continuous.

On one source head, the alert reports the error "stage 'submit_job' failed: failed to invoke receive() XDR: ssl connection closed":

Thu Mar 3 15:04:27 2016
nvlist version: 0
        project = RHEV
        target_host = dmz-dr1
        source = appliance/kit/akd:default
        class = alert.ak.appliance.nas.project.replication.send.fail.net
        result = failure
        ak_errmsg = stage 'submit_job' failed: failed to invoke receive() XDR: ssl connection closed
        uuid = 1c8599a7-c5b8-cb8e-c681-fd10ac59ad1c
        link =
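
To recognize this alert in collected logs, the dump above can be parsed as simple "key = value" lines. The following is a minimal Python sketch, not an appliance tool: the function names parse_alert and matches_symptom are hypothetical, and it assumes the alert text has been saved in exactly the layout shown above.

```python
# The alert text is copied from the example above.
ALERT = """\
Thu Mar 3 15:04:27 2016
nvlist version: 0
        project = RHEV
        target_host = dmz-dr1
        source = appliance/kit/akd:default
        class = alert.ak.appliance.nas.project.replication.send.fail.net
        result = failure
        ak_errmsg = stage 'submit_job' failed: failed to invoke receive() XDR: ssl connection closed
        uuid = 1c8599a7-c5b8-cb8e-c681-fd10ac59ad1c
        link =
"""

SEND_FAIL_CLASS = "alert.ak.appliance.nas.project.replication.send.fail.net"


def parse_alert(text):
    """Collect the 'key = value' lines of an alert dump into a dict (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition("=")
        if sep:  # lines without '=' (timestamp, 'nvlist version') are skipped
            fields[key.strip()] = value.strip()
    return fields


def matches_symptom(fields):
    """True when the alert is the network send failure with the SSL-closed message."""
    return (
        fields.get("class") == SEND_FAIL_CLASS
        and "ssl connection closed" in fields.get("ak_errmsg", "")
    )


fields = parse_alert(ALERT)
print(fields["project"], fields["target_host"], matches_symptom(fields))
```

The project and target_host fields identify which replication action failed and which target it was sending to, which helps when checking whether the peer head uses the same target address.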

 

The target may report the replication as cancelled (alert class alert.ak.appliance.nas.project.replication.receive.fail.cancelled).

 

Cause

Both peer heads of the clustered ZFSSA are replicating to the same target IP address.

 

Solution

Configure two different IP addresses for replication on the target, so that one cluster peer replicates to the first IP address and the other cluster peer replicates to the second.

Both addresses can be on the same physical NIC; the target only needs a second IP address that is able to use that NIC.
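
A replication layout can be checked for this condition before or after the change. The following Python sketch is an illustration under stated assumptions: the function shared_targets, the head names head-a and head-b, and the 192.0.2.x addresses are all hypothetical placeholders, and the list of (source head, target address) pairs is assumed to have been gathered by hand from the replication actions on each head.

```python
from collections import defaultdict


def shared_targets(actions):
    """Return target addresses that more than one source head replicates to.

    actions: iterable of (source_head, target_address) pairs.
    """
    heads_by_target = defaultdict(set)
    for source_head, target_address in actions:
        heads_by_target[target_address].add(source_head)
    return {
        address: sorted(heads)
        for address, heads in heads_by_target.items()
        if len(heads) > 1
    }


# Hypothetical layouts: head names and addresses are placeholders.
before = [("head-a", "192.0.2.10"), ("head-b", "192.0.2.10")]  # both peers share one address
after = [("head-a", "192.0.2.10"), ("head-b", "192.0.2.11")]   # one address per peer

print(shared_targets(before))
print(shared_targets(after))
```

An empty result means each target address is used by at most one source head, which is the layout described in the Solution.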

 


  Copyright © 2018 Oracle, Inc.  All rights reserved.