Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition

Asset ID: 1-77-1968958.1
Update Date: 2016-12-07
Keywords:

Solution Type  Sun Alert Sure

Solution  1968958.1 :   Sun System Firmware 8.6.0.b, 8.6.0.c, and 9.3.0.b through 9.3.0.f May Cause Guest Domains to be Unresponsive Following a Live Migration  


Related Items
  • SPARC M5-32
  • SPARC T5-1B
  • SPARC T4-2
  • Sun Software - Generic
  • Netra SPARC T5-1B Server Module
  • SPARC T4-1B
  • SPARC T4-1
  • SPARC T5-2
  • SPARC T5-4
  • SPARC M6-32
  • SPARC T4-4
  • SPARC T5-8
  • Sun Hardware - Generic
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: Sun Alert
  • _Old GCS Categories>Sun Microsystems>Sun Alert>Criteria Category>Availability




In this Document
Description
Occurrence
Symptoms
Workaround
Patches
History
References


Applies to:

SPARC T4-2
SPARC T4-1B
SPARC T4-1
SPARC T4-4
SPARC T5-2
SPARC
SPARC T4-1B Server Module
Netra SPARC T4-1 Server
Netra SPARC T4-2 Server
Netra SPARC T4-1B Server Module
SPARC T5-2 Server
SPARC T5-4/T5-8 Server
SPARC T5-1B Server Module
Netra SPARC T5-1B Server Module
SPARC M5-32/M6-32 Server
________________________________________



Date of Workaround Release: 9-Feb-2015

Date of Resolved Release: 16-Jun-2015
________________________________________

Description

Sun System Firmware 8.6.0.b, 8.6.0.c, and 9.3.0.b through 9.3.0.f may cause guest domains to be unresponsive on the destination server following a live migration.

Occurrence

This issue can occur on the following platforms:

SPARC Platform

  • SPARC T4-1 Servers with Firmware version 8.6.0.b (as delivered in patch 151682-01) or later
  • SPARC T4-2 Servers with Firmware version 8.6.0.b (as delivered in patch 151683-01) or later
  • SPARC T4-4 Servers with Firmware version 8.6.0.b (as delivered in patch 151684-01) or later
  • SPARC T4-1B Servers with Firmware version 8.6.0.b (as delivered in patch 151685-01) or later
  • Netra SPARC T4-1 Servers with Firmware version 8.6.0.b (as delivered in patch 151686-01) or later
  • Netra SPARC T4-2 Servers with Firmware version 8.6.0.b (as delivered in patch 151687-01) or later
  • Netra SPARC T4-1B Servers with Firmware version 8.6.0.b (as delivered in patch 151688-01) or later
  • SPARC T5-2 Servers with Firmware version 9.3.0.b (as delivered in patch 20034526)
  • SPARC T5-4/T5-8 Servers with Firmware version 9.3.0.b and 9.3.0.d (as delivered in patches 20034528 & 20229460)
  • SPARC T5-1B Servers with Firmware version 9.3.0.b (as delivered in patch 20034530)
  • Netra SPARC T5-1B Servers with Firmware version 9.3.0.b (as delivered in patch 20034531)
  • SPARC M5-32/M6-32 Servers with Firmware version 9.3.0.c and 9.3.0.f (as delivered in patches 20034532 & 20229461)

To determine the current firmware version on one of these servers, run the following command from the Oracle ILOM command-line interface (the "->" prompt):

    -> show /System system_fw_version

    /System
    Properties:
    system_fw_version = Sun System Firmware 9.2.0.b 2014/04/21 07:09
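
The version check above can be scripted when many servers need to be audited. The following is a minimal sketch, not an Oracle tool: the function names are hypothetical, and the affected set contains only the releases named explicitly in this alert's title and Description. It does not cover the "or later" wording used for the T4 platforms, so treat a negative result as a hint rather than proof.

```python
import re

# Releases named explicitly in this alert's title and Description.
# Assumption: "9.3.0.b through 9.3.0.f" means letters b, c, d, e and f.
NAMED_AFFECTED = {
    "8.6.0.b", "8.6.0.c",
    "9.3.0.b", "9.3.0.c", "9.3.0.d", "9.3.0.e", "9.3.0.f",
}


def extract_fw_version(ilom_output):
    """Return the version token (e.g. '9.2.0.b') from the output of
    'show /System system_fw_version', or None if no version line is found."""
    match = re.search(
        r"system_fw_version\s*=\s*Sun System Firmware\s+(\S+)", ilom_output
    )
    return match.group(1) if match else None


def is_named_affected(version):
    """True if the version is one of the releases named in this alert."""
    return version in NAMED_AFFECTED
```

Applied to the sample output shown above, extract_fw_version returns "9.2.0.b", which is not one of the named releases.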

Symptoms

After a live migration of a guest domain, the migrated domain may be unresponsive on the target system. In addition, CPU utilization may be high (see the example below), and it may not be possible to log in to the domain.

    # ldm list-domain ldom-1
    NAME             STATE      FLAGS   CONS    VCPU  MEMORY   UTIL  NORM  UPTIME
    ldom-1           active     -n----  5002    8     8G       100%  100%  2h 16m
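
To spot this symptom across several domains, the listing can be filtered for domains whose UTIL column is pinned at 100%. The sketch below is illustrative only: the function name and the threshold parameter are hypothetical, and it assumes the whitespace-separated column layout shown in this document (NAME, STATE, FLAGS, CONS, VCPU, MEMORY, UTIL, ...). High utilization alone does not prove that a domain is hung.

```python
def busy_domains(listing, threshold=100.0):
    """Return the names of domains whose UTIL value is at or above threshold.

    'listing' is the text printed by 'ldm list-domain' in the column
    layout shown in this document.
    """
    flagged = []
    for line in listing.splitlines():
        fields = line.split()
        # Rows with fewer than 7 fields (prompts, bound domains without a
        # UTIL value) cannot be evaluated and are skipped.
        if len(fields) < 7:
            continue
        util = fields[6]
        if not util.endswith("%"):
            continue  # header row or unexpected layout
        try:
            value = float(util.rstrip("%"))
        except ValueError:
            continue
        if value >= threshold:
            flagged.append(fields[0])
    return flagged
```

For the example listing above, the function returns a list containing only ldom-1.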

Workaround

The issue occurs because a particular register is not set during a migration. The workaround below forces that register to be set on all strands before a migration occurs.

To prevent this issue from occurring, execute the following steps prior to performing a live domain migration:

1. Log in to the control domain on the target host.

2. Start each guest at least once (if this has not already been done since power-on).

    # ldm start-domain existing-domain
      LDom existing-domain started

3. Determine the number of unused VCPUs:

    # ldm list-devices
    :
    :
    VCPU
    PID     %FREE  
    128     100    
    129     100    
    130     100    
    131     100    
    132     100    
    133     100    
    134     100    
    135     100    
    136     100    
    137     100    
    138     100    
    139     100    
    140     100
    :
    :
    :
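
Counting the free entries by eye is error-prone on large systems. As a rough sketch, the VCPU section of the listing can be parsed as follows. The function name is hypothetical, and the parser assumes the layout shown above: a "VCPU" heading, a "PID %FREE" column header, then one row per strand. Lines consisting of ":" are treated as elision markers and ignored.

```python
def free_vcpu_ids(listing):
    """Return the PIDs shown with %FREE equal to 100 in the VCPU section
    of 'ldm list-devices' output."""
    free = []
    in_vcpu = False
    for line in listing.splitlines():
        fields = line.split()
        if not fields:
            continue
        if fields[0] == "VCPU":
            in_vcpu = True
            continue
        if not in_vcpu:
            continue
        if fields[0] in ("PID", ":"):
            continue  # column header or elision marker
        if len(fields) >= 2 and fields[0].isdigit() and fields[1].isdigit():
            if int(fields[1]) == 100:
                free.append(int(fields[0]))
        else:
            # Any other line (for example the next section heading)
            # ends the VCPU table.
            in_vcpu = False
    return free
```

The length of the returned list is the number of unused VCPUs. For the excerpt above, which is itself truncated, the function returns the thirteen PIDs 128 through 140.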

4. Execute the following commands as 'root' to create a domain containing the unused VCPUs:

    # ldm add-domain max-vcpus

    # ldm add-vcpu 128 max-vcpus

    # ldm add-memory 128G max-vcpus

    # ldm bind-domain max-vcpus

    # ldm start-domain max-vcpus
    LDom max-vcpus started

    # ldm list-domain
    NAME             STATE      FLAGS   CONS    VCPU  MEMORY   UTIL  NORM  UPTIME
    primary          active     -n-cv-  UART    128   128G     0.0%  0.0%  7m
    max-vcpus        active     -t----  5000    128   128G     0.8%  0.8%  2m
    #

    # telnet localhost 5000
    :
    :
    {0} ok 
    telnet> quit
    Connection to localhost closed.
    #

    # ldm stop-domain max-vcpus
    Remote graceful shutdown or reboot capability is not available on max-vcpus
    LDom max-vcpus stopped

    # ldm list-domain
    NAME             STATE      FLAGS   CONS    VCPU  MEMORY   UTIL  NORM  UPTIME
    primary          active     -n-cv-  UART    128   128G     0.0%  0.0%  9m
    max-vcpus        bound      ------  5000    128   128G                 
    #

    # ldm unbind max-vcpus

    # ldm remove-domain max-vcpus
    #

5. Perform the domain migration.
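
The ldm commands from step 4 can be generated for a given VCPU count and memory size, for example to paste into a runbook. This is a sketch under stated assumptions: the function name is hypothetical, it only builds command strings and does not execute anything, and it omits the console check (the telnet session) shown in step 4. The commands themselves are the ones used in the transcript above.

```python
def scratch_domain_commands(vcpu_count, memory, name="max-vcpus"):
    """Return the ldm commands from step 4, in order, for a temporary
    domain that holds the unused VCPUs.

    vcpu_count: number of unused VCPUs found in step 3
    memory:     memory size string as passed to 'ldm add-memory', e.g. '128G'
    name:       name of the temporary domain
    """
    if vcpu_count < 1:
        raise ValueError("vcpu_count must be at least 1")
    return [
        f"ldm add-domain {name}",
        f"ldm add-vcpu {vcpu_count} {name}",
        f"ldm add-memory {memory} {name}",
        f"ldm bind-domain {name}",
        f"ldm start-domain {name}",
        f"ldm stop-domain {name}",
        f"ldm unbind {name}",
        f"ldm remove-domain {name}",
    ]
```

Calling scratch_domain_commands(128, "128G") reproduces the command sequence of the transcript in step 4.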

This issue is addressed in the following releases:

  • SPARC T4-1 Servers with Firmware version 8.7.2.b (as delivered in patch 151743-01) or later
  • SPARC T4-2 Servers with Firmware version 8.7.2.b (as delivered in patch 151744-01) or later
  • SPARC T4-4 Servers with Firmware version 8.7.2.b (as delivered in patch 151745-01) or later
  • SPARC T4-1B Servers with Firmware version 8.7.2.b (as delivered in patch 151746-01) or later
  • Netra SPARC T4-1 Servers with Firmware version 8.7.2.b (as delivered in patch 151747-01) or later
  • Netra SPARC T4-2 Servers with Firmware version 8.7.2.b (as delivered in patch 151748-01) or later
  • Netra SPARC T4-1B Servers with Firmware version 8.7.2.b (as delivered in patch 151749-01) or later
  • SPARC T5-2 Servers with Firmware version 9.4.2.c (as delivered in patch 20214646) or later
  • SPARC T5-4/T5-8 Servers with Firmware version 9.4.2.c (as delivered in patch 20214648) or later
  • SPARC T5-1B Servers with Firmware version 9.4.2.c (as delivered in patch 20214649) or later
  • Netra SPARC T5-1B Servers with Firmware version 9.4.2.c (as delivered in patch 20214650) or later
  • SPARC M5-32/M6-32 Servers with Firmware version 9.4.2.d (as delivered in patch 20214652) or later

Note: There is an issue with Firmware 9.4.2.c and 9.4.2.d for the SPARC T5 and SPARC M5-32/M6-32 Servers. Please see Service Alert <Document:2027245.1> for more details.
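
Whether an installed release is at or beyond the fixed release listed above can be checked with a simple comparison. The sketch below is not an Oracle-provided method. It assumes that firmware versions order by their three numeric fields and then by the trailing letter, and the platform labels used as dictionary keys are hypothetical shorthand for the platform families in the list above. It does not take the separate issue with 9.4.2.c and 9.4.2.d into account.

```python
import re

# Minimum fixed releases as listed in this alert.
# The keys are hypothetical shorthand labels, not official product names.
FIXED_RELEASE = {
    "T4": "8.7.2.b",
    "T5": "9.4.2.c",
    "M5-32/M6-32": "9.4.2.d",
}


def version_key(version):
    """Turn a version such as '9.4.2.c' into a comparable tuple (9, 4, 2, 'c')."""
    match = re.fullmatch(r"(\d+)\.(\d+)\.(\d+)\.([a-z])", version)
    if match is None:
        raise ValueError("unrecognized firmware version: %r" % version)
    major, minor, micro, letter = match.groups()
    return (int(major), int(minor), int(micro), letter)


def has_fix(platform, version):
    """True if 'version' is at or beyond the fixed release for 'platform'.

    Assumption: versions compare field by field, numbers first, letter last.
    """
    return version_key(version) >= version_key(FIXED_RELEASE[platform])
```

For example, has_fix("T4", "8.6.0.c") is False, while has_fix("T5", "9.4.2.c") is True.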

Patches

<SUNPATCH:151743-01>
<SUNPATCH:151744-01>
<SUNPATCH:151745-01>
<SUNPATCH:151746-01>
<SUNPATCH:151747-01>
<SUNPATCH:151748-01>
<SUNPATCH:151749-01>
<SUNPATCH:20214646>
<SUNPATCH:20214648>
<SUNPATCH:20214649>
<SUNPATCH:20214650>
<SUNPATCH:20214652>

History

09-Feb-2015: Document released, status Workaround
13-Feb-2015: Updated the Description, Occurrence, and Workaround sections.
17-Mar-2015: Updated Product Field, Occurrence, and Workaround sections.
17-Mar-2015: Updated bugID field.
26-May-2015: Updated the Workaround section.
12-Jun-2015: Updated the Workaround section with available patches.
16-Jun-2015: Updated the Workaround section with available patches. State Resolved.
02-Jul-2015: Added Note to Workaround section regarding Alert 2027245.1

Internal Section: Comments:

This regression was caused by the putback for 18994335

Note: The bugID that was used to fix this issue was 19197738 and not 20142591 as originally intended.

Questions regarding this document should be addressed to
sunalertpublication_us_grp@oracle.com, with a copy to the
responsible engineer listed below.

Internal Contributor/Submitter: alex.aftandilian@oracle.com
Internal Eng Responsible Engineer: alex.aftandilian@oracle.com
Internal Services Knowledge Engineer: jeff.folla@oracle.com
Internal Eng Business Unit Group: Systems RPE
Internal Associated SRs: 3-10228205511, 3-10138312491, 3-10240357271, 3-10260887011, 3-10168494841,
3-10252586421, 3-9583479111, 3-10103333774, 3-10001489761, 3-10135396801, 3-10008607851, 3-10092890021,
3-10009194241, 3-9971194771, 3-9923744541, 3-9953833331
Internal Resolution Patches: 151743-01, 151744-01, 151745-01, 151746-01, 151747-01, 151748-01, 151749-01,
20214646, 20214648, 20214649, 20214650, 20214652

References


<NOTE:2027245.1> - SPARC T5 Series and SPARC M5-32/M6-32 Servers System Firmware Version 9.4.2.c and 9.4.2.d May Erroneously Disable CPUs

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.