Sun Microsystems, Inc.  Oracle System Handbook - ISO 7.0 May 2018 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1678674.1
Update Date:2017-11-22
Keywords:

Solution Type  Problem Resolution Sure

Solution  1678674.1 :   Firmware Error when Executing bdacheckcluster or bdacheckhw Commands on Oracle Big Data Appliance  


Related Items
  • Big Data Appliance X5-2 Full Rack
  •  
  • Big Data Appliance Integrated Software
  •  
  • Big Data Appliance Hardware
  •  
Related Categories
  • PLA-Support>Eng Systems>BDA>Big Data Appliance>DB: BDA_EST
  •  




In this Document
Symptoms
Cause
Solution


Applies to:

Big Data Appliance Hardware - Version All Versions and later
Big Data Appliance Integrated Software - Version 2.2.1 and later
Big Data Appliance X5-2 Full Rack - Version All Versions and later
Linux x86-64

Symptoms

On Oracle Big Data Appliance(BDA), executing bdacheckcluster or bdacheckhw  fails with below error:

SUCCESS: Correct Host Channel Adapter model : Mellanox Technologies MT27500 Family [ConnectX-3]
-E- Can not open a0:00.0: Can not obtain Flash semaphore (63). You can run "flint -clear_semaphore -d <device>" to force semaphore unlock. See help for details.
ERROR: Wrong Host Channel Adapter firmware version :
INFO: Expected Host Channel Adapter firmware version : 2.9.1000 or 2.11.2010 or 2.11.1280

 

Cause

The likely cause is a previous failure that did not clean up behind itself.

Solution

Use flash memory operations tool(mstflint) to clear the flash semaphore on the device reported.

flint is a FW (firmware) burning and flash memory operations tool for Mellanox Infiniband HCAs, Ethernet NIC cards, and switch devices. Execute 'mstflint --help' command for details

-clear_semaphore   - Force clear the flash semaphore on the device.
                         No command is allowed when this flag is used.
                         NOTE: May result in system instability or flash corruption if the device or another application is currently using the flash. Exercise caution.

1) Check the error for the device reported

When the error contains 'Can not open a0:00.0:' , then the device name is /proc/bus/pci/a0/00.0

2)Force clear the flash semaphore on the device reported.

Syntax:-

mstflint -clear_semaphore -d  <device_name>

Sample Command

mstflint -clear_semaphore -d  /proc/bus/pci/a0/00.0

Attachments
This solution has no attachment
  Copyright © 2018 Oracle, Inc.  All rights reserved.
 Feedback