Quantcast
Channel: Mellanox Interconnect Community: Message List
Viewing all articles
Browse latest Browse all 6211

Modify QP error (HCA reset)

$
0
0

I'm having an issue as of yesterday with a system that has a 40GB dual port daughter card in it.  The network connections for the 2 ports are showing disconnected.  I'm in a home lab with a single unmanaged switched running 2 instances of OpenSM on two separate servers.

 

The error I'm getting is in the event viewer and it spams repeated until I stop the OpenSM service on the host.

 

Mellanox ConnectX-2 IPoIB Adapter device reports a "Modify QP error" on qpn #0x58 Status #0xffffffea. Therefore, the HCA Nic will be reset. (The issue is reported in Function CMcast::CompleteJoinMcastWi).

 

My other 4 40GB IB cards are functioning properly and some of the things I've tried:

 

1. Restart the OpenSM service on both hosts

2. reset the daughter card

3. tried a different set of cables

4. reset the switch

5. reinstall the device drivers (4.90)

6. compared the advanced settings in the driver to the other daughter cards on another host

 

I've attached a snapshot and would appreciate any help.

 

Thanks


Viewing all articles
Browse latest Browse all 6211

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>