Hi Rama,
What OS, Kernel and driver version are you using? (modinfo mlx4_core | grep -i version).
Have you seen an followed documents:
HowTo Compile Linux Kernel for NVMe over Fabrics
HowTo Configure NVMe over Fabrics
What is the last trace generated in the messages file prior to crash?
Are you getting the same result with any jobs above 4 ? (IE: 5,6,7)
vender_err 87 reports a number of RNR NACK exceeding and terminate the QP. (receiver not ready (RNR) error).
Regards,
Sophie.