I picked up a couple ConnectX4 single-port HCAs second-hand that are showing an invalid GUID. They all are showing 0123 for GUID and hardware version of 0. I've updated the firmware and OFED to the latest. I'm not sure what to do next. Are these cards faulty? Any ideas?
transport: InfiniBand (0)
state: PORT_DOWN (1)
max_mtu: 4096 (5)
active_mtu: 4096 (5)
Querying Mellanox devices firmware ...
Device Type: ConnectX4
Part Number: MCX455A-ECA_Ax
Description: ConnectX-4 VPI adapter card; EDR IB (100Gb/s) and 100GbE; single-port QSFP28; PCIe3.0 x16; ROHS R6
PCI Device Name: /dev/mst/mt4115_pciconf0
Base GUID: 0000000000000123
Versions: Current Available
FW 12.22.1002 N/A
FW (Running) 12.16.0152 N/A
PXE 3.5.0403 N/A
UEFI 14.15.0019 N/A
Status: No matching image found
It is possible that someone reflashed HCA's with wrong GUID's. However the HCA's may be fine.
Install CentOS/RH7.5 (inbox driver should be fine, at least for testing)
Set the port type to desired value (see MFT user manual, mlxconfig command)
Connect hosts the with the EDR cable and if you are interested in checking IB, start opensm service
Configure interfaces and run 'ifuip <IFs> up'
Verify port state with ibv_devinfo