Description of problem:
When a node reboots, pacemaker starts before IB SRP LUNs are accessible which causes clustered VG and NFS resource residing on these LUNs fail, and we have to stop and start pacemaker service manually to bring those resources back online.
Version-Release number of selected component (if applicable):
OS:RHEL7.2(default kernel 3.10.0-327.el7.x86_64) with high-availability and resilient storage add-on
IB HBA model:Mellanox ConnectX-3 VPI
IB HBA driver:MLNX_OFED_LINUX-3.4-220.127.116.11
IB HBA firmware:v2.40.7000
Storage array: IBM FlashSystem-900
Steps to Reproduce:
1.create two-node RHCS7.2 cluster framework
2.create clustered volume group and logical volume/ Filesystem/NFS/VIP resource
3.reboot one node
From system log we can see clearly that Infiniband SRP LUNs are still inaccessible when pacemaker service starts, as such Filesystem resource residing on clustered VG/LV and NFS resource all failed.
IB SRP LUNs should be accessible first before pacemaker starts, then Filesystem resource residing on clustered VG/LV and NFS resource can start successfully.
Host is accessing storage through Infiniband SRP(SCSI RDMA Protocol)