0 Replies Latest reply on Oct 30, 2017 2:18 AM by wangfei

    during node reboot IB SRP LUNs are inaccessible when Red Hat Cluster 7.x pacemaker starts which causes clustered VG and NFS resource fail

    wangfei

      Description of problem:

       

      When a node reboots, pacemaker starts before IB SRP LUNs are accessible which causes clustered VG and NFS resource residing on these LUNs fail, and we have to stop and start pacemaker service manually to bring those resources back online.

       

      Version-Release number of selected component (if applicable):

       

       

      OS:RHEL7.2(default kernel 3.10.0-327.el7.x86_64) with high-availability and resilient storage add-on

       

      IB HBA model:Mellanox ConnectX-3 VPI

       

      IB HBA driver:MLNX_OFED_LINUX-3.4-2.0.0.0

       

      IB HBA firmware:v2.40.7000

       

      Storage array: IBM FlashSystem-900

       

       

      How reproducible:

       

      always

       

       

      Steps to Reproduce:

      1.create two-node RHCS7.2 cluster framework

      2.create clustered volume group and logical volume/ Filesystem/NFS/VIP resource

      3.reboot one node

       

      Actual results:

      From system log we can see clearly that Infiniband SRP LUNs are still inaccessible when pacemaker service starts, as such Filesystem resource residing on clustered VG/LV and NFS resource all failed.

       

       

      Expected results:

      IB SRP LUNs should be accessible first before pacemaker starts, then Filesystem resource residing on clustered VG/LV and NFS resource can start successfully.

       

      Additional info:

      Host is accessing storage through Infiniband SRP(SCSI RDMA Protocol)