13 Replies Latest reply on Aug 9, 2013 6:42 AM by benmiller

    Kernel Modules from Mellanox OFED Stack Won't Load

      I am running a freshly installed RHEL 6.4 server with kernel 2.6.32-358.11.1. I have installed the Mellanox OFED stack by downloading, running mlnx_add_kernel_support.sh -m ./ --make-tgz, and running mlnxofedinstall from the newly created .tgz. The resulting drivers will not load properly. Here is what I have so far:

       

      [root@mdarisnfs01 tmp]# hca_self_test.ofed

       

       

      ---- Performing Adapter Device Self Test ----

      Number of CAs Detected ................. 1

      PCI Device Check ....................... PASS

      Kernel Arch ............................ x86_64

      Host Driver Version .................... MLNX_OFED_LINUX-2.0-2.0.5 (OFED-2.0-2.0.5): 2.6.32-358.11.1.el6.x86_64

      Host Driver RPM Check .................. PASS

      Firmware on CA #0 HCA .................. v2.7.0

      Firmware Check on CA #0 (HCA) .......... NA

          REASON: NO required fw version

      Host Driver Initialization ............. FAIL

      Number of CA Ports Active .............. NA

      Error Counter Check .................... NA

      Kernel Syslog Check .................... NA

      Node GUID on CA #0 (HCA) ............... NA

      ------------------ DONE ---------------------

       

      [root@mdarisnfs01 tmp]# cat hca_self_test_modprobe.output

      WARNING: Error inserting ib_core (/lib/modules/2.6.32-358.11.1.el6.x86_64/extra/mlnx-ofa_kernel/drivers/infiniband/core/ib_core.ko): Invalid module format

      WARNING: Error inserting ib_mad (/lib/modules/2.6.32-358.11.1.el6.x86_64/extra/mlnx-ofa_kernel/drivers/infiniband/core/ib_mad.ko): Invalid module format

      WARNING: Error inserting ib_sa (/lib/modules/2.6.32-358.11.1.el6.x86_64/extra/mlnx-ofa_kernel/drivers/infiniband/core/ib_sa.ko): Invalid module format

      WARNING: Error inserting ib_cm (/lib/modules/2.6.32-358.11.1.el6.x86_64/extra/mlnx-ofa_kernel/drivers/infiniband/core/ib_cm.ko): Invalid module format

      FATAL: Error inserting ib_ipoib (/lib/modules/2.6.32-358.11.1.el6.x86_64/extra/mlnx-ofa_kernel/drivers/infiniband/ulp/ipoib/ib_ipoib.ko): Invalid module format

       

      [root@mdarisnfs01 tmp]# dmesg | tail -n 5

      compat: exports duplicate symbol __pskb_copy (owned by kernel)

      compat: exports duplicate symbol __pskb_copy (owned by kernel)

      compat: exports duplicate symbol __pskb_copy (owned by kernel)

      compat: exports duplicate symbol __pskb_copy (owned by kernel)

      compat: exports duplicate symbol __pskb_copy (owned by kernel)

       

       

      [root@mdarisnfs01 tmp]# uname -a

      Linux mdarisnfs01.mdanderson.org 2.6.32-358.11.1.el6.x86_64 #1 SMP Wed May 15 10:48:38 EDT 2013 x86_64 x86_64 x86_64 GNU/Linux

       

      [root@mdarisnfs01 tmp]# rpm -qa | grep -i mlnx

      opensm-devel-4.0.0.MLNX20130311.156f5c0-0.1.x86_64

      librdmacm-devel-1.0.17mlnx1-OFED.2.0.0.1.4.20130226.1156.g0c5d582.x86_64

      libmlx4-1.0.4mlnx1-OFED.2.0.0.1.8.20130311.1052.g57dd6ea.x86_64

      libibverbs-devel-static-1.1.6mlnx1-OFED.2.0.0.1.8.20130311.0904.g90c09c6.x86_64

      libibmad-devel-1.3.9.MLNX20130311.0cae028-0.1.x86_64

      libibumad-devel-1.3.8.MLNX20130311.0a67c01-0.1.x86_64

      librdmacm-utils-1.0.17mlnx1-OFED.2.0.0.1.4.20130226.1156.g0c5d582.x86_64

      infiniband-diags-1.6.1.MLNX20130311.21d799f-0.1.x86_64

      libibcm-1.0.5mlnx1-OFED.2.0.0.0.9.20130210.1800.gc8011c5.x86_64

      opensm-static-4.0.0.MLNX20130311.156f5c0-0.1.x86_64

      libibverbs-devel-1.1.6mlnx1-OFED.2.0.0.1.8.20130311.0904.g90c09c6.x86_64

      libibmad-1.3.9.MLNX20130311.0cae028-0.1.x86_64

      opensm-libs-4.0.0.MLNX20130311.156f5c0-0.1.x86_64

      libmlx4-devel-1.0.4mlnx1-OFED.2.0.0.1.8.20130311.1052.g57dd6ea.x86_64

      libibumad-1.3.8.MLNX20130311.0a67c01-0.1.x86_64

      librdmacm-1.0.17mlnx1-OFED.2.0.0.1.4.20130226.1156.g0c5d582.x86_64

      srptools-0.0.4mlnx3-OFED.2.0.0.2.6.20130407.1400.g028ed29.x86_64

      libibverbs-utils-1.1.6mlnx1-OFED.2.0.0.1.8.20130311.0904.g90c09c6.x86_64

      libibmad-static-1.3.9.MLNX20130311.0cae028-0.1.x86_64

      libibverbs-1.1.6mlnx1-OFED.2.0.0.1.8.20130311.0904.g90c09c6.x86_64

      libibumad-static-1.3.8.MLNX20130311.0a67c01-0.1.x86_64

      infiniband-diags-compat-1.6.1.MLNX20130311.21d799f-0.1.x86_64

      libibcm-devel-1.0.5mlnx1-OFED.2.0.0.0.9.20130210.1800.gc8011c5.x86_64

      opensm-4.0.0.MLNX20130311.156f5c0-0.1.x86_64

      mlnxofed-docs-2.0-2.0.5.noarch

       

      [root@mdarisnfs01 tmp]# rpm -qa | grep kernel

      kernel-ib-2.0-2.6.32_358.11.1.el6.x86_64_OFED.2.0.2.0.5.g1593535.x86_64

      libreport-plugin-kerneloops-2.0.9-15.el6.x86_64

      kernel-headers-2.6.32-358.11.1.el6.x86_64

      kernel-ib-devel-2.0-2.6.32_358.11.1.el6.x86_64_OFED.2.0.2.0.5.g1593535.x86_64

      abrt-addon-kerneloops-2.0.8-15.el6.x86_64

      kernel-devel-2.6.32-358.11.1.el6.x86_64

      kernel-firmware-2.6.32-358.11.1.el6.noarch

      dracut-kernel-004-303.el6.noarch

      kernel-mft-3.0.0-2.6.32_358.11.1.el6.x86_64.x86_64

      kernel-2.6.32-358.11.1.el6.x86_64

       

      I am at a bit of a loss and any help would be appreciated