10 Replies Latest reply on Jan 8, 2014 9:55 AM by scotschultz

    mlnx-ofed-kernel installation failed!

      Hi ,

       

      Im currently working on project that need to use Mellanox Infiniband card for our GPUDirect research. We are trying to setup a machine using the Infiniband card we just received but we face a problem with driver installation where it failed to fully install the “mlnx-ofed-kernel-2.0. Hope we can get some help on the matter. Below shows the output we getting and some information about our machine.

       

      CHECKING DEVICE AVAILABILITY

      gpu1@gpu1-System-Product-Name:~$ lspci -v | grep Mellanox
      05:00.0 Network controller: Mellanox Technologies MT27500 Family [ConnectX-3]
          Subsystem: Mellanox Technologies Device 0050

      CHECKING UBUNTU VERSION

      gpu1@gpu1-System-Product-Name:~/Downloads/MLNX_OFED_LINUX-2.0-3.0.0-ubuntu12.04-x86_64$ uname -a
      Linux gpu1-System-Product-Name 3.8.0-34-generic #49~precise1-Ubuntu SMP Wed Nov 13 18:05:00 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux

      LAUCNHING THE INSTALLER

      gpu1@gpu1-System-Product-Name:~/Downloads/MLNX_OFED_LINUX-2.0-3.0.0-ubuntu12.04-x86_64$ sudo ./mlnxofedinstall
      [sudo] password for gpu1:
      Log: /tmp/ofed.build.log
      This program will install the MLNX_OFED_LINUX package on your machine.
      Note that all other Mellanox, OEM, OFED, or Distribution IB packages will be removed.
      Do you want to continue?[y/N]:y

      Checking SW Requirements...


      Checking for old packages...

      Removing old packages...

      Installing new packages

      Installing ofed-scripts-2.0...
      Running: /usr/bin/dpkg -i --force-confmiss /home/gpu1/Downloads/MLNX_OFED_LINUX-2.0-3.0.0-ubuntu12.04-x86_64/DEBS/ofed-scripts_2.0-1_amd64.deb
      Installing mlnx-ofed-kernel-2.0...
      Running: /usr/bin/dpkg -i --force-confnew --force-confmiss /home/gpu1/Downloads/MLNX_OFED_LINUX-2.0-3.0.0-ubuntu12.04-x86_64/DEBS/mlnx-ofed-kernel-dkms_2.0-OFED.2.0.2.6.9.6.g3a2d7bf_all.deb /home/gpu1/Downloads/MLNX_OFED_LINUX-2.0-3.0.0-ubuntu12.04-x86_64/DEBS/mlnx-ofed-kernel-utils_2.0-OFED.2.0.2.6.9.6.g3a2d7bf_amd64.deb

      mlnx-ofed-kernel installation failed!
      Removing newly installed packages...
      Running: /usr/sbin/ofed_uninstall.sh --force

      Please inform me if you need me to provide more data about this. attached is the logfile of the installer. Thank you.

       

      NOTES:

      • The output shows here is the second time i ran the installer, the first run of the installer show it install some other package required before proceed to install the ofed-script-2.0 and mlnx-ofed-kernel-2.0
      • The computer is freshly format with ubuntu 12.04. CUDA 5.5 and nvidia driver 331 is also been installed.


      Amirul,

      MIMOS Software Engineer