0 Replies Latest reply on Mar 12, 2018 1:38 AM by pasokan

    IOR runtime error when compiled with Mellanox OpenMPI in an SR-IOV + KVM environment

    pasokan

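      IOR segfaults at startup when built with Mellanox OpenMPI 3.1.0rc2 in an SR-IOV + KVM environment. The backtrace points at PMPI_Comm_size(), called from main() at IOR.c:126. Build steps and full output: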

       

      [root@vcn01 C]# export PATH=/usr/mpi/gcc/openmpi-3.1.0rc2/bin:$PATH

      [root@vcn01 C]# export LD_LIBRARY_PATH=/usr/mpi/gcc/openmpi-3.1.0rc2/lib64:$LD_LIBRARY_PATH

      [root@vcn01 C]# export INCLUDE=/usr/mpi/gcc/openmpi-3.1.0rc2/include:$INCLUDE

      [root@vcn01 C]# gmake posix mpiio

      mpicc -o IOR IOR.o utilities.o parse_options.o \

          aiori-POSIX.o aiori-noMPIIO.o aiori-noHDF5.o aiori-noNCMPI.o \

           -lm

      mpicc -o IOR IOR.o utilities.o parse_options.o \

          aiori-POSIX.o aiori-MPIIO.o aiori-noHDF5.o aiori-noNCMPI.o \

           -lm

      [root@vcn01 C]# ./IOR

      --------------------------------------------------------------------------

      WARNING: No preset parameters were found for the device that Open MPI

      detected:

       

        Local host:            vcn01

        Device name:           mlx5_0

        Device vendor ID:      0x02c9

        Device vendor part ID: 4114

       

      Default device parameters will be used, which may result in lower

      performance.  You can edit any of the files specified by the

      btl_openib_device_param_files MCA parameter to set values for your

      device.

       

      NOTE: You can turn off this warning by setting the MCA parameter

            btl_openib_warn_no_device_params_found to 0.

      --------------------------------------------------------------------------

      [1520843760.041971] [vcn01:20166:0]            cpu.c:52   UCX  WARN  CPU does not support invariant TSC, time may be unstable

      [vcn01:20166:0] Caught signal 11 (Segmentation fault)

      ==== backtrace ====

      2 0x000000000006858c mxm_handle_error()  /var/tmp/OFED_topdir/BUILD/mxm-3.7.3111/src/mxm/util/debug/debug.c:641

      3 0x0000000000068adc mxm_error_signal_handler()  /var/tmp/OFED_topdir/BUILD/mxm-3.7.3111/src/mxm/util/debug/debug.c:616

      4 0x0000000000035250 killpg()  ??:0

      5 0x000000000005f2d0 PMPI_Comm_set_name()  ??:0

      6 0x000000000005f2d0 PMPI_Comm_size()  /var/tmp/OFED_topdir/BUILD/openmpi-3.1.0rc2/ompi/mpi/c/profile/pcomm_size.c:63

      7 0x0000000000402a2f main()  /mnt/lustre_client/pasokan/IOR-July12/src/C/IOR.c:126

      8 0x0000000000021b35 __libc_start_main()  ??:0

      9 0x00000000004027f9 _start()  ??:0

      ===================

      Segmentation fault

      [root@vcn01 C]#
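
The warning block in the output above says it can be turned off through the MCA parameter btl_openib_warn_no_device_params_found. A minimal sketch of one way to do that, using the OMPI_MCA_<name> environment variable form that Open MPI reads for MCA parameters. The run commands are left as comments because they assume the same IOR binary and an mpirun on PATH:

```shell
# Silence the "No preset parameters were found for the device" warning,
# as the warning text itself suggests. Open MPI picks up MCA parameters
# from environment variables named OMPI_MCA_<parameter_name>.
export OMPI_MCA_btl_openib_warn_no_device_params_found=0

# Show what was set, so it is easy to confirm before the next run.
echo "btl_openib_warn_no_device_params_found=${OMPI_MCA_btl_openib_warn_no_device_params_found}"

# Then rerun as before (assumes ./IOR from the build above):
#   ./IOR
#
# Equivalent one-off form on the mpirun command line:
#   mpirun --mca btl_openib_warn_no_device_params_found 0 -np 1 ./IOR
```

This only hides the warning. It is not expected to change the segmentation fault, which is reported separately in PMPI_Comm_size(). One thing visible in the log that may be worth checking (an assumption, not a confirmed cause): gmake printed only the two link commands, so the .o files were not recompiled in this session and may have been built against a different MPI's headers.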