Skip to content

openmpi 5.0.5 + lsf: The affinity file provided in LSB_AFFINITY_HOSTFILE could not be converted to a rankfile #13570

@puneet336

Description

@puneet336

Hi Team
I have compiler openmpi 5.0.5 and PRRTE 3.0.8 on RHEL 9.5 (Linux login1 5.14.0-503.38.1.el9_5.x86_64 #1 SMP PREEMPT_DYNAMIC Sun Apr 13 22:01:49 EDT 2025 x86_64 x86_64 x86_64 GNU/Linux
) with following options:

--with-slurm --with-lsf=/CHBS/apps/HPC/lsf/lsf10.1/10.1 --with-lsf-libdir=/CHBS/apps/HPC/lsf/lsf10.1/10.1/linux3.10-glibc2.17-x86_64/lib

but seems mpirun is unable to pick up node information from LSF:


[singhpuv@server1 ~]$ mpirun -np 1 /bin/hostname
--------------------------------------------------------------------------
The specified LSF affinity file contained a node (server1) that is not in your
allocation. We therefore cannot map a process rank to it. Please
check your allocation and affinity file to ensure the latter only
contains allocated nodes.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
The affinity file provided in LSB_AFFINITY_HOSTFILE could not be converted
to a rankfile:

  File:  /home/singhpuv/.lsbatch/1765772632.2986.hostAffinityFile

We cannot continue.
--------------------------------------------------------------------------
[singhpuv@server1 ~]$ echo $LSB_AFFINITY_HOSTFILE
/home/singhpuv/.lsbatch/1765772632.2986.hostAffinityFile
[singhpuv@glchst-st220045 ~]$ cat /home/singhpuv/.lsbatch/1765772632.2986.hostAffinityFile
server1 0
server2 0
[singhpuv@server1 ~]$ prte
prte       prted      prte_info  prterun
[singhpuv@server1 ~]$ prte_info |grep -i lsf
                          '--with-lsf=/CHBS/apps/HPC/lsf/lsf10.1/10.1'
                          '--with-lsf-libdir=/CHBS/apps/HPC/lsf/lsf10.1/10.1/linux3.10-glibc2.17-x86_64/lib'
                 MCA ess: lsf (MCA v2.1.0, API v3.0.0, Component v3.0.6)
                 MCA plm: lsf (MCA v2.1.0, API v2.0.0, Component v3.0.6)
                 MCA ras: lsf (MCA v2.1.0, API v2.0.0, Component v3.0.8)
[singhpuv@server1 ~]$

Please advice.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions