@kpjeeja kpjeeja commented Jan 9, 2026

Optimization of CPU hotspots with Progress/Control threads

    - This patch adds an adaptive sleep to the progress thread and the
    control thread, replacing the while loop with a 0 ms timeout. With
    that loop, if no CQ completion arrives, the progress thread keeps
    spinning. The same applies to the CM thread.
    - Add CQ burst read instead of reading one completion at a time.
    - On EAGAIN for send/write/read, add an adaptive sleep delay to
    avoid a delay of some ms.

Signed-off-by: Jeeja KP <[email protected]>
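
The adaptive sleep and burst read described above can be sketched roughly as follows. This is a minimal illustration, not the code in this patch: `AdaptiveBackoff`, `read_burst`, `progress_loop`, and all thresholds are hypothetical names and values, and the completion queue is simulated with a `collections.deque`. In libfabric the burst read would correspond to passing a count greater than 1 to `fi_cq_read`.

```python
import collections
import time


class AdaptiveBackoff:
    """Hypothetical helper: spin briefly, then sleep with exponential growth."""

    def __init__(self, min_sleep_us=1, max_sleep_us=1000, spin_limit=100):
        self.min_sleep_us = min_sleep_us
        self.max_sleep_us = max_sleep_us
        self.spin_limit = spin_limit      # idle polls tolerated before sleeping
        self.idle_polls = 0
        self.sleep_us = min_sleep_us

    def next_delay_us(self, did_work):
        """Return the sleep time in microseconds to apply after one poll."""
        if did_work:
            # Any completion resets the backoff so latency stays low under load.
            self.idle_polls = 0
            self.sleep_us = self.min_sleep_us
            return 0
        self.idle_polls += 1
        if self.idle_polls <= self.spin_limit:
            return 0                      # keep polling hot for a short while
        delay = self.sleep_us
        # Double the next sleep, capped at max_sleep_us.
        self.sleep_us = min(self.sleep_us * 2, self.max_sleep_us)
        return delay


def read_burst(cq, max_entries=16):
    """Drain up to max_entries completions from the simulated CQ in one call."""
    batch = []
    while cq and len(batch) < max_entries:
        batch.append(cq.popleft())
    return batch


def progress_loop(cq, should_stop, handle, backoff, sleep=time.sleep):
    """Poll the CQ in bursts and back off while it stays empty."""
    while not should_stop():
        batch = read_burst(cq)
        for completion in batch:
            handle(completion)
        delay_us = backoff.next_delay_us(bool(batch))
        if delay_us:
            sleep(delay_us / 1_000_000)
```

The same helper could plausibly wrap a post that returns `-FI_EAGAIN`: treat a failed post as "no work", sleep for the returned delay, and retry, so the wait grows only while the provider stays busy.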

tsg- and others added 15 commits October 8, 2025 17:14
- Enable smart auto-detection of device types
- Dual-path memory registration (GDR, FI_HMEM)
- Device type as custom param (priority: envvar, backend param,
  auto-detect)
Use the unique device list to resolve double free

Signed-off-by: Jerome Anand <[email protected]>
Signed-off-by: Tushar Gohad <[email protected]>
This fixes a bug in multi-descriptor transfers where descriptors
point to different offsets within the same registered memory region.

Without this fix, RDMA reads always target offset 0. The fix extracts
each descriptor's specific target address instead.

Also impacted: block-based transfers (iteration N would read blocks
from iteration 0, etc.), partial buffer updates, etc.

Signed-off-by: Tushar Gohad <[email protected]>
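
The per-descriptor address resolution described in this commit might look roughly like the sketch below. `Region`, `Descriptor`, and `remote_target` are hypothetical names for illustration, not NIXL or libfabric APIs. The `virt_addr` flag reflects an assumption about the provider's memory-registration mode: with `FI_MR_VIRT_ADDR` the remote target is the virtual address, otherwise it is the offset from the start of the registered region.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """Hypothetical registered memory region on the remote side."""
    base_addr: int
    length: int
    key: int


@dataclass(frozen=True)
class Descriptor:
    """Hypothetical transfer descriptor pointing somewhere inside a Region."""
    addr: int
    length: int


def remote_target(region, desc, virt_addr=True):
    """Return (target_address, key) for one descriptor.

    The buggy behavior amounts to always returning the region start
    (offset 0); here the target is derived from the descriptor itself.
    """
    offset = desc.addr - region.base_addr
    if offset < 0 or offset + desc.length > region.length:
        raise ValueError("descriptor falls outside the registered region")
    target = desc.addr if virt_addr else offset
    return target, region.key
```

Validating the offset before posting keeps an out-of-range descriptor from turning into a remote access error later.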
- Create 16 descriptors at different 64KB offsets in a 1MB buffer
- Attempt RDMA transfers
- Add libfabric backend to nixl test
- Add libfabric Gaudi->Gaudi transfer test

Signed-off-by: Jeeja Kp <[email protected]>
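
The test layout mentioned above (16 descriptors at 64KB offsets in a 1MB buffer) can be approximated without hardware. The sketch below is an assumption-laden simulation, not the actual nixl test: the RDMA read is replaced by a slice copy between two `bytearray` buffers, and every function name is hypothetical. The `honor_offset=False` path mimics the offset-0 bug so the check can be seen to fail.

```python
BUFFER_SIZE = 1 << 20                   # 1 MB buffer
BLOCK_SIZE = 64 * 1024                  # 64 KB per descriptor
NUM_DESCS = BUFFER_SIZE // BLOCK_SIZE   # 16 descriptors


def make_descriptors():
    """Return (offset, length) pairs, one per 64 KB block."""
    return [(i * BLOCK_SIZE, BLOCK_SIZE) for i in range(NUM_DESCS)]


def fill_pattern(buf):
    """Give every block a distinct byte value so misplaced reads are visible."""
    for i in range(NUM_DESCS):
        start = i * BLOCK_SIZE
        buf[start:start + BLOCK_SIZE] = bytes([i + 1]) * BLOCK_SIZE


def simulated_read(src, dst, descs, honor_offset=True):
    """Copy each descriptor's block; honor_offset=False always reads offset 0."""
    for offset, length in descs:
        src_off = offset if honor_offset else 0
        dst[offset:offset + length] = src[src_off:src_off + length]


def mismatched_blocks(src, dst):
    """Return indices of blocks whose contents differ between src and dst."""
    return [
        i for i in range(NUM_DESCS)
        if src[i * BLOCK_SIZE:(i + 1) * BLOCK_SIZE]
        != dst[i * BLOCK_SIZE:(i + 1) * BLOCK_SIZE]
    ]
```

A distinct pattern per block matters here: with a uniform fill, a read from offset 0 would be indistinguishable from a correct one.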
Add nixl test for Gaudi with libfabric backend
ported from ofi branch,
commit f70b5d4
Author: Chendi Xue <[email protected]>

- changed the branch name from v0.6.0_OFI to libfabric

Signed-off-by: Jeeja KP <[email protected]>
@kpjeeja kpjeeja closed this Jan 9, 2026

3 participants