Skip to content

Conversation

@vanman-nguyen
Copy link

This PR adds the support for Eviden's communication library, UBCL (Unified Bxi Communication Layer), by the addition of a PML/UBCL and of an OSC/UBCL, thus enabling MPI communications through the BXI network.
Both components are currently maintained by us.

We are also currently working on joining the MTT efforts to continuously validate these components on our infrastructure.

Note: The PML does not yet support MPI_Session and the OSC does not yet support accelerators buffers. We plan on implementing these features in the future.

Co-authored-by: Florent GERMAIN <florent.germain@eviden.com>
Co-authored-by: Pierre LEMARINIER <pierre.lemarinier@eviden.com>
Co-authored-by: Antoine CAPRA <antoine.capra@eviden.com>
Co-authored-by: Emmanuel BRELLE <emmanuel.brelle@eviden.com>
Co-authored-by: Van Man NGUYEN <van-man.nguyen@eviden.com>
Co-authored-by: Julien DUPRAT <julien.duprat@eviden.com>
Co-authored-by: Tristan CALS <tristan.cals@eviden.com>
Co-authored-by: Anton DAUMEN <anton.daumen@eviden.com>
Co-authored-by: Alice CARIBONI <alice.cariboni@eviden.com>
Co-authored-by: François WELLENREITER <francois.wellenreiter@eviden.com>

Signed-off-by: Van Man NGUYEN <van-man.nguyen@eviden.com>
@hppritcha hppritcha self-requested a review October 28, 2025 16:05
@edgargabriel edgargabriel self-requested a review November 5, 2025 19:56
Copy link
Member

@edgargabriel edgargabriel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the code looks fundamentally fine with me, I was not able to test it, just read through some parts of it.

The one question that I have is that the pml component only seems to support CUDA buffers at the moment, is this correct (i.e. not the other accelerator components such as rocm or ze)? Is there something fundamental missing, or just a case of not having tested it with other GPUs?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants