
[ISSUE]: Uninitialized ah_attr in ibgda_create_dct_shared_objects leads to potential EINVAL #21

@jcao219

Description


struct ibv_ah_attr ah_attr;

Within the ibgda_create_dct_shared_objects function, the struct ibv_ah_attr ah_attr variable appears to be used without being initialized.

Specifically, the problematic field is ah_attr.static_rate, which ends up containing a garbage value. This uninitialized data is later passed to ibv_create_ah, which I've seen fail with EINVAL (Invalid argument) on a RoCE setup.
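
One possible fix is to zero the struct before setting individual fields, so that anything not set explicitly (such as static_rate) is 0 rather than stack garbage. Below is a minimal sketch of that pattern. Note that demo_ah_attr and demo_fill_ah_attr are hypothetical stand-ins for illustration only: the real struct ibv_ah_attr is defined in <infiniband/verbs.h> and has more fields, and this is not the actual code in ibgda_create_dct_shared_objects.

```c
#include <stdint.h>
#include <string.h>

/* Simplified, hypothetical stand-in for struct ibv_ah_attr.
 * The real definition lives in <infiniband/verbs.h>. */
struct demo_ah_attr {
    uint16_t dlid;
    uint8_t  sl;
    uint8_t  src_path_bits;
    uint8_t  static_rate;
    uint8_t  is_global;
    uint8_t  port_num;
};

/* Hypothetical helper: zero the whole struct first, then set only the
 * fields the caller cares about. Every other field is left at 0. */
void demo_fill_ah_attr(struct demo_ah_attr *attr, uint8_t port_num)
{
    memset(attr, 0, sizeof(*attr));
    attr->is_global = 1;
    attr->port_num = port_num;
}
```

In the real function, the equivalent change would presumably be a memset(&ah_attr, 0, sizeof(ah_attr)) (or an `= {0}` initializer) at the point of declaration, assuming no code path depends on the previous contents.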

Steps to Reproduce

  1. Build nvshmem with IBGDA transport enabled.
  2. Run a simple application that calls nvshmem_init, such as the hello.cpp example, with NVSHMEM_IB_ENABLE_IBGDA=1 set in the environment.

Labels: bug