Allow overriding gfx version per-node by AdamNiederer · Pull Request #104 · ROCm/ROCT-Thunk-Interface

AdamNiederer · 2024-06-04T23:16:39Z

This lets you run two unsupported-but-really-supported cards of different architecture together in the same program. Works great w/ llama.cpp on my 7900XT + 6600; I'm seeing a 72% perf uplift running LLaMA3-70B Q2 across the two cards (7.4 vs 4.3 tok/s).

Example usage (device 0 is RDNA3, device 1 is RDNA2):

HSA_OVERRIDE_GFX_VERSION_1="11.0.0" HSA_OVERRIDE_GFX_VERSION_2="10.3.0" ollama serve

This lets you run two unsupported-but-really-supported cards of different architecture together in the same program. Works great w/ llama.cpp on my 7900XT + 6600. Example usage (device 0 is RDNA3, device 1 is RDNA2): HSA_OVERRIDE_GFX_VERSION_1="11.0.0" HSA_OVERRIDE_GFX_VERSION_2="10.3.0" ollama serve

kentrussell · 2024-06-05T14:53:15Z

src/topology.c

This is cleaner than the strcat, I think:
snprintf(per_device_override_name, sizeof(per_device_override_name), "HSA_OVERRIDE_GFX_VERSION_%d", node_id);

I'll test this out internally and if it works (and doesn't break any other flows), I'll get it pushed out. Hopefully for 6.2

That's much better, thank you! Changed in da8055d. And thanks for giving it a spin internally!

kentrussell · 2024-06-12T15:50:58Z

Internal testing looks good, we'll try to get this released in ROCm 6.2. Thanks for your contribution!

AdamNiederer force-pushed the per-node-overrides branch from fa07f70 to 7d5f503 Compare June 4, 2024 23:19

AdamNiederer force-pushed the per-node-overrides branch from 7d5f503 to 8135a1c Compare June 4, 2024 23:20

AdamNiederer mentioned this pull request Jun 4, 2024

[Issue]: Proble with HSA_OVERRIDE_GFX_VERSION ROCm/ROCm#3065

Closed

kentrussell reviewed Jun 5, 2024

View reviewed changes

Clean up names & eliminate unneeded strcat

da8055d

kentrussell force-pushed the master branch 2 times, most recently from 56a16c2 to 06a9b94 Compare December 6, 2024 14:38

headcr4sh mentioned this pull request Mar 10, 2025

HSA_OVERRIDE_GFX_VERSION_0 while running on only one GPU ollama/ollama#8473

Open

This was referenced Mar 29, 2025

HSA_OVERRIDE_GFX_VERSION for individual GPU's required ollama/ollama#10033

Open

HSA_OVERRIDE_GFX_VERSION for individual GPU's ollama/ollama#10034

Closed

kentrussell mentioned this pull request Jul 21, 2025

GFX overrides targetted to a GPU ROCm/ROCm#5069

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow overriding gfx version per-node#104

Allow overriding gfx version per-node#104
AdamNiederer wants to merge 2 commits intoROCm:masterfrom
AdamNiederer:per-node-overrides

AdamNiederer commented Jun 4, 2024 •

edited

Loading

Uh oh!

kentrussell Jun 5, 2024

Uh oh!

AdamNiederer Jun 6, 2024

Uh oh!

kentrussell commented Jun 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AdamNiederer commented Jun 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kentrussell Jun 5, 2024

Choose a reason for hiding this comment

Uh oh!

AdamNiederer Jun 6, 2024

Choose a reason for hiding this comment

Uh oh!

kentrussell commented Jun 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

AdamNiederer commented Jun 4, 2024 •

edited

Loading