WIP: AMD Support and Watchdog by gmlwns2000 · Pull Request #93 · DeepAuto-AI/hip-attention

gmlwns2000 · 2025-10-14T13:53:21Z

WIP: AMD Support and Watchdog

kbumsik · 2025-10-16T08:50:27Z

configs/mixed_landmark_0814_no_extend_qsa.json

    "using_extend": false,
    "dense_layers": [0, 1, 2, 47, 46, 45],
-    "mask_refresh_interval": [96],
+    "mask_refresh_interval": [96, 32, 16],
    "layers": [
        {
            "sliding_window_size": 1024,
            "sliding_window_size_for_masking_step": [1024, 1024, 1024],
-            "second_stage_k": 1024,
+            "second_stage_k": 2048,
            "sink_token_size": 1024,
            "sa_extend_backend": "self_extend",
-            "stages": [ { } ]
+            "stages": [
+                {
+                    "stage_block_size_q":128,
+                    "stage_block_stride_q":4,
+                    "stage_chunk_size":256,
+                    "stage_k":null,
+                    "stage_stride":1,
+                    "using_landmark":false
+                },
+                {
+                    "stage_block_size_q":64,
+                    "stage_block_stride_q":1,
+                    "stage_chunk_size":32,
+                    "stage_k":65536,
+                    "stage_stride":1,
+                    "using_landmark":false
+                },
+                {
+                    "stage_block_size_q":64,
+                    "stage_block_stride_q":1,
+                    "stage_chunk_size":8,
+                    "stage_k":8192,
+                    "stage_stride":1,
+                    "using_landmark":false
+                }
+            ]
        },
        {
            "sliding_window_size": 1024,
            "sliding_window_size_for_masking_step": [1024, 1024, 1024],
-            "second_stage_k": 1024,
+            "second_stage_k": 2048,
            "sink_token_size": 1024,
            "sa_extend_backend": "self_extend",
            "scan_extend_backend": "none",
-            "stages": [ { } ]
+            "stages": [
+                {
+                    "stage_block_size_q":128,
+                    "stage_block_stride_q":4,
+                    "stage_chunk_size":256,
+                    "stage_k":null,
+                    "stage_stride":1,
+                    "using_landmark":false
+                },
+                {
+                    "stage_block_size_q":64,
+                    "stage_block_stride_q":1,
+                    "stage_chunk_size":32,
+                    "stage_k":65536,
+                    "stage_stride":1,
+                    "using_landmark":false
+                },
+                {
+                    "stage_block_size_q":64,
+                    "stage_block_stride_q":1,
+                    "stage_chunk_size":8,
+                    "stage_k":8192,
+                    "stage_stride":1,
+                    "using_landmark":false
+                }
+            ]
        }
    ],
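
In this diff, `mask_refresh_interval` grows from one entry to three, and `stages` grows to three entries per layer. That matches the three entries already in `sliding_window_size_for_masking_step`. Below is a minimal sketch of a consistency check for that pattern. The one-entry-per-stage rule is an assumption inferred from this diff, not something the library is confirmed to enforce. The helper `check_stage_lengths` is hypothetical, and the sample config is trimmed to a few fields from the diff.

```python
from typing import Any


def check_stage_lengths(config: dict[str, Any]) -> list[str]:
    """Return readable problems; an empty list means the lengths agree.

    Assumption (not confirmed by the source): each per-stage list should
    have exactly one entry per element of a layer's "stages" list.
    """
    problems: list[str] = []
    refresh = config.get("mask_refresh_interval", [])
    for idx, layer in enumerate(config.get("layers", [])):
        n_stages = len(layer.get("stages", []))
        per_stage = {
            "mask_refresh_interval": refresh,
            "sliding_window_size_for_masking_step": layer.get(
                "sliding_window_size_for_masking_step", []
            ),
        }
        for name, values in per_stage.items():
            if len(values) != n_stages:
                problems.append(
                    f"layer {idx}: {name} has {len(values)} entries, "
                    f"expected {n_stages}"
                )
    return problems


# Trimmed-down config using values from the "+" side of the diff.
NEW_CONFIG = {
    "mask_refresh_interval": [96, 32, 16],
    "layers": [
        {
            "sliding_window_size_for_masking_step": [1024, 1024, 1024],
            "second_stage_k": 2048,
            "stages": [
                {"stage_chunk_size": 256, "stage_k": None},
                {"stage_chunk_size": 32, "stage_k": 65536},
                {"stage_chunk_size": 8, "stage_k": 8192},
            ],
        }
    ],
}

# Deliberately inconsistent variant: three stages, one refresh interval.
BROKEN_CONFIG = {**NEW_CONFIG, "mask_refresh_interval": [96]}

if __name__ == "__main__":
    print(check_stage_lengths(NEW_CONFIG))
    print(check_stage_lengths(BROKEN_CONFIG))
```

To run the same check on the real file, load `configs/mixed_landmark_0814_no_extend_qsa.json` with `json.load` and pass the result in place of `NEW_CONFIG`.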


I suppose it is used for qwen3.

So does this PR also fix our qwen3 + NVIDIA case? To me, this PR can't be ignored even if we don't need AMD.

gmlwns2000 added 11 commits October 5, 2025 12:59

add watchdog :D

53b800e

fix watchdog

beeac38

fix watchdog

744f5ee

fix

b69ee21

fix

4b434d7

fix

97cf3ca

fix

b94b511

fix

c17e4d9

fix

fee868e

fix

3685169

watchdog bug fix

b0568c6

kbumsik requested changes Oct 16, 2025

gmlwns2000 added 2 commits November 3, 2025 02:12

add benchmark

618c151

fix

4bdc62c

WIP: AMD Support and Watchdog #93
gmlwns2000 wants to merge 13 commits into deepauto/dev from feat/amd

2 participants
