[quantization][DRAFT] Quantize Qwen3VLVisionAttention #497

Draft
stamalakhov wants to merge 1 commit into Samsung:main from stamalakhov:qwen_visual_attn_br

@stamalakhov stamalakhov commented Feb 17, 2026

This draft introduces a quantized version of Qwen3VLVisionAttention for debugging.

Output of python tico/quantization/wrapq/examples/qwen/quantize_qwen_vision_attn.py (int16):


┌───────────── Quantization Error Summary ─────────────
│ Mean |diff|: 0.000205
│ PEIR       : 0.041563 %
└──────────────────────────────────────────────────────
    ┌────────────────────────────────────────────┐
 5.8┤                                            │
    │                                       •••  │
 4.1┤                                            │
    │                              •••           │
 2.5┤                              •             │
    │                       •••                  │
 0.8┤                    ••••                    │
    │                •••••                       │
-0.8┤             ••••                           │
    │         •••••                              │
-2.5┤      • ••                                  │
    │  ••••                                      │
-4.1┤                                            │
    └┬──────────┬──────────┬─────────┬──────────┬┘
   -4.1       -1.6        0.8       3.3       5.8 

Quantized Circle model saved to /mnt/storage/slow_repos/VLM_TICO/TICO/qwen3vl_vision_attn.q.circle
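
For readers unfamiliar with the two summary figures, here is a minimal, dependency-free sketch of how numbers of this kind could be computed. It assumes that "Mean |diff|" is the mean absolute difference between the float and quantized outputs, and that PEIR is the peak absolute error divided by the range of the reference output, expressed in percent. The helper names (`mean_abs_diff`, `peir_percent`, `fake_quantize_int16`) and the sample values are hypothetical and are not taken from the TICO code base; the script in this PR may compute its metrics differently.

```python
def mean_abs_diff(reference, quantized):
    """Mean of |reference - quantized| over all elements."""
    if not reference or len(reference) != len(quantized):
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(r - q) for r, q in zip(reference, quantized))
    return total / len(reference)


def peir_percent(reference, quantized):
    """Peak absolute error relative to the reference value range, in percent.

    Assumption: PEIR = max|ref - quant| / (max(ref) - min(ref)).
    """
    if not reference or len(reference) != len(quantized):
        raise ValueError("inputs must be non-empty and of equal length")
    interval = max(reference) - min(reference)
    if interval == 0:
        raise ValueError("reference has zero range; PEIR is undefined")
    peak_error = max(abs(r - q) for r, q in zip(reference, quantized))
    return 100.0 * peak_error / interval


def fake_quantize_int16(values):
    """Symmetric per-tensor int16 quantize/dequantize (illustrative only)."""
    max_abs = max(abs(v) for v in values)
    if max_abs == 0:
        return list(values)
    scale = max_abs / 32767
    # Round to the nearest int16 level, clamp, then map back to float.
    return [max(-32768, min(32767, round(v / scale))) * scale for v in values]


if __name__ == "__main__":
    # Made-up sample values, not data from the PR.
    reference = [-4.1, -1.6, 0.0, 0.8, 3.3, 5.8]
    quantized = fake_quantize_int16(reference)
    print(f"Mean |diff|: {mean_abs_diff(reference, quantized):.6f}")
    print(f"PEIR       : {peir_percent(reference, quantized):.6f} %")
```

Under these assumptions, a PEIR well below 1 % means the worst-case error is small compared with the spread of the reference output, which agrees with the near-diagonal scatter plot above.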

TODO: bug fixing and clean-up.

TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

@stamalakhov stamalakhov self-assigned this Feb 17, 2026
@stamalakhov stamalakhov force-pushed the qwen_visual_attn_br branch 2 times, most recently from 50b4495 to 415f4ac Compare February 17, 2026 15:49