
[quantization] Introduce a wrapper for nn.Embedding #455

Merged: mhs4670go merged 1 commit into Samsung:main from stamalakhov:Embedding_PR on Feb 2, 2026

Conversation

@stamalakhov (Contributor) commented Feb 2, 2026

This commit introduces a wrapper for nn.Embedding.

./ccex test -k "quantization.wrapq.wrappers.nn.test_quant_embedding"
RUN unit tests with -k quantization.wrapq.wrappers.nn.test_quant_embedding ...
test_dtype_override (quantization.wrapq.wrappers.nn.test_quant_embedding.TestQuantEmbedding) ... ok
test_mode_transitions (quantization.wrapq.wrappers.nn.test_quant_embedding.TestQuantEmbedding) ... ok
test_quantised_output_close (quantization.wrapq.wrappers.nn.test_quant_embedding.TestQuantEmbedding) ... ok
test_weight_stats_survive (quantization.wrapq.wrappers.nn.test_quant_embedding.TestQuantEmbedding) ... ok

----------------------------------------------------------------------
Ran 4 tests in 0.005s

OK

Draft: #436
TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>
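
For context, here is a minimal, self-contained sketch of the behaviour that test_quantised_output_close plausibly verifies, written in plain PyTorch rather than against the wrapq API (the sizes, seed, and tolerance are illustrative assumptions): per-channel asymmetric uint8 quantization of the embedding weight, followed by a lookup comparison against the float weight.

import torch
import torch.nn.functional as F

torch.manual_seed(0)
vocab_size, inner_dim = 128, 16
weight = torch.randn(vocab_size, inner_dim)

# Asymmetric uint8 quantization with channel_axis=0:
# one (scale, zero_point) pair per vocabulary row.
w_min = weight.amin(dim=1, keepdim=True)           # (vocab_size, 1)
w_max = weight.amax(dim=1, keepdim=True)           # (vocab_size, 1)
scale = ((w_max - w_min) / 255.0).clamp_min(1e-8)  # (vocab_size, 1)
zero_point = torch.round(-w_min / scale)           # (vocab_size, 1)

q = torch.clamp(torch.round(weight / scale + zero_point), 0, 255)
deq = (q - zero_point) * scale                     # fake-quantized weight

ids = torch.randint(0, vocab_size, (4, 7))
print(torch.allclose(F.embedding(ids, weight),
                     F.embedding(ids, deq), atol=3e-2))  # True: close, not exact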

Review thread on the following snippet:

self.weight_obs = self._make_obs(
    "weight",
    qscheme=QScheme.PER_CHANNEL_ASYMM,  # tensorwise quantization breaks the model
    channel_axis=0,  # weight ~ (vocab_size, inner_dim) so we quantize by inner dimension so that scales ~ (1, vocab_size)
)
@mhs4670go (Contributor)
"inner dimension" is vocab_size here. Right?

@stamalakhov (Contributor, Author)
No. It's hidden_dim from the Llama config.

@stamalakhov (Contributor, Author)
@mhs4670go
I mean inner_dim is the dimension of the internal float representation.

@stamalakhov (Contributor, Author)
@mhs4670go
Do you mean the comment is wrong? The scales have shape ~ (1, vocab_size) for channel_axis = 0 (I've tested it).

@stamalakhov (Contributor, Author)
@mhs4670go
Finally I got it: you meant the "inner dimension" in "so we quantize by inner dimension so that ...". I was trying to underscore the fact that the scales will have shape (1, vocab_size), so the "by inner dimension" wording may be confusing. It can be changed to:

# weight ~ (vocab_size, inner_dim) so that scales ~ (1, vocab_size)

@stamalakhov (Contributor, Author)
@mhs4670go
Changed to a clearer comment.

@mhs4670go (Contributor)
Thank you! It's much clearer!
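
To pin down the point this thread settled on, a minimal sketch (plain PyTorch, not the wrapq observer API; sizes are arbitrary) of why channel_axis=0 on a (vocab_size, inner_dim) weight produces vocab_size scales:

import torch

vocab_size, inner_dim = 1000, 64
weight = torch.randn(vocab_size, inner_dim)

# channel_axis=0: min/max are reduced over every axis except axis 0,
# leaving one scale per vocabulary row.
scale = (weight.amax(dim=1) - weight.amin(dim=1)) / 255.0
print(scale.shape)  # torch.Size([1000]); storing these as (vocab_size,)
                    # or (1, vocab_size) is only a layout choice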

@stamalakhov requested a review from @mhs4670go on February 2, 2026 06:57
This commit introduces a wrapper for nn.Embedding.

TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>

@mhs4670go (Contributor) left a review:
LGTM

@mhs4670go merged commit bd9c9b5 into Samsung:main on Feb 2, 2026
7 checks passed
@stamalakhov deleted the Embedding_PR branch on February 2, 2026 07:57
@dvsav (Contributor) commented Feb 13, 2026

@stamalakhov Stas, it looks like tico.quantization.wrapq.wrappers.nn.quant_embedding is not registered in tico/quantization/wrapq/wrappers/registry.py, so trying to quantize torch.nn.Embedding still raises the exception "PTQQuantizer: no quantization wrapper for Embedding".

@stamalakhov (Contributor, Author)

Quoting: "trying to quantize torch.nn.Embedding still raises the exception 'PTQQuantizer: no quantization wrapper for Embedding'"

@dvsav
This is known and works as expected. As long as there are no clients of nn.Embedding, there is no point in touching registry.py.
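
If a client of nn.Embedding does appear later, and assuming registry.py holds a plain mapping from float module types to wrapper classes (the actual registry API is not shown in this thread, so every name below is hypothetical), registration would presumably be a one-line addition along these lines:

# Hypothetical sketch only; the real registry.py API may differ.
import torch.nn as nn
from tico.quantization.wrapq.wrappers.nn.quant_embedding import QuantEmbedding

MODULE_WRAPPERS = {
    # ... existing entries for other nn modules ...
    nn.Embedding: QuantEmbedding,  # lets PTQQuantizer resolve a wrapper for Embedding
}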
