Skip to content

[quantization][draft] Quantization of Llama#492

Draft
stamalakhov wants to merge 1 commit intoSamsung:mainfrom
stamalakhov:quant_full_model_PR
Draft

[quantization][draft] Quantization of Llama#492
stamalakhov wants to merge 1 commit intoSamsung:mainfrom
stamalakhov:quant_full_model_PR

Conversation

@stamalakhov
Copy link
Contributor

This PR quantizes the full LLama model and converts it to circle format.

Draft: #436
TICO-DCO-1.0-Signed-off-by: s.malakhov s.malakhov@partner.samsung.com

@stamalakhov stamalakhov self-assigned this Feb 13, 2026
@stamalakhov stamalakhov marked this pull request as draft February 13, 2026 10:38
@stamalakhov
Copy link
Contributor Author

@mhs4670go
Should i provide tests for it and/or split in smaller PRs?

This PR quantizes the full `LLama` model and converts it to circle format.

TICO-DCO-1.0-Signed-off-by: s.malakhov <s.malakhov@partner.samsung.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant