[ggma] Add documentation for TinyLlama example by glistening · Pull Request #16283 · Samsung/ONE

glistening · 2025-11-14T07:13:42Z

Created runtime/ggma/examples/generate_text/tinyllama.md with step‑by‑step guide.
Includes prerequisites, model generation commands, full processing pipeline, and a summary.

ONE-DCO-1.0-Signed-off-by: Sanggyu Lee sg5.lee@samsung.com

runtime/ggma/examples/generate_text/README.md

runtime/ggma/examples/generate_text/decode.py

glistening · 2025-11-14T07:17:44Z

I will append how to preparing ggma package and build ggma, and run.

- Created `runtime/ggma/examples/generate_text/tinyllama.md` with step‑by‑step guide. - Includes prerequisites, model generation commands, full processing pipeline, and a summary. ONE-DCO-1.0-Signed-off-by: Sanggyu Lee <sg5.lee@samsung.com>

dayo09 · 2025-11-21T10:06:55Z

runtime/ggma/examples/generate_text/decode.py

+
+model = AutoModelForCausalLM.from_pretrained(model_name)
+model.eval()
+circle_model = tico.convert(model, captured_input)


FOR OTHER REVIEWERS,

You may encounter export error related to vmap_impl which is caused as sdpa_mask_recent_torch is no more torch-exportable since 4.54.0 ~ 4.57.1 (maybe lower versions too, I checked only 4.54.0 and 4.57.1).

It can be resolved by using transformers==4.50.3 as the author wrote in requirements.txt.

runtime/ggma/examples/generate_text/tinyllama/requirements.txt

glistening · 2025-11-23T08:45:24Z

runtime/ggma/examples/generate_text/gyu/common.py

+PR_WORKTREE = "_pr_16233"
+PR_BRANCH = "pr-16233"
+PR_REF = "refs/pull/16233/head"


It will be removed once 16233 is merged.

glistening · 2025-11-24T05:54:25Z

runtime/ggma/examples/generate_text/tinyllama/tinyllama.pipeline

@@ -0,0 +1,10 @@
+decode: |
+  fuse.attention.py < decode_.circle
+      | reshape.io.py input --by_shape [1,16,30,4] [1,16,32,4]


Later, kv_cache's shape will be determined automatically based on config.json.

glistening · 2025-11-24T05:55:11Z

runtime/ggma/examples/generate_text/tinyllama/pipeline.yaml

+
+merge: |
+  merge.circles.py prefill.circle decode.circle
+      | fuse.bmm_lhs_const.py


onert does not allow const lhs for batchmatmul.

glistening · 2025-11-24T05:56:20Z

runtime/ggma/examples/generate_text/tinyllama/pipeline.yaml

+merge: |
+  merge.circles.py prefill.circle decode.circle
+      | fuse.bmm_lhs_const.py
+      | downcast.input_ids.py


I will use int32 instead of int64 (← the default type from TICO generated) for input_ids, which is given by gather.

glistening · 2025-11-24T05:56:49Z

runtime/ggma/examples/generate_text/tinyllama/pipeline.yaml

+  merge.circles.py prefill.circle decode.circle
+      | fuse.bmm_lhs_const.py
+      | downcast.input_ids.py
+      | gc.py > model.circle


It removes unreachable {input/output,tensor,buffer,...}.

glistening · 2025-11-24T05:58:28Z

runtime/ggma/examples/generate_text/tinyllama/pipeline.yaml

+      | transpose.io.kvcache.py > decode.circle
+
+merge: |
+  merge.circles.py prefill.circle decode.circle


It will merge two circles into one circle.
In this phase, the weight sharing is handled by pointing the same buffer index for same content of weights.

glistening commented Nov 14, 2025

View reviewed changes

runtime/ggma/examples/generate_text/README.md Outdated Show resolved Hide resolved

runtime/ggma/examples/generate_text/decode.py Outdated Show resolved Hide resolved

glistening force-pushed the ggma_example branch from 86030e4 to d16d8b1 Compare November 14, 2025 07:18

glistening force-pushed the ggma_example branch 3 times, most recently from 4234213 to a1219ae Compare November 21, 2025 09:48

dayo09 reviewed Nov 21, 2025

View reviewed changes

glistening mentioned this pull request Nov 21, 2025

[tools] Introduce circle2circle (python) #16233

Open

glistening force-pushed the ggma_example branch 3 times, most recently from 056dd75 to f78430e Compare November 22, 2025 09:49

Update document

fd78f13

glistening force-pushed the ggma_example branch 2 times, most recently from b24f78c to 71f6721 Compare November 23, 2025 09:57

dayo09 reviewed Nov 23, 2025

View reviewed changes

runtime/ggma/examples/generate_text/tinyllama/requirements.txt Show resolved Hide resolved

glistening force-pushed the ggma_example branch 3 times, most recently from edf7864 to cb3b36a Compare November 24, 2025 01:48

glistening commented Nov 24, 2025

View reviewed changes

glistening force-pushed the ggma_example branch from cb3b36a to e1d1b3b Compare November 24, 2025 05:52

glistening commented Nov 24, 2025

View reviewed changes

glistening force-pushed the ggma_example branch 2 times, most recently from cd293c9 to 0b8bd39 Compare November 24, 2025 06:03

mhs4670go mentioned this pull request Nov 24, 2025

Add attention operator and adapter for onert Samsung/TICO#400

Merged

glistening force-pushed the ggma_example branch 3 times, most recently from b93d59c to c86b5cd Compare November 25, 2025 04:59

glistening force-pushed the ggma_example branch 3 times, most recently from 3c8d290 to 2816c7f Compare November 26, 2025 04:28

Add USER.md and merge prefill.py and decode.py

f1d3ef6

glistening force-pushed the ggma_example branch from 2816c7f to f1d3ef6 Compare November 26, 2025 04:34

glistening mentioned this pull request Nov 26, 2025

[tools] Introduce circle package manger #16312

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ggma] Add documentation for TinyLlama example#16283

[ggma] Add documentation for TinyLlama example#16283
glistening wants to merge 3 commits intoSamsung:masterfrom
glistening:ggma_example

glistening commented Nov 14, 2025

Uh oh!

Uh oh!

Uh oh!

glistening commented Nov 14, 2025

Uh oh!

dayo09 Nov 21, 2025

Uh oh!

Uh oh!

glistening Nov 23, 2025

Uh oh!

glistening Nov 24, 2025

Uh oh!

glistening Nov 24, 2025

Uh oh!

glistening Nov 24, 2025

Uh oh!

glistening Nov 24, 2025

Uh oh!

glistening Nov 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

glistening commented Nov 14, 2025

Uh oh!

Uh oh!

Uh oh!

glistening commented Nov 14, 2025

Uh oh!

dayo09 Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

glistening Nov 23, 2025

Choose a reason for hiding this comment

Uh oh!

glistening Nov 24, 2025

Choose a reason for hiding this comment

Uh oh!

glistening Nov 24, 2025

Choose a reason for hiding this comment

Uh oh!

glistening Nov 24, 2025

Choose a reason for hiding this comment

Uh oh!

glistening Nov 24, 2025

Choose a reason for hiding this comment

Uh oh!

glistening Nov 24, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants