[gemm3] Fix EOS token handling for Gemma text-only export by seyeong-han · Pull Request #206 · huggingface/optimum-executorch

seyeong-han · 2026-01-26T22:30:33Z

Summary

When exporting Gemma models with --task "text-generation", the C++ text runner wasn't stopping at the <end_of_turn> token (ID 106), only at the <eos> token (ID 1). This caused generation to continue beyond expected stopping points.

This diff modifies save_config_to_constant_methods() to export multiple EOS token IDs via get_eos_ids, which the C++ runner's kEosIds method already supports.

Changes

EOS list handling: Build a list of EOS token IDs that handles both single int and list cases from config.eos_token_id
Gemma detection: Detect Gemma models via config.model_type containing "gemma" and automatically add token 106 (<end_of_turn>) to the EOS list
Dual export: Export both get_eos_ids (full list) for C++ runner compatibility and get_eos_id (first element) for backward compatibility

Compatibility

Backward compatible: Python modeling.py already checks for both get_eos_id and get_eos_ids
C++ runner ready: The C++ runner's get_eos_ids() function already supports reading a list of EOS tokens via the kEosIds method

Test Plan

Export Gemma-3-1b-it with --task "text-generation" and verify metadata contains get_eos_ids: [1, 106]
Run the C++ text runner and verify generation stops at <end_of_turn>
Run existing tests: pytest tests/models/test_modeling_gemma3.py -v

…tion Add support for exporting multiple EOS token IDs (`get_eos_ids`) in model metadata to enable proper generation stopping for Gemma models. - Handle cases where config.eos_token_id is already a list vs single int - Detect Gemma models via config.model_type and add <end_of_turn> token (106) - Export get_eos_ids (list) for C++ runner compatibility - Maintain get_eos_id (first ID) for backward compatibility

seyeong-han · 2026-02-02T21:55:58Z

I believe this is not a universal way to adopt the various chat-templates.
So, I created another PR which can accept the chat-template variations by utilizing jinja format.

pytorch/executorch#16987

seyeong-han mentioned this pull request Jan 26, 2026

[gemma3] Add text-only runner for gemma-3-1B-it model pytorch/executorch#16885

Draft

mergennachin requested review from JacobSzwejbka, larryliu0820 and mergennachin January 30, 2026 20:51

seyeong-han closed this Feb 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[gemm3] Fix EOS token handling for Gemma text-only export#206

[gemm3] Fix EOS token handling for Gemma text-only export#206
seyeong-han wants to merge 1 commit intohuggingface:mainfrom
seyeong-han:fix-gemma-eos-token-handling

seyeong-han commented Jan 26, 2026

Uh oh!

seyeong-han commented Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

seyeong-han commented Jan 26, 2026

Summary

Changes

Compatibility

Test Plan

Uh oh!

seyeong-han commented Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant