Description
Hello.
I am trying to switch my app from the old preview model file gemma-3n-E2B-it-int4.task (from here) to the new stable one, gemma-3n-E2B-it-int4.litertlm (from here). After the change the app stops working properly: it gets stuck for several minutes on the first chat answer attempt and then behaves unexpectedly.
I wonder whether inference is fully supported for these new models.
I see the following repeated in the log:
<end_of_turn><end_of_turn><end_of_turn><end_of_turn><end_of_turn>
so it looks like the model's chat template is handled incorrectly.
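To make the symptom easier to confirm from a captured log, here is a minimal sketch in plain Python. It is not part of flutter_gemma and uses none of its APIs; the helper names `longest_end_of_turn_run` and `truncate_at_first_end_of_turn` are hypothetical and exist only for illustration. It assumes the marker appears in the log exactly as the literal string `<end_of_turn>`.

```python
import re

# Assumption: the marker appears verbatim in the log or response text.
END_OF_TURN = "<end_of_turn>"


def longest_end_of_turn_run(text: str) -> int:
    """Return the length of the longest run of back-to-back markers."""
    # Each match is one uninterrupted run of markers.
    runs = re.findall(r"(?:" + re.escape(END_OF_TURN) + r")+", text)
    return max((len(run) // len(END_OF_TURN) for run in runs), default=0)


def truncate_at_first_end_of_turn(text: str) -> str:
    """Keep only the text that precedes the first marker (hypothetical stopgap)."""
    return text.split(END_OF_TURN, 1)[0]
```

A run longer than 1 would suggest that generation continues past the end of the turn, which is consistent with the chat template or the stop token not being applied for the .litertlm model. Truncating at the first marker would only hide the symptom in the UI; it would not fix the slow first response.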
FYI: Google AI Edge Gallery has just moved to the .litertlm format on both Android and iOS, so the change looks major and permanent.
I use the FlutterGemma class under the hood.
The package version:
flutter_gemma:
version: "0.11.13"
Thanks.