The signature of Tokenizer.transform suggests that the function accepts an error argument describing how to deal with undefined segments:
segments/src/segments/tokenizer.py
Line 229 in 369e36d
def transform(self, word, column=Profile.GRAPHEME_COL, error=errors.replace):
but the actual fallback
segments/src/segments/tokenizer.py
Line 262 in 369e36d
target = self._errors['replace'](token)
is always the replace strategy, regardless of what is passed as error.
(I came here looking for a keep strategy that would let me inspect the errors by handing the undefined segments back to me unchanged instead of transforming them. Even ignore replaces them with nothing. But that's a different issue.)
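To make the report concrete, here is a minimal, self-contained sketch of the behaviour described above. It is not the library's code: ToyTokenizer, transform_reported, transform_expected, the keep function and the "?" placeholder are all hypothetical names chosen for illustration. Only the hard-coded self._errors['replace'] lookup and the error parameter mirror the lines quoted from tokenizer.py.

```python
# Toy model of the reported behaviour; all names here are hypothetical
# stand-ins, not the real segments API.

def replace(token):
    # Stand-in "replace" strategy; the real library's marker may differ.
    return "?"

def ignore(token):
    # Stand-in "ignore" strategy: the undefined segment becomes nothing.
    return ""

def keep(token):
    # Hypothetical "keep" strategy: hand the undefined segment back unchanged.
    return token

class ToyTokenizer:
    def __init__(self, mapping):
        self.mapping = mapping
        self._errors = {"replace": replace, "ignore": ignore}

    def transform_reported(self, tokens, error=replace):
        # Mirrors the reported problem: `error` is accepted but never used,
        # because the fallback is looked up under the fixed key 'replace'.
        return [
            self.mapping[t] if t in self.mapping else self._errors["replace"](t)
            for t in tokens
        ]

    def transform_expected(self, tokens, error=replace):
        # What the signature suggests: the caller's strategy handles
        # undefined segments.
        return [self.mapping[t] if t in self.mapping else error(t) for t in tokens]

tok = ToyTokenizer({"a": "A"})
reported = tok.transform_reported(["a", "x"], error=keep)
expected = tok.transform_expected(["a", "x"], error=keep)
```

In this sketch, reported still contains the placeholder for "x" even though keep was requested, while expected returns "x" unchanged, which is the kind of inspection a keep strategy would allow.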