IPA tokenizer does not recognize pre-aspiration and pre-nazalization

I just tried to use `segments.Tokenizer()(x, ipa=True)` on some data containing pre-aspirated and pre-nazalized consonants and wondered that a subsequent `pyclts.TranscriptionSystem('bipa')` call complains about very many undefined segments. Apparently `segments` does not know to associate `ᵐ`, `ᵑ` and `ⁿ` with the subsequent sound, but appends them to the preceding vowel. (A similar problem exists with pre-aspirated consonants, but in that case I understand that distinguishing between pre- and post-aspiration is beyond the complexity `segments` wants to provide.)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

IPA tokenizer does not recognize pre-aspiration and pre-nazalization #41

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

IPA tokenizer does not recognize pre-aspiration and pre-nazalization #41

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions