Tokenize IPA-encoded strings.
Parameters
seq : string or unicode
diacritics : unicode
vowels : unicode
merge_vowels : bool
Returns
tokens : list
See also
Examples
>>> from lingpy import *
>>> myseq = 't͡sɔyɡə'
>>> ipa2tokens(myseq)
[u't\u0361s', u'\u0254y', u'\u0261', u'\u0259']
>>> for t in ipa2tokens(myseq): print(t)
t͡s
ɔy
ɡ
ə
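
The output above suggests how the parameters interact: the tie bar keeps "t͡s" together and the adjacent vowels "ɔy" end up in a single token. The following is a minimal sketch of that idea, not LingPy's actual implementation. The function name `simple_ipa_tokens` and the character sets `TIE_BARS`, `DIACRITICS`, and `VOWELS` are hypothetical and chosen only for illustration; the defaults used by `ipa2tokens` may well differ.

```python
# Illustrative sketch only: this is NOT LingPy's implementation.
# All names and character sets below are assumptions made for this example.
TIE_BARS = "\u0361\u035c"  # combining tie bars (above and below)
DIACRITICS = "\u02b0\u02b7\u02b2\u02d0\u0303"  # a few modifiers: aspiration, labialization, palatalization, length, nasalization
VOWELS = "aeiouy\u0254\u0259\u025b"  # a very small vowel inventory


def simple_ipa_tokens(seq, diacritics=DIACRITICS, vowels=VOWELS, merge_vowels=True):
    """Split an IPA string into tokens using simple character-class rules."""
    tokens = []
    join_next = False  # True immediately after a tie bar
    for char in seq:
        if not tokens:
            # The first character always opens a token.
            tokens.append(char)
        elif char in TIE_BARS:
            # A tie bar attaches to the current token and binds the next symbol.
            tokens[-1] += char
            join_next = True
        elif join_next or char in diacritics:
            # Symbols after a tie bar, and diacritics, extend the current token.
            tokens[-1] += char
            join_next = False
        elif merge_vowels and char in vowels and tokens[-1][0] in vowels:
            # Consecutive vowels are merged into one token (e.g. diphthongs).
            tokens[-1] += char
        else:
            tokens.append(char)
    return tokens
```

Under these assumptions, `simple_ipa_tokens("t\u0361s\u0254y\u0261\u0259")` yields the same four tokens as the example above, while passing `merge_vowels=False` splits the vowel pair into two separate tokens.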