This documentation is for version 2.0.dev, which is not released yet.
Tokenize the data with the help of orthography profiles.
Parameters
ortho_profile : str (default='')
source : str (default="counterpart")
target : str (default="tokens")
Notes
This is a shortcut to the extended Wordlist class that loads data and automatically tokenizes it.
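As a rough illustration of what profile-based tokenization involves, the sketch below segments each entry of a source column by greedy longest match against a profile and writes the result to a target column. It is not the library's implementation. The function names `tokenize_with_profile` and `add_tokens`, the dict-based profile, and the list-of-dicts data layout are all hypothetical stand-ins. Only the default column names "counterpart" and "tokens" are taken from the parameters above; the documented `ortho_profile` is a string, so the real profile format differs from the dict used here.

```python
# Minimal sketch of orthography-profile tokenization (assumed behavior,
# not the library's actual code).

def tokenize_with_profile(word, profile):
    """Segment `word` by greedy longest match against `profile`.

    `profile` is a hypothetical dict mapping graphemes to output tokens.
    Characters not covered by the profile are emitted unchanged.
    """
    max_len = max((len(g) for g in profile), default=1)
    tokens = []
    i = 0
    while i < len(word):
        # Try the longest candidate first, then shrink.
        for size in range(min(max_len, len(word) - i), 0, -1):
            chunk = word[i:i + size]
            if chunk in profile:
                tokens.append(profile[chunk])
                i += size
                break
        else:
            # No grapheme matched: fall back to the single character.
            tokens.append(word[i])
            i += 1
    return tokens


def add_tokens(rows, profile, source="counterpart", target="tokens"):
    """Read each row's `source` field and store its tokens under `target`."""
    for row in rows:
        row[target] = tokenize_with_profile(row[source], profile)
    return rows


# Hypothetical profile and data, for illustration only.
profile = {"tsch": "tʃ", "sch": "ʃ", "a": "a", "e": "e",
           "t": "t", "u": "u", "d": "d"}
rows = [{"counterpart": "tasche"}, {"counterpart": "deutsch"}]
add_tokens(rows, profile)
```

Trying the longest grapheme first matters here: with a shortest-first match, "tsch" would be split into "t" plus "sch" rather than mapped to a single affricate token.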