This documentation is for version 2.0.dev, which is not released yet.
Basic class for automatic cognate detection.
Parameters : | filename : str |
Notes
Instantiating this class requires few parameters. The user may, however, modify its behaviour by providing additional attributes in the input file.
Methods
add_entries(entry, source, function[, override]) | Add new entry-types to the word list by modifying given ones. |
align_pairs(idxA, idxB[, method, mode, gop, ...]) | Align all or some words of a given pair of languages. |
calculate(data[, taxa, concepts, cognates, ...]) | Function calculates specific data. |
cluster([method, cluster_method, threshold, ...]) | Function for flat clustering of words into cognate sets. |
get_dict([col, row, entry]) | Function returns dictionaries of the cells matched by the indices. |
get_entries(entry) | Return all entries matching the given entry-type as a two-dimensional list. |
get_etymdict([ref, entry, loans]) | Return an etymological dictionary representation of the word list. |
get_list([row, col, entry, flat]) | Function returns lists of rows and columns specified by their name. |
get_paps([ref, entry, missing]) | Function returns a list of present-absent-patterns of a given word list. |
get_random_distances([method, runs, mode, ...]) | Method calculates random scores for unrelated words in a dataset. |
get_scorer([method, ratio, vscale, runs, ...]) | Create a scoring function based on sound correspondences. |
output(fileformat, **keywords) | Write wordlist to file. |
pickle() | Store a dump of the data in a binary file. |
tokenize([ortho_profile, source, target]) | Tokenize the data with help of orthography profiles. |
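The idea behind cluster, flat clustering of words into cognate sets by means of a threshold, can be illustrated without the library. The following is a minimal stand-alone sketch, not the implementation used by this class: it swaps in a normalized edit distance for the sound-correspondence scores produced by get_scorer, and it links words by simple single linkage. The names edit_distance, normalized_distance and flat_cluster, the sample words, and the threshold value are all hypothetical and chosen for illustration only.

```python
from itertools import combinations


def edit_distance(a, b):
    """Plain Levenshtein distance between two strings (illustrative helper)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def normalized_distance(a, b):
    """Edit distance divided by the length of the longer word."""
    longest = max(len(a), len(b))
    return edit_distance(a, b) / longest if longest else 0.0


def flat_cluster(words, threshold=0.4):
    """Assign a cluster ID to each word (hypothetical helper).

    Two words are linked when their normalized distance is below the
    threshold; every connected group of linked words forms one cluster.
    """
    parent = list(range(len(words)))

    def find(i):
        # Follow parent pointers to the representative of i's group.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(words)), 2):
        if normalized_distance(words[i], words[j]) < threshold:
            parent[find(j)] = find(i)

    # Relabel the groups as 1, 2, 3, ... in order of first appearance.
    labels = {}
    result = []
    for i in range(len(words)):
        root = find(i)
        labels.setdefault(root, len(labels) + 1)
        result.append(labels[root])
    return result


# Assumed toy data: five word forms for a single concept.
words = ["hand", "hant", "hand", "main", "man"]
cognate_ids = flat_cluster(words, threshold=0.4)
print(cognate_ids)  # [1, 1, 1, 2, 2]
```

Lowering the threshold makes the clustering stricter and yields more, smaller sets; raising it merges more words into the same set. The actual cluster method takes its own method, cluster_method and threshold arguments, whose behaviour is not reproduced here.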