LingPy

This documentation is for version 2.0.dev, which is not released yet.

lingpy.compare.lexstat.LexStat.get_scorer

LexStat.get_scorer(method='shuffle', ratio=(3, 2), vscale=0.5, runs=1000, threshold=0.7, modes=[('global', -2, 0.5), ('local', -1, 0.5)], factor=0.3, restricted_chars='_T', force=False, preprocessing=True, rands=1000, limit=10000, verbose=False, cluster_method='upgma', gop=-2)

Create a scoring function based on sound correspondences.

Parameters :

method : str (default=’markov’)

Select between “markov”, for automatically generated random strings, and “shuffle”, for random strings taken directly from the data.

ratio : tuple (default=3,2)

Define the ratio between derived and original score for sound-matches.

vscale : float (default=0.5)

Define a scaling factor for vowels, in order to decrease their score in the calculations.

runs : int (default=1000)

Choose the number of random runs that shall be made in order to derive the random distribution.

threshold : float (default=0.7)

The threshold which used to select those words that are compared in order to derive the attested distribution.

modes : list (default = [(“global”,-2,0.5),(“local”,-1,0.5)])

The modes which are used in order to derive the distributions from pairwise alignments.

factor : float (default=0.3)

The scaling factor for sound segments with identical prosodic environment.

force : bool (default=False)

Force recalculation of existing distribution.

preprocessing: bool (default=False) :

Select whether SCA-analysis shall be used to derive a preliminary set of cognates from which the attested distribution shall be derived.

rands : int (default=1000)

If “method” is set to “markov”, this parameter defines the number of strings to produce for the calculation of the random distribution.

limit : int (default=10000)

If “method” is set to “markov”, this parameter defines the limit above which no more search for unique strings will be carried out.

cluster_method : {“upgma” “single” “complete”} (default=”upgma”)

Select the method to be used for the calculation of cognates in the preprocessing phase, if “preprocessing” is set to c{True}.

gop : int (default=-2)

If “preprocessing” is selected, define the gap opening penalty for the preprocessing calculation of cognates.

This Page