Closed
Description
The gff.ti.gff_tigrinya lexical model is based on the Unilex project's wordlist for Tigrinya. Unfortunately, the wordlist contains many misspellings and non-Tigrinya words that come from a corpus of unknown provenance and pedigree. It also mixes the conflicting spelling conventions of Eritrea and Ethiopia, which distorts the frequency counts.
An approach that would better meet user expectations is to have a separate wordlist for each region. PRs #216 and #217 address this directly. The gff.ti.gff_tigrinya lexicon can then be deleted from the repository, or moved into a legacy directory if there is interest in preserving it.
Status
Done