Skip to content

Build default lexical models from Unicode unilex data #70

@mcdurdin

Description

@mcdurdin

https://github.com/unicode-org/unilex/tree/master/data/frequency

I reckon we could get a long way with default models. Not perfect for all languages but maybe a decent base for others to work on.

Also they appear to be TSV files -- just need to strip off a line or two at the start!

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    Status

    No status

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions