[FR] Spaceless scripts word tokenization #6

@pxeemo

I propose using the `Intl.Segmenter` API for word tokenization, since it is now supported in most modern browsers. This approach has several advantages:

  • Fewer dependencies: It removes the need for limited external APIs such as those from Google or Azure.
  • No NLP setup required: It simplifies development by avoiding complex natural language processing configuration.
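
To illustrate the proposal, here is a minimal sketch of word tokenization with `Intl.Segmenter`. The helper name `tokenizeWords` and its `locale` parameter are hypothetical and not taken from this project. It keeps only segments the API flags as word-like, so punctuation and whitespace are dropped.

```javascript
// Hypothetical helper (not part of this project): split text into word
// tokens using the built-in Intl.Segmenter API with word granularity.
function tokenizeWords(text, locale) {
  const segmenter = new Intl.Segmenter(locale, { granularity: "word" });
  const words = [];
  for (const { segment, isWordLike } of segmenter.segment(text)) {
    // isWordLike is false for punctuation and whitespace segments.
    if (isWordLike) {
      words.push(segment);
    }
  }
  return words;
}

// Space-delimited text: punctuation and spaces are filtered out.
console.log(tokenizeWords("Hello, world!", "en"));

// Spaceless script: the exact boundaries depend on the runtime's
// segmentation data, so no expected output is claimed here.
console.log(tokenizeWords("吾輩は猫である", "ja"));
```

Because segmentation of spaceless scripts relies on the dictionary data shipped with each browser, results may differ slightly between engines.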

Related to #4.
