E.g. `händler` yields the token `ändler` in Python 3.7 and the token `ndler` in Python 2.7. The same problem occurs for words with other diacritics.
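
For anyone trying to narrow this down, here is a minimal sketch of one way an `ndler`-style token can arise. It assumes a hypothetical token pattern of "two or more word characters" (`\w\w+`), since the tokenizer's real pattern is not shown here. It compares Unicode-aware matching with ASCII-only matching in Python 3. It is only an illustration of how `\w` treats `ä`, not a diagnosis, and it does not reproduce the `ändler` result seen in Python 3.7.

```python
import re

# Hypothetical token pattern (two or more word characters).
# This is an assumption for illustration, not the tokenizer's confirmed pattern.
TOKEN_PATTERN = r"\w\w+"

# "händler" written with a precomposed ä (U+00E4) to avoid normalization ambiguity.
word = "h\u00e4ndler"

# Default for str patterns in Python 3: \w is Unicode-aware, so ä counts as a word character.
unicode_tokens = re.findall(TOKEN_PATTERN, word)

# With re.ASCII, \w only matches [a-zA-Z0-9_], so ä acts as a separator.
# The lone "h" is too short for the pattern, leaving only the tail of the word.
ascii_tokens = re.findall(TOKEN_PATTERN, word, flags=re.ASCII)

print(unicode_tokens)
print(ascii_tokens)
```

With ASCII-only matching the word is split at `ä` and only `ndler` survives, while Unicode-aware matching keeps `händler` whole. If the tokenizer behaves like this on byte strings in 2.7, decoding the input to Unicode before tokenizing may be worth checking.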