https://github.com/unicode-org/unilex/tree/master/data/frequency
I reckon we could get a long way with default models. Not perfect for all languages but maybe a decent base for others to work on.
Also they appear to be TSV files -- just need to strip off a line or two at the start!