remove N (size of corpus vocabulary) from phrase model formula #5

Alexjmsherman · 2018-04-28T03:25:26Z

According to the gensim documentation (https://radimrehurek.com/gensim/models/phrases.html#id2) for the models.phrases class, the formula for the phase model is from Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, and Jeffrey Dean. Distributed Representations of Words and Phrases and their Compositionality. In Proceedings of NIPS, 2013.

In the paper, the equation does not include N (size of the corpus vocabulary) as is listed in your notebook. I updated the equation removing N and it's definition
https://papers.nips.cc/paper/5021-distributed-representations-of-words-and-phrases-and-their-compositionality.pdf

FYI, I saw you present this at PyData D.C. I thought it was a great presentation and still, clearly, refer to this notebook often. Thanks for putting it together.

…is not used in gensim implementation

remove N (size of corpus vocabulary) from phrase model formula as it …

377e9e4

…is not used in gensim implementation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

remove N (size of corpus vocabulary) from phrase model formula #5

remove N (size of corpus vocabulary) from phrase model formula #5

Uh oh!

Alexjmsherman commented Apr 28, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

remove N (size of corpus vocabulary) from phrase model formula #5

Are you sure you want to change the base?

remove N (size of corpus vocabulary) from phrase model formula #5

Uh oh!

Conversation

Alexjmsherman commented Apr 28, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant