From: Graph-based exploration and clustering analysis of semantic spaces
Lexical databases | Word2Vec embeddings | |||
---|---|---|---|---|
Number of words | WordNet | Moby thesaurus | Google news | Amazon reviews |
Average degree of most frequent words | ||||
All Words | 7.32 | 34.52 | 69.89 | 31.63 |
1000 | 22.48 | 241.33 | 10.72 | 8.16 |
3000 | 17.58 | 175.71 | 11.08 | 15.88 |
5000 | 16.02 | 157.61 | 11.50 | 17.91 |
10000 | 14.10 | 134.01 | 14.23 | 21.80 |
Ratio of average degree of most frequent words to average degree of all words | ||||
1000 | 3.06 | 6.99 | 0.15 | 0.25 |
3000 | 2.39 | 5.08 | 0.16 | 0.50 |
5000 | 2.19 | 4.57 | 0.17 | 0.57 |
10000 | 1.92 | 3.88 | 0.20 | 0.69 |