Table 6 Average degree of all nodes vs. average degree of the words appearing in the lists of the 1000, 3000, 5000 and 10000 most frequent words (from Moby Thesaurus II and the Natural Language Corpus Data chapter of Beautiful Data (Norvig 2009)) in the WordNet synonyms network, Moby Thesaurus, and the Word2Vec embeddings of Google News and Amazon Reviews containing WordNet words (cosine similarity threshold = 0.5)

From: Graph-based exploration and clustering analysis of semantic spaces

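Column groups: WordNet and Moby Thesaurus are lexical databases; Google News and Amazon Reviews are Word2Vec embeddings.

Average degree of most frequent words

| Number of words | WordNet | Moby Thesaurus | Google News | Amazon Reviews |
| --- | --- | --- | --- | --- |
| All words | 7.32 | 34.52 | 69.89 | 31.63 |
| 1000 | 22.48 | 241.33 | 10.72 | 8.16 |
| 3000 | 17.58 | 175.71 | 11.08 | 15.88 |
| 5000 | 16.02 | 157.61 | 11.50 | 17.91 |
| 10000 | 14.10 | 134.01 | 14.23 | 21.80 |

Ratio of average degree of most frequent words to average degree of all words

| Number of words | WordNet | Moby Thesaurus | Google News | Amazon Reviews |
| --- | --- | --- | --- | --- |
| 1000 | 3.06 | 6.99 | 0.15 | 0.25 |
| 3000 | 2.39 | 5.08 | 0.16 | 0.50 |
| 5000 | 2.19 | 4.57 | 0.17 | 0.57 |
| 10000 | 1.92 | 3.88 | 0.20 | 0.69 |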
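
The ratio rows in Table 6 divide the average degree of the most frequent words by the average degree of all words in the same network. The following is a minimal sketch of that arithmetic, not the authors' code. It assumes the network is available as a plain adjacency mapping from each word to its set of neighbours; the function names `average_degree` and `degree_ratio` and the toy graph are hypothetical and chosen only for illustration.

```python
from typing import Dict, Iterable, Set


def average_degree(adjacency: Dict[str, Set[str]], nodes: Iterable[str]) -> float:
    """Mean number of neighbours over those requested nodes that exist in the graph."""
    present = [n for n in nodes if n in adjacency]
    if not present:
        raise ValueError("none of the requested nodes are in the graph")
    return sum(len(adjacency[n]) for n in present) / len(present)


def degree_ratio(adjacency: Dict[str, Set[str]], frequent_words: Iterable[str]) -> float:
    """Average degree of the frequent words divided by the average degree of all nodes."""
    all_nodes_average = average_degree(adjacency, adjacency)
    return average_degree(adjacency, frequent_words) / all_nodes_average


# Hypothetical toy synonym graph (not data from the paper).
toy_graph = {
    "big": {"large", "huge", "great"},
    "large": {"big", "huge"},
    "huge": {"big", "large"},
    "great": {"big"},
}
# Hypothetical "most frequent words" list for the toy graph.
toy_frequent = ["big", "large"]

ratio = degree_ratio(toy_graph, toy_frequent)
print(f"ratio of frequent-word degree to overall degree: {ratio:.2f}")
```

Under this reading, a ratio above 1 (as in the WordNet and Moby Thesaurus columns) means the frequent words are better connected than the average word, while a ratio below 1 (as in the Word2Vec columns) means they have fewer neighbours than average.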