From: From free text to clusters of content in health records: an unsupervised graph partitioning approach
Window Size | Minimum Count | Subsampling | NRLS 1M | NRLS 2M | NRLS 13M+ | Wikipedia 5M+ |
---|---|---|---|---|---|---|
15 | 5 | 0.001 | 765 | 755 | 836 | 531 |
5 | 5 | 0.001 | 807 | 775 | 798 | 580 |
5 | 20 | 0.001 | 801 | 785 | 809 | 587 |
5 | 20 | 0.00001 | - | - | 379 | 465 |
15 | 20 | 0.00001 | - | - | 387 | 424 |