Skip to main content

Table 2 Dataset statistics after preprocessing (similar to Shchur et al. 2018)

From: The interplay between communities and homophily in semi-supervised classification using graph neural networks

  Dataset Labels Features Nodes Edges Homophily Mixing
Homophilic CORA-ML 7 1433 2485 5209 0.81 0.09
CiteSeer 6 3703 2110 3705 0.74 0.06
PubMed 3 500 19,717 44,335 0.80 0.09
CORA-Full 67 8710 18,703 64,259 0.57 0.10
Non-homophilic Squirrel 5 2089 5201 216,933 0.22 0.22
Actor 5 932 7600 29,926 0.22 0.21
Texas 4 1703 182 307 0.06 0.16
Wisconsin 5 1703 251 499 0.17 0.16