Skip to main content

Table 2 Dataset statistics after preprocessing (similar to Shchur et al. 2018)

From: The interplay between communities and homophily in semi-supervised classification using graph neural networks

 

Dataset

Labels

Features

Nodes

Edges

Homophily

Mixing

Homophilic

CORA-ML

7

1433

2485

5209

0.81

0.09

CiteSeer

6

3703

2110

3705

0.74

0.06

PubMed

3

500

19,717

44,335

0.80

0.09

CORA-Full

67

8710

18,703

64,259

0.57

0.10

Non-homophilic

Squirrel

5

2089

5201

216,933

0.22

0.22

Actor

5

932

7600

29,926

0.22

0.21

Texas

4

1703

182

307

0.06

0.16

Wisconsin

5

1703

251

499

0.17

0.16