Fig. 13

Results of sampling on the Pokec dataset starting from nodes with different regions as attributes. The distribution on the whole network is the black vertical line. Vertical lines on top of bars represent standard deviations across 10 runs of sampling. Numbers inside the legend are KL-divergence and entropy ratios between attribute distribution on the entire network and that inside the sample, as defined in the main text