Table 4 Dataset characteristics for name extractions

From: Network-theoretic information extraction quality assessment in the human trafficking domain

| Metric | Set1 | Set2 | Set3 | Set4 | Set5 | Set6 | Set7 | Set8 |
|---|---|---|---|---|---|---|---|---|
| No. of unique extractions | 925 | 631 | 900 | 468 | 594 | 1,076 | 1,218 | 1,169 |
| No. of extractions per advertisement | 0.7563 | 0.6271 | 2.7060 | 0.3794 | 0.5140 | 1.0039 | 2.9484 | 0.6269 |
| No. of ads with no extractions | 4,708 | 5,937 | 532 | 7,607 | 6,536 | 3,613 | 328 | 6,362 |
| No. of ads with at least 1 extraction | 6,822 | 5,593 | 10,998 | 3,923 | 4,994 | 7,917 | 11,202 | 5,168 |
| Precision | 1.0 | 0.6050 | 0.1899 | 1.0 | 1.0 | 0.7533 | 0.2565 | 0.6050 |
| Recall | 1.0 | 0.5016 | 0.6796 | 0.5016 | 0.6796 | 1.0 | 1.0 | 0.5015 |
| F-score | 1.0 | 0.5485 | 0.2969 | 0.6681 | 0.8092 | 0.8593 | 0.4083 | 0.5484 |
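
The table does not state how the F-score is defined. A minimal sketch, assuming it is the balanced F-score (the harmonic mean of precision and recall), recomputes it from the Precision and Recall rows so it can be compared with the reported F-score row. The function name `f_score` and the dictionary names are illustrative and do not come from the source; the numbers are copied from the table.

```python
# Precision, recall and reported F-score per dataset, copied from Table 4.
PRECISION = {"Set1": 1.0, "Set2": 0.6050, "Set3": 0.1899, "Set4": 1.0,
             "Set5": 1.0, "Set6": 0.7533, "Set7": 0.2565, "Set8": 0.6050}
RECALL = {"Set1": 1.0, "Set2": 0.5016, "Set3": 0.6796, "Set4": 0.5016,
          "Set5": 0.6796, "Set6": 1.0, "Set7": 1.0, "Set8": 0.5015}
REPORTED_F = {"Set1": 1.0, "Set2": 0.5485, "Set3": 0.2969, "Set4": 0.6681,
              "Set5": 0.8092, "Set6": 0.8593, "Set7": 0.4083, "Set8": 0.5484}


def f_score(precision: float, recall: float) -> float:
    """Balanced F-score: harmonic mean of precision and recall.

    Assumption: the table's F-score uses this definition; the table
    itself does not say so.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    for name in PRECISION:
        computed = f_score(PRECISION[name], RECALL[name])
        print(f"{name}: computed={computed:.4f} reported={REPORTED_F[name]:.4f}")
```

Because the tabulated precision and recall are themselves rounded to four decimals, small differences in the last digit between the computed and reported values would be expected under this assumption.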