Fig. 2
From: Selective network discovery via deep reinforcement learning on embedded spaces

Schematic of the NAC algorithm. In the first component, NAC applies a network embedding followed by a truncation step, which keeps the state-action space from exploding as the network grows. The truncation block ensures a constant-size input to the learned policy. In the second component, NAC uses reinforcement learning to learn a policy offline.
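
As a rough illustration of why truncation yields a constant-size policy input, the following minimal sketch keeps the k highest-scoring node embeddings and zero-pads when fewer than k nodes are available. It is not the authors' implementation: the function name `truncate_embeddings`, the per-node `scores`, and the zero-padding rule are all assumptions made for this example, since the caption does not specify how the truncation block selects nodes.

```python
from typing import List, Sequence


def truncate_embeddings(
    embeddings: Sequence[Sequence[float]],
    scores: Sequence[float],
    k: int,
    dim: int,
) -> List[List[float]]:
    """Return exactly k vectors of length dim, whatever the network size.

    Hypothetical helper: ranking by `scores` and zero-padding are
    assumptions for illustration, not details taken from the paper.
    """
    if len(embeddings) != len(scores):
        raise ValueError("embeddings and scores must have the same length")

    # Rank node indices from highest to lowest score (ties keep input order).
    order = sorted(range(len(embeddings)), key=lambda i: scores[i], reverse=True)

    # Keep at most k embeddings, so growth of the network cannot grow the input.
    kept = [list(embeddings[i]) for i in order[:k]]

    # Pad with zero vectors when the observed network has fewer than k nodes.
    kept.extend([0.0] * dim for _ in range(k - len(kept)))
    return kept


# A 2-node network and a 10-node network both map to a 3 x 2 input.
small = truncate_embeddings([[1.0, 2.0], [3.0, 4.0]], [0.1, 0.9], k=3, dim=2)
large = truncate_embeddings(
    [[float(i), 0.0] for i in range(10)], [float(i) for i in range(10)], k=3, dim=2
)
```

Under these assumptions, the policy always receives k vectors of length dim, so its input layer does not need to change as more of the network is discovered.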