Policy Pruning and Shrinking of Deep Reinforcement Learning for Edge Devices