A hardware-aware neural architecture search algorithm targeting ultra-low-power microcontrollers