Deep Model Compression and Acceleration Towards On-Sensor AI