Hardware-Aware Neural Architecture Search and Compression for Efficient Deep Learning