Extremely low-bit quantization for Transformers Post date October 12, 2021 ← Enterprise Health & Wellness using wearables → Plenary: A review of on-device fully neural end-to-end speech recognition and synthesis algorithms