Extremely low-bit quantization for Transformers