Quantization Techniques for Efficient Large Language Model Inference