Quantization for Efficient Inference in Edge Devices