Quantized models are slower than float models on GPUs

thanks, I will do it !