Quantization plays a crucial role in deploying Large Language Models (LLMs) in resource-constrained environments. However, the presence of outlier features significantly hinders low-bit quantization.
Abstract: Convolutional neural networks (CNNs) have been widely used in fault diagnosis due to their superiority in feature extraction. Traditional CNNs are a type of closed-box techniques with little ...