MetaStock Language Tutorials

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Large language models (LLMs) show excellent performance but are compute- and memory-intensive. Quantization can reduce memory and accelerate inference. However, for LLMs beyond 100 billion parameters, ...

GitHub

isl-org/lang-seg

This project will no longer be maintained by Intel. Intel has ceased development and contributions including, but not limited to, maintenance, bug fixes, new releases, or updates, to this project.
