09/04/2025 4.1.0: Meituan LongCat Flash Chat, Llama 4, GPT-OSS (BF16), and GLM-4.5-Air support. New experimental mock_quantization config to skip complex computational code paths during quantization ...
This package can be found on PyPI.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results