Starcoder - Search News

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

We introduce StarCoder2-15B-Instruct-v0.1, the very first entirely self-aligned code Large Language Model (LLM) trained with a fully permissive and transparent pipeline. Our open-source pipeline uses ...

GitHub

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Thanks to AWQ, TinyChat can deliver more efficient responses with LLM/VLM chatbots through 4-bit inference: 3.4x faster than FP16 on an RTX 4090 and 3.2x faster than FP16 on Jetson Orin ...
