研究学习 / 检索整理

quantizing-models-bitsandbytes

安装量 208GitHub Stars 8,486更新时间 2026年5月16日

描述

Quantizes LLMs to 8-bit or 4-bit for 50-75% memory reduction with minimal accuracy loss. Use when GPU memory is limited, need to fit larger models, or want…

安全审计

使用前的风险提示

未审计

规则审计

未审计

更新 1年1月1日

智能审计

未审计

更新 1年1月1日

uillmquantizingmodelsbitsandbytesquantizesllmsbitformemoryreductionminimal

GitHub 仓库