通用助手 / 编排推荐

quantizing-models-bitsandbytes

安装量 325GitHub Stars 27,327更新时间 2026年5月16日

描述

Quantizes LLMs to 8-bit or 4-bit for 50-75% memory reduction with minimal accuracy loss. Use when GPU memory is limited, need to fit larger models, or want…

安全审计

使用前的风险提示

未审计

规则审计

未审计

更新 1年1月1日

智能审计

未审计

更新 1年1月1日

uillmquantizingmodelsbitsandbytesquantizesllmsbitformemoryreductionminimal

GitHub 仓库