This website requires JavaScript.
Explore
Help
Register
Sign In
zzh
/
vllm-npu-plugin
Watch
1
Star
0
Fork
0
You've already forked vllm-npu-plugin
mirror of
https://github.com/handsomezhuzhu/vllm-npu-plugin.git
synced
2026-02-20 19:50:15 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
5df056dd17fe106f7002e0d030133383c5afb21f
vllm-npu-plugin
/
vllm_npu
/
quantization
History
handsomezhuzhu
6680585975
大改
2026-02-10 23:08:39 +08:00
..
__init__.py
大改
2026-02-10 23:08:39 +08:00
quant_config.py
大改
2026-02-10 23:08:39 +08:00
utils.py
大改
2026-02-10 23:08:39 +08:00
w4a4_flatquant_dynamic.py
大改
2026-02-10 23:08:39 +08:00
w4a8_dynamic.py
大改
2026-02-10 23:08:39 +08:00
w8a8_dynamic.py
大改
2026-02-10 23:08:39 +08:00
w8a8.py
大改
2026-02-10 23:08:39 +08:00