This website requires JavaScript.
Explore
Help
Register
Sign In
zzh
/
vllm-npu-plugin
Watch
1
Star
0
Fork
0
You've already forked vllm-npu-plugin
mirror of
https://github.com/handsomezhuzhu/vllm-npu-plugin.git
synced
2026-02-20 19:50:15 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
4ca9d52cf2ca7568d224a6f75c43cf4241acd027
vllm-npu-plugin
/
vllm_npu
/
ops
History
handsomezhuzhu
4ca9d52cf2
feat: Add Ascend NPU attention backend with NPU-specific FlashAttention, LayerNorm, and Rotary Embedding implementations.
2026-02-10 21:56:45 +08:00
..
__init__.py
feat: initial vllm-npu-plugin for Ascend NPU adaptation
2026-02-10 11:06:01 +08:00
activation.py
feat: initial vllm-npu-plugin for Ascend NPU adaptation
2026-02-10 11:06:01 +08:00
layernorm.py
feat: Add Ascend NPU attention backend with NPU-specific FlashAttention, LayerNorm, and Rotary Embedding implementations.
2026-02-10 21:56:45 +08:00
rotary_embedding.py
feat: Add Ascend NPU attention backend with NPU-specific FlashAttention, LayerNorm, and Rotary Embedding implementations.
2026-02-10 21:56:45 +08:00