Skip to content

Pull requests: casper-hansen/AutoAWQ

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Optimize Triton for MI300 (2-5x higher throughput)
#620 opened Sep 21, 2024 by casper-hansen Loading…
support minicpm3.0
#605 opened Sep 6, 2024 by LDLINGLINGLING Loading…
add qwen2vl support
#599 opened Aug 29, 2024 by kq-chen Loading…
Add support for Phi-3-vision series model
#596 opened Aug 21, 2024 by Isotr0py Loading…
New Model, support DeepSeek MoE model
#560 opened Jul 30, 2024 by xiaobochen123 Loading…
Fix fp16 overflow
#532 opened Jul 2, 2024 by TechxGenus Draft
Added Phi and Phi-2 support
#496 opened Jun 10, 2024 by vigarov Draft
Initial support for Jamba
#454 opened Apr 21, 2024 by TechxGenus Loading…
Adding bert - WIP
#328 opened Feb 5, 2024 by michaelfeil Loading…
Add Codeshell Support
#291 opened Jan 5, 2024 by MeJerry215 Loading…
ProTip! Follow long discussions with comments:>50.