-
Notifications
You must be signed in to change notification settings - Fork 87
Pull requests: quic/efficient-transformers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Qwen image with magcache
Diffusers
Use for PR related to diffusers in efficient-transformers.
performance
#998
opened May 20, 2026 by
quic-amitraj
Contributor
•
Draft
Add user_vision_size in VLM's get_specializations for chunked embedding in vLLM v1
#996
opened May 18, 2026 by
quic-xiyushi
Contributor
•
Draft
Magcache support for Use for PR related to diffusers in efficient-transformers.
performance
Diffuser
Diffusers
#993
opened May 18, 2026 by
quic-amitraj
Contributor
•
Draft
[CI-Nightly]: Validating the nightly Result with Previous Result
#992
opened May 18, 2026 by
abukhoy
Contributor
Loading…
Add GLM4-MOE Mode w/Disaggregated Prefill and Decode Support
#988
opened May 14, 2026 by
vbaddi
Contributor
Loading…
support multiple TLM decode specializations via num_speculative_tokens list
#984
opened May 13, 2026 by
eplatero97
Contributor
Loading…
4 tasks done
Adding PagedAttention support for CausalLM models
enhancement
New feature or request
#982
opened May 13, 2026 by
vaibverm
Contributor
Loading…
Fix for fp16/bf16 export & compile in qwen3vl & qwen3vlmoe models
#980
opened May 12, 2026 by
qcdipankar
Contributor
Loading…
Diffusers CI conditional check
Diffusers
Use for PR related to diffusers in efficient-transformers.
#978
opened May 11, 2026 by
quic-amitraj
Contributor
Loading…
Added support of Use for PR related to diffusers in efficient-transformers.
QEffDiffusionPipeline for Diffusers
Diffusers
#977
opened May 11, 2026 by
quic-amitraj
Contributor
Loading…
Enable ffn blocking for dense models with automatic blocking configurator
enhancement
New feature or request
qeff.blocking
#958
opened May 4, 2026 by
kdulla
Contributor
Loading…
fix: improve weight offloading to handle plain tensor attrs and use to_empty()
#952
opened Apr 28, 2026 by
quic-rishinr
Contributor
Loading…
feat(moe): NSP-blocked expert dispatch for Qwen3MOE and GPT-OSS prefill
enhancement
New feature or request
#935
opened Apr 21, 2026 by
vbaddi
Contributor
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.