quic / efficient-transformers Public

Notifications You must be signed in to change notification settings
Fork 87
Star 89

Code
Issues 4
Pull requests 42
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: quic/efficient-transformers

Labels 28 Milestones 0

New pull request New

42 Open 941 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Subfunction changes qwen2 5

#999 opened May 20, 2026 by abhishek-singh591 Contributor

Loading…

Qwen image with magcache Diffusers

Use for PR related to diffusers in efficient-transformers.

performance

#998 opened May 20, 2026 by quic-amitraj Contributor • Draft

Repeatkv transform

#997 opened May 19, 2026 by quic-dhirajku Contributor • Draft

Add user_vision_size in VLM's get_specializations for chunked embedding in vLLM v1

#996 opened May 18, 2026 by quic-xiyushi Contributor • Draft

Dflash: Block Diffusion Speculative Decoding

#995 opened May 18, 2026 by vjanfaza Contributor • Draft

Ft_v1 QAIC-profiler hotfix

#994 opened May 18, 2026 by quic-akuruvil Contributor

Loading…

Magcache support for Diffuser Diffusers

Use for PR related to diffusers in efficient-transformers.

performance

#993 opened May 18, 2026 by quic-amitraj Contributor • Draft

[CI-Nightly]: Validating the nightly Result with Previous Result

#992 opened May 18, 2026 by abukhoy Contributor

Loading…

Feat/enable glm4 moe

#991 opened May 15, 2026 by ochougul Contributor

Loading…

Add GLM4-MOE Mode w/Disaggregated Prefill and Decode Support

#988 opened May 14, 2026 by vbaddi Contributor

Loading…

Added head parallel kv blocking enhancement

New feature or request

qeff.blocking

#986 opened May 14, 2026 by kdulla Contributor • Draft

support multiple TLM decode specializations via num_speculative_tokens list

#984 opened May 13, 2026 by eplatero97 Contributor

Loading…

4 tasks done

Adding PagedAttention support for CausalLM models enhancement

New feature or request

#982 opened May 13, 2026 by vaibverm Contributor

Loading…

Fix for fp16/bf16 export & compile in qwen3vl & qwen3vlmoe models

#980 opened May 12, 2026 by qcdipankar Contributor

Loading…

Diffusers CI conditional check Diffusers

Use for PR related to diffusers in efficient-transformers.

#978 opened May 11, 2026 by quic-amitraj Contributor

Loading…

Added support of QEffDiffusionPipeline for Diffusers Diffusers

Use for PR related to diffusers in efficient-transformers.

#977 opened May 11, 2026 by quic-amitraj Contributor

Loading…

Layerwise int4 kimi

#973 opened May 7, 2026 by abhishek-singh591 Contributor • Draft

TF and other package update

#967 opened May 6, 2026 by quic-hemagnih Contributor • Draft

Gemma4

#966 opened May 6, 2026 by tchawada Contributor

Loading…

Add DPO specific changes

#964 opened May 6, 2026 by quic-akuruvil Contributor • Draft

MLA Int4 Changes

#962 opened May 5, 2026 by quic-mamta Contributor • Draft

Enable ffn blocking for dense models with automatic blocking configurator enhancement

New feature or request

qeff.blocking

#958 opened May 4, 2026 by kdulla Contributor

Loading…

fix: improve weight offloading to handle plain tensor attrs and use to_empty()

#952 opened Apr 28, 2026 by quic-rishinr Contributor

Loading…

feat(moe): NSP-blocked expert dispatch for Qwen3MOE and GPT-OSS prefill enhancement

New feature or request

#935 opened Apr 21, 2026 by vbaddi Contributor

Loading…

Added MDP generation to QEff Compile

#930 opened Apr 21, 2026 by quic-mohmeh

Loading…

Previous 1 2 Next

Previous Next

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!