-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Pull requests: antirez/ds4
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add Q4_K MoE prefill kernels + CUDA 11.x compatibility for V100
#279
opened May 28, 2026 by
eazlong
Loading…
2 of 4 tasks
Add DS4_EMBED_KERNELS flag to embed Metal kernel sources into binary
#274
opened May 27, 2026 by
kilork
Loading…
CPU support for Q4_K routed experts (fixes #171)
#272
opened May 27, 2026 by
hexxyan
Loading…
4 tasks done
download_model.sh: honor HF_ENDPOINT for Hugging Face mirrors
#270
opened May 27, 2026 by
crimsondhaks
Loading…
Add --api-key Bearer token authentication to ds4-server
#269
opened May 27, 2026 by
hexxyan
Loading…
RFC: Planar3 KV-cache quantization for compressed attention (experimental)
#265
opened May 27, 2026 by
hexxyan
Loading…
Add suffix-tree speculative decoding for repetitive/agentic generation patterns
#261
opened May 26, 2026 by
hexxyan
Loading…
speed-bench: add NVIDIA RTX PRO 6000 Blackwell results
#256
opened May 26, 2026 by
imbibekk
Loading…
speed-bench: add M5 Max 128GB q2-q4-imatrix curve (addresses #226)
#255
opened May 26, 2026 by
kenahrens
Loading…
download_model.sh: prefer huggingface CLI when available
#248
opened May 25, 2026 by
siraustin
Loading…
Add opt-in KV cache compression (turbo3 / turbo4 / comp_cache + HISA indexer)
#243
opened May 24, 2026 by
TheTom
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.