This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
All Workflows
add_label_automerge.yml
issue_autolabel.yml
macos-smoke-test.yml
new_pr_bot.yml
pre-commit.yml
stale.yml
Actor
All actors
biondizzle
Status
All status
Success
Failure
Waiting
Running
Revert "[V0 deprecation] Remove V0 CPU/XPU/TPU backends (#20412)"
publish.yml #71
:
Commit
a5dd03c1eb
pushed by
biondizzle
v0.9.2rc2
2026-04-11 03:27:35 +00:00
0s
View workflow file
[Misc] Remove _maybe_ignore_quant_config from GLM4.1v (#20432)
publish.yml #70
:
Commit
2f2fcb31b8
pushed by
biondizzle
v0.9.2rc1
2026-04-11 03:27:35 +00:00
0s
View workflow file
Revert "[V0 deprecation] Remove V0 CPU/XPU/TPU backends (#20412)"
publish.yml #69
:
Commit
a5dd03c1eb
pushed by
biondizzle
v0.9.2
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Misc] Slight improvement of the BNB (#19418)
publish.yml #68
:
Commit
b6553be1bc
pushed by
biondizzle
v0.9.1rc2
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Misc] Fix a config typo in disable_hybrid_kv_cache_manager configuration (#19383)
publish.yml #67
:
Commit
3a7cd627a8
pushed by
biondizzle
v0.9.1rc1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Misc] Slight improvement of the BNB (#19418)
publish.yml #66
:
Commit
b6553be1bc
pushed by
biondizzle
v0.9.1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[BugFix] FA2 MLA Accuracy Issue (#18807)
publish.yml #65
:
Commit
5fbbfe9a4c
pushed by
biondizzle
v0.9.0.1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Mistral tool calling when content is list (#18729)
publish.yml #64
:
Commit
5873877241
pushed by
biondizzle
v0.9.0
2026-04-11 03:27:34 +00:00
0s
View workflow file
[BugFix][Attention] Fix sliding window attention in V1 giving incorrect results (#17574)
publish.yml #63
:
Commit
3015d5634e
pushed by
biondizzle
v0.8.5.post1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Model] Add tuned triton fused_moe configs for Qwen3Moe (#17328)
publish.yml #62
:
Commit
ba41cc90e8
pushed by
biondizzle
v0.8.5
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Core][V0] Enable regex support with xgrammar (#13228)
publish.yml #61
:
Commit
dc1b4a6f13
pushed by
biondizzle
v0.8.4
2026-04-11 03:27:34 +00:00
0s
View workflow file
[V1][Spec Decode] Update N-gram Proposer Interface (#15750)
publish.yml #60
:
Commit
63375f0cdb
pushed by
biondizzle
v0.8.3rc1
2026-04-11 03:27:34 +00:00
0s
View workflow file
Revert "[V1] DP scale-out (1/N): Use zmq ROUTER/DEALER sockets for input queue (#15906)"
publish.yml #59
:
Commit
296c6572dd
pushed by
biondizzle
v0.8.3
2026-04-11 03:27:34 +00:00
0s
View workflow file
[V1][Spec Decode] Update target_logits in place for rejection sampling (#15427)
publish.yml #58
:
Commit
25f560a62c
pushed by
biondizzle
v0.8.2
2026-04-11 03:27:34 +00:00
0s
View workflow file
[V1] Minor V1 async engine test refactor (#15075)
publish.yml #57
:
Commit
61c7a1b856
pushed by
biondizzle
v0.8.1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Make Gemma3 MM V0 only for now (#14971)
publish.yml #56
:
Commit
37e3806132
pushed by
biondizzle
v0.8.0rc2
2026-04-11 03:27:34 +00:00
0s
View workflow file
[V1] [Spec Decode] Support random sampling for spec decode (#13933)
publish.yml #55
:
Commit
8d6cf89526
pushed by
biondizzle
v0.8.0rc1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Fix LoRA extra vocab size (#15047)
publish.yml #54
:
Commit
966f933ee1
pushed by
biondizzle
v0.8.0
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Fix deepseekv3 grouped topk error (#13474)
publish.yml #53
:
Commit
ed6e9075d3
pushed by
biondizzle
v0.7.3
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Misc] Improve error message for incorrect pynvml (#12809)
publish.yml #52
:
Commit
0408efc6d0
pushed by
biondizzle
v0.7.2
2026-04-11 03:27:34 +00:00
0s
View workflow file
Disable chunked prefill and/or prefix caching when MLA is enabled (#12642)
publish.yml #51
:
Commit
4f4d427ac2
pushed by
biondizzle
v0.7.1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Fix Granite 3.0 MoE model loading (#12446)
publish.yml #50
:
Commit
5204ff5c3f
pushed by
biondizzle
v0.7.0
2026-04-11 03:27:34 +00:00
0s
View workflow file
[BugFix] Fix quantization for all other methods (#11547)
publish.yml #49
:
Commit
2339d59f92
pushed by
biondizzle
v0.6.6.post1
2026-04-11 03:27:34 +00:00
0s
View workflow file
Deepseek v3 (#11502)
publish.yml #48
:
Commit
f49777ba62
pushed by
biondizzle
v0.6.6
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Fix request cancellation without polling (#11190)
publish.yml #47
:
Commit
2d1b9baa8f
pushed by
biondizzle
v0.6.5
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Misc] bump mistral common version (#10367)
publish.yml #46
:
Commit
a6221a144a
pushed by
biondizzle
v0.6.4.post1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Build] skip renaming files for release wheels pipeline (#9671)
publish.yml #45
:
Commit
02dbf30e9a
pushed by
biondizzle
v0.6.4
2026-04-11 03:27:34 +00:00
0s
View workflow file
[CI/Build] remove .github from .dockerignore, add dirty repo check (#9375)
publish.yml #44
:
Commit
a2c71c5405
pushed by
biondizzle
v0.6.3.post1
2026-04-11 03:27:33 +00:00
0s
View workflow file
[Docs] Remove PDF build from Readtehdocs (#9347)
publish.yml #43
:
Commit
fd47e57f4b
pushed by
biondizzle
v0.6.3
2026-04-11 03:27:33 +00:00
0s
View workflow file
[Misc] Support quantization of MllamaForCausalLM (#8822)
publish.yml #42
:
Commit
7193774b1f
pushed by
biondizzle
v0.6.2
2026-04-11 03:27:33 +00:00
0s
View workflow file
First
Previous
1
2
3
Next
Last