This website requires JavaScript.
Explore
Help
Register
Sign In
biondizzle
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
2
Packages
Projects
Releases
Wiki
Activity
All Workflows
add_label_automerge.yml
issue_autolabel.yml
macos-smoke-test.yml
new_pr_bot.yml
pre-commit.yml
stale.yml
Actor
All actors
biondizzle
Status
All status
Success
Failure
Waiting
Running
[V1] [Spec Decode] Support random sampling for spec decode (#13933)
publish.yml #55
:
Commit
8d6cf89526
pushed by
biondizzle
v0.8.0rc1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Fix LoRA extra vocab size (#15047)
publish.yml #54
:
Commit
966f933ee1
pushed by
biondizzle
v0.8.0
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Fix deepseekv3 grouped topk error (#13474)
publish.yml #53
:
Commit
ed6e9075d3
pushed by
biondizzle
v0.7.3
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Misc] Improve error message for incorrect pynvml (#12809)
publish.yml #52
:
Commit
0408efc6d0
pushed by
biondizzle
v0.7.2
2026-04-11 03:27:34 +00:00
0s
View workflow file
Disable chunked prefill and/or prefix caching when MLA is enabled (#12642)
publish.yml #51
:
Commit
4f4d427ac2
pushed by
biondizzle
v0.7.1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Fix Granite 3.0 MoE model loading (#12446)
publish.yml #50
:
Commit
5204ff5c3f
pushed by
biondizzle
v0.7.0
2026-04-11 03:27:34 +00:00
0s
View workflow file
[BugFix] Fix quantization for all other methods (#11547)
publish.yml #49
:
Commit
2339d59f92
pushed by
biondizzle
v0.6.6.post1
2026-04-11 03:27:34 +00:00
0s
View workflow file
Deepseek v3 (#11502)
publish.yml #48
:
Commit
f49777ba62
pushed by
biondizzle
v0.6.6
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Bugfix] Fix request cancellation without polling (#11190)
publish.yml #47
:
Commit
2d1b9baa8f
pushed by
biondizzle
v0.6.5
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Misc] bump mistral common version (#10367)
publish.yml #46
:
Commit
a6221a144a
pushed by
biondizzle
v0.6.4.post1
2026-04-11 03:27:34 +00:00
0s
View workflow file
[Build] skip renaming files for release wheels pipeline (#9671)
publish.yml #45
:
Commit
02dbf30e9a
pushed by
biondizzle
v0.6.4
2026-04-11 03:27:34 +00:00
0s
View workflow file
[CI/Build] remove .github from .dockerignore, add dirty repo check (#9375)
publish.yml #44
:
Commit
a2c71c5405
pushed by
biondizzle
v0.6.3.post1
2026-04-11 03:27:33 +00:00
0s
View workflow file
[Docs] Remove PDF build from Readtehdocs (#9347)
publish.yml #43
:
Commit
fd47e57f4b
pushed by
biondizzle
v0.6.3
2026-04-11 03:27:33 +00:00
0s
View workflow file
[Misc] Support quantization of MllamaForCausalLM (#8822)
publish.yml #42
:
Commit
7193774b1f
pushed by
biondizzle
v0.6.2
2026-04-11 03:27:33 +00:00
0s
View workflow file
bump version to v0.6.1.post2 (#8473)
publish.yml #41
:
Commit
9ba0817ff1
pushed by
biondizzle
v0.6.1.post2
2026-04-11 03:27:33 +00:00
0s
View workflow file
bump version to v0.6.1.post1 (#8440)
publish.yml #40
:
Commit
acda0b35d0
pushed by
biondizzle
v0.6.1.post1
2026-04-11 03:27:32 +00:00
0s
View workflow file
Bump version to v0.6.1 (#8379)
publish.yml #39
:
Commit
3fd2b0d21c
pushed by
biondizzle
v0.6.1
2026-04-11 03:27:32 +00:00
0s
View workflow file
Bump version to v0.6.0 (#8166)
publish.yml #38
:
Commit
32e7db2536
pushed by
biondizzle
v0.6.0
2026-04-11 03:27:32 +00:00
0s
View workflow file
Bump version to v0.5.5 (#7823)
publish.yml #37
:
Commit
09c7792610
pushed by
biondizzle
v0.5.5
2026-04-11 03:27:31 +00:00
0s
View workflow file
bump version to v0.5.4 (#7139)
publish.yml #36
:
Commit
4db5176d97
pushed by
biondizzle
v0.5.4
2026-04-11 03:27:31 +00:00
0s
View workflow file
Bump version to 0.5.3.post1 (#6696)
publish.yml #35
:
Commit
38c4b7e863
pushed by
biondizzle
v0.5.3.post1
2026-04-11 03:27:31 +00:00
0s
View workflow file
Bump version to v0.5.3 (#6674)
publish.yml #34
:
Commit
bb2fc08072
pushed by
biondizzle
v0.5.3
2026-04-11 03:27:31 +00:00
0s
View workflow file
[misc][distributed] fix pp missing layer condition (#6446)
publish.yml #33
:
Commit
4cf256ae7f
pushed by
biondizzle
v0.5.2
2026-04-11 03:27:30 +00:00
0s
View workflow file
[Docs] Fix readthedocs for tag build (#6158)
publish.yml #32
:
Commit
79d406e918
pushed by
biondizzle
v0.5.1
2026-04-11 03:27:30 +00:00
0s
View workflow file
Add `cuda_device_count_stateless` (#5473)
publish.yml #31
:
Commit
50eed24d25
pushed by
biondizzle
v0.5.0.post1
2026-04-11 03:27:30 +00:00
0s
View workflow file
[Doc] add common case for long waiting time (#5430)
publish.yml #30
:
Commit
8f89d72090
pushed by
biondizzle
v0.5.0
2026-04-11 03:27:30 +00:00
0s
View workflow file
[Build] Guard against older CUDA versions when building CUTLASS 3.x kernels (#5168)
publish.yml #29
:
Commit
1197e02141
pushed by
biondizzle
v0.4.3
2026-04-11 03:27:30 +00:00
0s
View workflow file
[CI] Reduce wheel size by not shipping debug symbols (#4602)
publish.yml #28
:
Commit
c7f2cf2b7f
pushed by
biondizzle
v0.4.2
2026-04-11 03:27:29 +00:00
0s
View workflow file
[Misc] Reduce supported Punica dtypes (#4304)
publish.yml #27
:
Commit
468d761b32
pushed by
biondizzle
v0.4.1
2026-04-11 03:27:29 +00:00
0s
View workflow file
[CI/Build] 0.4.0.post1, fix sm 7.0/7.5 binary (#3803)
publish.yml #26
:
Commit
a3c226e7eb
pushed by
biondizzle
v0.4.0.post1
2026-04-11 03:27:29 +00:00
0s
View workflow file
First
Previous
1
2
3
Next
Last