biondizzle/vllm

Files

History

Thomas Parnell 2be07a0db1 Update docs for Minimax-Text support (#22562 )

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>

2025-08-09 00:18:18 -07:00

..

faq.md

Stop using title frontmatter and fix doc that can only be reached by search (#20623 )

2025-07-08 03:27:40 -07:00

metrics.md

Remove unnecessary explicit title anchors and use relative links instead (#20620 )

2025-07-08 02:49:13 -07:00

README.md

[Doc] Reorganize user guide (#18661 )

2025-05-24 07:25:33 -07:00

reproducibility.md

[Doc] Update reproducibility doc and example (#18741 )

2025-05-27 07:03:13 +00:00

security.md

[Docs] Switch to better markdown linting pre-commit hook (#21851 )

2025-07-29 19:45:08 -07:00

troubleshooting.md

[Docs] Rename “Distributed inference and serving” to “Parallelism & Scaling” (#22466 )

2025-08-08 19:26:21 +00:00

usage_stats.md

Make distinct code and console admonitions so readers are less likely to miss them (#20585 )

2025-07-07 19:55:28 -07:00

v1_guide.md

Update docs for Minimax-Text support (#22562 )

2025-08-09 00:18:18 -07:00

README.md

Using vLLM

vLLM supports the following usage patterns:

Inference and Serving: Run a single instance of a model.
Deployment: Scale up model instances for production.
Training: Train or fine-tune a model.