vllm/vllm/model_executor at d427e5cfda8d2536b81e6021128e71b2dbc281aa - vllm - Gitea: Git with a cup of tea

biondizzle/vllm

Files

History

Roger Wang b7dcc003dc [Model] Remove hardcoded image tokens ids from Pixtral (#11582 )

Signed-off-by: Roger Wang <ywang@roblox.com>

2024-12-28 10:54:23 +00:00

..

guided_decoding

[Bugfix] Fix CFGGuide and use outlines for grammars that can't convert to GBNF (#11389 )

2024-12-23 23:06:20 +08:00

[Bugfix] Fix for ROCM compressed tensor support (#11561 )

2024-12-27 20:12:11 +00:00

[Misc] Improve BNB loader to handle mixture of sharded and merged weights with same suffix (#11566 )

2024-12-27 19:45:13 +00:00

[Model] Remove hardcoded image tokens ids from Pixtral (#11582 )

2024-12-28 10:54:23 +00:00

__init__.py

[Performance] Optimize e2e overheads: Reduce python allocations (#7162 )

2024-08-08 21:34:28 -07:00

custom_op.py

[misc] move functions to config.py (#10624 )

2024-11-25 09:27:30 +00:00

parameter.py

[Model] [Quantization] Support deepseek_v3 w8a8 fp8 block-wise quantization (#11523 )

2024-12-26 15:33:30 -08:00

pooling_metadata.py

[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (#3734 )

2024-05-11 11:30:37 -07:00

sampling_metadata.py

[Misc] typo find in sampling_metadata.py (#10740 )

2024-11-29 05:17:57 +00:00

utils.py

[Hardware] using current_platform.seed_everything (#9785 )

2024-10-29 14:47:44 +00:00