Add xLAM tool parser support (#17148)
This commit is contained in:
@@ -226,6 +226,25 @@ AI21's Jamba-1.5 models are supported.
|
||||
|
||||
Flags: `--tool-call-parser jamba`
|
||||
|
||||
### xLAM Models (`xlam`)
|
||||
|
||||
The xLAM tool parser is designed to support models that generate tool calls in various JSON formats. It detects function calls in several different output styles:
|
||||
|
||||
1. Direct JSON arrays: Output strings that are JSON arrays starting with `[` and ending with `]`
|
||||
2. Thinking tags: Using `<think>...</think>` tags containing JSON arrays
|
||||
3. Code blocks: JSON in code blocks (```json ...```)
|
||||
4. Tool calls tags: Using `[TOOL_CALLS]` or `<tool_call>...</tool_call>` tags
|
||||
|
||||
Parallel function calls are supported, and the parser can effectively separate text content from tool calls.
|
||||
|
||||
Supported models:
|
||||
* Salesforce Llama-xLAM models: `Salesforce/Llama-xLAM-2-8B-fc-r`, `Salesforce/Llama-xLAM-2-70B-fc-r`
|
||||
* Qwen-xLAM models: `Salesforce/xLAM-1B-fc-r`, `Salesforce/xLAM-3B-fc-r`, `Salesforce/Qwen-xLAM-32B-fc-r`
|
||||
|
||||
Flags:
|
||||
* For Llama-based xLAM models: `--tool-call-parser xlam --chat-template examples/tool_chat_template_xlam_llama.jinja`
|
||||
* For Qwen-based xLAM models: `--tool-call-parser xlam --chat-template examples/tool_chat_template_xlam_qwen.jinja`
|
||||
|
||||
### Qwen Models
|
||||
|
||||
For Qwen2.5, the chat template in tokenizer_config.json has already included support for the Hermes-style tool use. Therefore, you can use the `hermes` parser to enable tool calls for Qwen models. For more detailed information, please refer to the official [Qwen documentation](https://qwen.readthedocs.io/en/latest/framework/function_call.html#vllm)
|
||||
|
||||
Reference in New Issue
Block a user