examples/offline_inference/basic/score.py

# SPDX-License-Identifier: Apache-2.0
# SPDX-FileCopyrightText: Copyright contributors to the vLLM project

from argparse import Namespace

from vllm import LLM, EngineArgs
from vllm.attention.backends.registry import AttentionBackendEnum
from vllm.config import AttentionConfig
from vllm.platforms import current_platform
from vllm.utils.argparse_utils import FlexibleArgumentParser


def parse_args():
    parser = FlexibleArgumentParser()
    parser = EngineArgs.add_cli_args(parser)
    # Set example specific arguments
    parser.set_defaults(
        model="BAAI/bge-reranker-v2-m3",
        runner="pooling",
        enforce_eager=True,
    )
    return parser.parse_args()


def main(args: Namespace):
    # On ROCm, explicitly select the FlexAttention backend.
    if current_platform.is_rocm():
        args.attention_config = AttentionConfig(
            backend=AttentionBackendEnum.FLEX_ATTENTION
        )

    # Sample prompts.
    text_1 = "What is the capital of France?"
    texts_2 = [
        "The capital of Brazil is Brasilia.",
        "The capital of France is Paris.",
    ]

    # Create an LLM.
    # Cross-encoder models need runner="pooling" (the default set in parse_args()).
    llm = LLM(**vars(args))

    # Score text_1 against each entry in texts_2.
    # The output is a list of ScoringRequestOutput objects, one per pair.
    outputs = llm.score(text_1, texts_2)

    # Print the outputs.
    print("\nGenerated Outputs:\n" + "-" * 60)
    for text_2, output in zip(texts_2, outputs):
        score = output.outputs.score
        print(f"Pair: {[text_1, text_2]!r} \nScore: {score}")
        print("-" * 60)


if __name__ == "__main__":
    args = parse_args()
    main(args)