Files
vllm/examples/online_serving/dashboards/perses
杨朱 · Kiki bb9172030e [Metrics] Complete removal of deprecated vllm:time_per_output_token_seconds metric (#32661)
This PR completes the removal of the deprecated vllm:time_per_output_token_seconds
metric that was deprecated in v0.11, hidden in v0.12, scheduled for removal in v0.13,
but delayed until v0.15.

Signed-off-by: carlory <baofa.fan@daocloud.io>
Co-authored-by: Claude Haiku 4.5 <noreply@anthropic.com>
2026-01-20 12:28:41 +00:00
..

Perses Dashboards for vLLM Monitoring

This directory contains Perses dashboard configurations designed to monitor vLLM performance and metrics.

Requirements

  • Perses instance (standalone or via operator)
  • Prometheus data source configured in Perses
  • vLLM deployment with Prometheus metrics enabled

Dashboard Format

We provide dashboards in the native Perses YAML format that works across all deployment methods:

  • Files: *.yaml (native Perses dashboard specifications)
  • Format: Pure dashboard specifications that work everywhere
  • Usage: Works with standalone Perses, API imports, CLI, and file provisioning
  • Kubernetes: Directly compatible with Perses Operator

Dashboard Descriptions

  • performance_statistics.yaml: Performance metrics with aggregated latency statistics
  • query_statistics.yaml: Query performance and deployment metrics

Deployment Options

Direct Import to Perses

Import the dashboard specifications via Perses API or CLI:

percli apply -f performance_statistics.yaml

Perses Operator (Kubernetes)

The native YAML format works directly with the Perses Operator:

kubectl apply -f performance_statistics.yaml -n <namespace>

File Provisioning

Place the YAML files in a Perses provisioning folder for automatic loading.