vLLM Master Gateway

Live Operations

Monitor live routing and backend health

Runtime status, active requests, and backend health are shown here for quick operational checks.

Live inflight requests using upstream usage token tracking

Request ID	IP	Username	Started (Local)	Model	Endpoint	Stream	Elapsed	TTFT	Input Tokens	Output Tokens

Client auth for /v1/*

One API key per line. Saving here replaces the accepted master keys. Values are stored outside config.yaml.

Read-only runtime view

Endpoint changes should be made through config.yaml or the raw configuration editor below.

Traffic Analytics

These tables focus on operational visibility for the current day across models, upstreams, requesters, and recent failures.

Aggregated by public model name

Model	Req	Success	Prompt	Gen	Avg Lat	Avg TTFT	Avg tok/s

Aggregated by routed upstream

Endpoint	Tier	Req	Success	Prompt	Gen	Avg Lat	Avg TTFT	Avg tok/s

Useful for colleague usage analysis

IP	Req	Success	Prompt	Gen	Avg Lat	Avg TTFT	Avg tok/s	Last Seen (Local)

Latest 50 requests

IP	Username	Model	Endpoint	Status	Stream	Prompt	Gen	Latency	TTFT	tok/s	Error	Local Time

Advanced Config

This editor is best for advanced changes that are not yet exposed through the Web UI. Security-sensitive secrets are stored separately.

Pause auto refresh before editing. The master also reloads external file changes automatically.

Admin password, admin session secret, and master API keys are stored separately from config.yaml.

Loading config…