The LLM Calls page is the single place to find any LLM invocation that has ever passed through Observatory. Use it when you know what you’re looking for — a user, an agent, an error spike, a slow call — and you want to drill into the actual prompt and completion.

Key features

Lazy-loaded table

Server-side pagination, sorting, and filtering. The page stays fast on datasets of millions of rows.

Global text search

The search box runs across agent name, model, input, and output.

Filter by agent

Quick filter to scope the table to a single agent name.

Time range

A selector in the page header with Last Hour, 3H, 24H, 7D, 1M, and 6M options.

Status badges

Success, error, in-progress, and timed-out statuses are colour-coded so you can read them at a glance.

Detail drawer

Click any row to open the full input, output, metadata, and cost breakdown.

Columns

| Column | What it shows |
| --- | --- |
| Status | success, error, started, or timed-out (still in started after more than 5 minutes). |
| Start time | When the call began, in the viewer’s local timezone. |
| Agent | The @agent("name") decorator value. |
| Model | Provider model identifier, for example gpt-4 or claude-opus-4-7. |
| Duration | Wall-clock seconds. Below 4s is success, 4–8s is warning, above 8s is danger. |
| User | The user who triggered the call, when available in event metadata. |
| Tokens | Total tokens for the call (prompt + completion). |
| Requests | Number of HTTP requests bundled into the same logical call. |
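
The Status, Duration, and Tokens rules in the table can be sketched in a few lines of Python. This is only an illustration of the documented thresholds, not Observatory's own code: the function names are hypothetical, and treating exactly 4s and exactly 8s as warning is an assumption based on the "4–8s" wording.

```python
from datetime import datetime, timedelta, timezone

# Thresholds taken from the Columns table; the names are hypothetical.
WARNING_AT_S = 4.0
DANGER_ABOVE_S = 8.0
TIMEOUT_AFTER = timedelta(minutes=5)


def duration_badge(seconds: float) -> str:
    """Map wall-clock seconds to the badge level described in the table."""
    if seconds < WARNING_AT_S:
        return "success"
    if seconds <= DANGER_ABOVE_S:  # assumption: both 4s and 8s count as warning
        return "warning"
    return "danger"


def display_status(raw_status: str, started_at: datetime, now: datetime) -> str:
    """Show a run still in 'started' as 'timed-out' once it is older than 5 minutes."""
    if raw_status == "started" and now - started_at > TIMEOUT_AFTER:
        return "timed-out"
    return raw_status


def total_tokens(prompt_tokens: int, completion_tokens: int) -> int:
    """The Tokens column is the sum of prompt and completion tokens."""
    return prompt_tokens + completion_tokens


if __name__ == "__main__":
    now = datetime(2026, 6, 2, 12, 0, tzinfo=timezone.utc)
    print(duration_badge(9.2))
    print(display_status("started", now - timedelta(minutes=6), now))
```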

The text search box performs a contains match against the agent name, model, input, and output fields. The agent dropdown is equivalent to an exact-match filter on the Agent column. For more complex queries (status, model, tags, latency bounds, cost), use the API directly:
GET /api/run/?status=error&model=gpt-4&min_latency_s=8
See LLMRunsQuery for the full parameter list.
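
As a rough sketch, the request above could be assembled in Python with the standard library. The host `observatory.example.com` and the helper `build_runs_url` are hypothetical placeholders; only the `/api/run/` path and the `status`, `model`, and `min_latency_s` parameters come from the example above. Check LLMRunsQuery before relying on any other parameter name.

```python
from urllib.parse import urlencode

# Hypothetical host; replace with your own Observatory deployment.
BASE_URL = "https://observatory.example.com"


def build_runs_url(base_url: str = BASE_URL, **filters) -> str:
    """Build a /api/run/ URL, skipping filters whose value is None."""
    params = {key: value for key, value in filters.items() if value is not None}
    query = urlencode(params)
    path = f"{base_url}/api/run/"
    return f"{path}?{query}" if query else path


if __name__ == "__main__":
    # Error runs on gpt-4 that took at least 8 seconds.
    print(build_runs_url(status="error", model="gpt-4", min_latency_s=8))
```

Authentication is omitted here because the page does not describe it.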

The detail drawer

Click any row to open the run detail drawer. It contains:
  • The full prompt and completion (rendered as text or JSON depending on payload)
  • The chain-of-calls tree if the run contained nested chains or tools — links to Traces
  • Token and cost breakdown by model
  • Metadata extracted by the SDK (user_email, custom tags, request ID)
Long outputs are virtualised. Use the Copy button in the drawer header to grab the full payload when you need to paste it into a bug report.

Common workflows

To find recent errors for one agent:
  1. Set the time range to 24H.
  2. Pick the agent from the Filter by agent dropdown.
  3. Sort by Status to bring errors to the top.

To find slow calls, sort by Duration descending. Rows with the red danger badge are above the 8s threshold.

To find the call behind a user report, paste a snippet of the user’s input into the search box. Matching rows surface immediately.
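
For readers who export rows and want to reproduce these workflows offline, the sketch below applies the same logic to an in-memory list. The `Run` dataclass and its field names are hypothetical, the sample rows are made up for illustration, and case-insensitive matching is an assumption; the page itself performs this work server-side. The time range step is left out for brevity.

```python
from dataclasses import dataclass


@dataclass
class Run:
    # Hypothetical row shape; field names are illustrative only.
    status: str
    agent: str
    model: str
    duration_s: float
    input: str
    output: str


def errors_for_agent(runs: list[Run], agent: str) -> list[Run]:
    """Exact-match filter on the agent, then bring error rows to the top."""
    scoped = [run for run in runs if run.agent == agent]
    # sorted() is stable, so rows keep their original order within each group.
    return sorted(scoped, key=lambda run: run.status != "error")


def slow_calls(runs: list[Run], danger_above_s: float = 8.0) -> list[Run]:
    """Sort by duration descending and keep rows above the danger threshold."""
    ordered = sorted(runs, key=lambda run: run.duration_s, reverse=True)
    return [run for run in ordered if run.duration_s > danger_above_s]


def search(runs: list[Run], text: str) -> list[Run]:
    """Contains match across agent name, model, input, and output."""
    needle = text.lower()  # assumption: matching ignores case
    return [
        run for run in runs
        if any(needle in field.lower()
               for field in (run.agent, run.model, run.input, run.output))
    ]


# Made-up sample rows for illustration.
SAMPLE_RUNS = [
    Run("success", "support-bot", "gpt-4", 2.0, "reset my password", "Here is how"),
    Run("error", "support-bot", "gpt-4", 9.5, "refund order 1234", ""),
    Run("success", "billing-bot", "claude-opus-4-7", 12.0, "Refund status?", "Processed"),
]

if __name__ == "__main__":
    print([run.status for run in errors_for_agent(SAMPLE_RUNS, "support-bot")])
    print([run.duration_s for run in slow_calls(SAMPLE_RUNS)])
    print(len(search(SAMPLE_RUNS, "refund")))
```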

Traces

For runs that contain nested chains and tools, see the trace view.

Analytics

Aggregate trends across the same data.
Last modified on June 2, 2026