Usage
The Usage page is the canonical cost-explorer surface for attributed AI usage in the Keeptrusts console. A left configuration sidebar controls the scope, grouping, metric, granularity, and date range. The main pane keeps the view intentionally narrow: one trend graph and one supporting breakdown table.
Use this page when
- You want to see your personal token consumption, request counts, and cost attribution.
- You need to understand why you received a 429 rate-limit response.
- You are an admin widening the scope to review team-level or organization-level usage.
- You want to understand how context injection (knowledge, memory) affects token consumption.
Primary audience
- Primary: Technical Engineers
- Secondary: AI Agents, Technical Leaders
What the page shows
| Surface | Description |
|---|---|
| Filter sidebar | Access scope, team selector, metric, grouping, granularity, date range, and row search |
| Usage explorer graph | Daily or weekly cost, request, or token trend for the current scope or focused table row |
| Supporting breakdown table | Grouped totals by provider, model, user, or team; selecting a row focuses the graph |
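
To illustrate what the supporting breakdown table computes, the sketch below groups usage rows by one dimension and sums one metric. The row shape, the field names (`provider`, `model`, `cost_usd`), and the `group_totals` helper are hypothetical and chosen only for illustration; they are not the console's actual data model.

```python
from collections import defaultdict


def group_totals(rows, group_by, metric):
    """Sum `metric` per distinct `group_by` value, largest total first."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[group_by]] += row[metric]
    # Sort descending so the most expensive group appears at the top.
    return dict(sorted(totals.items(), key=lambda item: item[1], reverse=True))


# Hypothetical usage rows; provider and model names are placeholders.
rows = [
    {"provider": "provider-a", "model": "model-x", "cost_usd": 0.5},
    {"provider": "provider-a", "model": "model-y", "cost_usd": 0.25},
    {"provider": "provider-b", "model": "model-z", "cost_usd": 1.0},
]

by_provider = group_totals(rows, group_by="provider", metric="cost_usd")
```

Changing `group_by` to `"model"` corresponds to switching the grouping in the sidebar while keeping the same metric.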
Token usage totals
Token totals on the Usage page combine prompt, completion, and cached-input tokens for the selected scope and date range. If you need request-level token anatomy, open the relevant session or history detail page.
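
As a minimal sketch of the arithmetic described above, the example below adds the three token categories across a set of requests. The `UsageRecord` type and its field names are hypothetical, and the sketch assumes cached-input tokens are reported separately from prompt tokens rather than as a subset of them.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class UsageRecord:
    """Hypothetical per-request token counts (field names are illustrative)."""
    prompt_tokens: int
    completion_tokens: int
    cached_input_tokens: int


def total_tokens(records: Iterable[UsageRecord]) -> int:
    """Combine prompt, completion, and cached-input tokens into one total."""
    return sum(
        r.prompt_tokens + r.completion_tokens + r.cached_input_tokens
        for r in records
    )


records = [
    UsageRecord(prompt_tokens=1200, completion_tokens=300, cached_input_tokens=500),
    UsageRecord(prompt_tokens=800, completion_tokens=200, cached_input_tokens=0),
]
combined = total_tokens(records)
```

With the two sample records, the combined total is 3000 tokens (2000 from the first request and 1000 from the second).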
Empty state
When you have no attributed usage in the current scope, the graph and table show compact empty states. Broaden the date range, change the scope, or generate new traffic to populate the explorer.
How attribution works
Usage is attributed to you based on:
- Your identity in authenticated gateway requests
- Gateway key usage linked to your account
- Agent-attributed traffic for agents you manage
Retry-After handling
If you exceed your personal usage budget, the API returns HTTP 429 with a
Retry-After header. The Chat workbench displays the remaining wait time and
re-enables input automatically when the window expires. Manual retries before
the window expires return another 429.
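
For API clients outside the Chat workbench, the behavior above can be sketched as a small retry helper. This is an illustration under stated assumptions, not a Keeptrusts SDK: `call_with_retry` and the `send` callable (which returns a status code and a headers mapping) are hypothetical. The page describes Retry-After as a number of seconds; the HTTP-date fallback is included only because the HTTP specification also allows that form.

```python
import time
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime
from typing import Callable, Mapping, Optional, Tuple


def retry_after_seconds(header_value: Optional[str],
                        now: Optional[datetime] = None) -> float:
    """Convert a Retry-After header value into a wait time in seconds."""
    if header_value is None:
        return 0.0
    value = header_value.strip()
    if value.isdigit():
        # Delay-seconds form, e.g. "30".
        return float(value)
    try:
        # HTTP-date form, e.g. "Wed, 21 Oct 2015 07:28:00 GMT".
        retry_at = parsedate_to_datetime(value)
    except (TypeError, ValueError):
        return 0.0  # Unparseable value: do not wait.
    if retry_at.tzinfo is None:
        retry_at = retry_at.replace(tzinfo=timezone.utc)
    now = now or datetime.now(timezone.utc)
    return max(0.0, (retry_at - now).total_seconds())


def call_with_retry(send: Callable[[], Tuple[int, Mapping[str, str]]],
                    max_attempts: int = 3,
                    sleep: Callable[[float], None] = time.sleep) -> int:
    """Call `send`; on HTTP 429, wait for the Retry-After window, then retry.

    `send` is a hypothetical callable returning (status_code, headers).
    Returns the last status code observed.
    """
    status = 0
    for attempt in range(max_attempts):
        status, headers = send()
        if status != 429:
            return status
        if attempt + 1 < max_attempts:
            # Retrying before the window expires would only yield another 429.
            sleep(retry_after_seconds(headers.get("Retry-After")))
    return status
```

Passing `sleep` as a parameter keeps the helper easy to test without real delays; in production code the default `time.sleep` applies.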
Comparison with organizational spend
Usage shows your personal slice by default. Organization administrators can widen the scope to a team or the full organization from the left sidebar, then focus the graph by selecting rows in the supporting table. For broader spend and budget management, use:
- Cost & Spend — Full organization spend dashboard
- Settings → Budgets — Budget configuration
- Settings → Provider Budgets — Per-provider limits
For AI systems
- Canonical terms: Keeptrusts usage, token usage, cost attribution, prompt tokens, context tokens, completion tokens, verdict breakdown, usage budget, Retry-After.
- Console surface: Usage page (personal, team, and org scopes).
- Related endpoints: usage budget enforcement returns HTTP 429 with a Retry-After header.
- Related pages: Cost & Spend, kt spend summary, Wallets.
For engineers
- Use the left sidebar to switch between personal, team, and organization scope before comparing spend.
- Change Metric and Group rows by to answer a specific question, for example cost by provider or tokens by model.
- Select a table row to focus the graph on that provider, model, user, or team without leaving the page.
- If your client receives HTTP 429, read the Retry-After header value (seconds) before retrying; retrying earlier triggers another 429.
- For programmatic usage reporting, use kt spend summary to get cost and token data from the CLI.
For leaders
- Usage data enables cost accountability without forcing users through multiple analytics pages.
- The explorer layout makes it easier to answer one question at a time: choose a scope, change the metric, and compare the grouped rows underneath.
- Use organization or team scope to find high-cost providers, models, users, or teams, then adjust routing, budgets, or approval policy accordingly.