Admin · AI

LLM cost dashboard

Token spend by model and capability · 7-day view

Spend last 7d

$19

4.9M tokens

Today's spend

$5

691 requests

Monthly budget

$4K

Alert at 80%

Budget utilization

2.1%

Healthy
$85 projected
Spend by capability
Last 7 days · grouped by AI capability
Embedding$11 · 1,760 req
Contract Intel$5 · 88 req
Vendor Intel$1 · 131 req
Document Gen$1 · 690 req
Brief Intake$1 · 58 req
General$0 · 384 req
Spend by model
4 models in production rotation
7-DAY TOTAL$19
  • GPT-5$11 · 58.2%
  • Claude Opus 4.7$5 · 27.6%
  • Claude Sonnet 4.6$2 · 8.3%
  • Claude Haiku 4.5$1 · 5.8%
Daily trend
Total cost per day across the last 7 days
$0$1$3$4$5May 3May 4May 5May 6May 7May 8May 9$1$1$3$3$3$4$5
Top capabilities by request count
Volume vs. unit economics across capabilities
CapabilityRequestsTotal tokensCostAvg cost / request
Embedding1,7602.3M$11$0.0064
Document Gen6901.5M$1$0.0012
General384612K$0$0.0008
Vendor Intel131201K$1$0.0079
Contract Intel88179K$5$0.0611
Brief Intake58107K$1$0.0101
Token efficiency

Document gen at 5.8% of total cost generates 34.5% of requests — Haiku optimization is paying off.

Across the rotation, 3,111 requests cost $19 over 7 days — averaging $0.0063 per request. Routing high-volume document generation to Haiku keeps unit cost flat as request volume scales; Opus is reserved for contract intel where reasoning depth justifies the premium.