💬 ~50 messages a day
Quick questions, reminders, "what's on my calendar?"
👥 Where most people land
Premium
Claude Sonnet
Anthropic — Best-in-class
$8
/week
✓ Best reasoning & coding · US-based
Smart Pick ⭐
Gemini Flash
Google — Fast & capable
$1
/week
✓ Has a free tier · Fastest response
Budget
DeepSeek V3
DeepSeek — Dirt cheap
$0.33
/week
✓ Absurdly cheap · ✗ Chinese
servers
Free
Llama 4 Scout
Meta — Open source
$0
/week
APIFree (Groq)
Self-host$0
✓ Fully open source · ✗ Needs
GPU to self-host
🔀 FYI: All of these models are accessible through one API — OpenRouter.
One key, all models, swap anytime.
📊 Performance scores based on Chatbot Arena & LMSys benchmark data (coding, reasoning, instruction
following)
5 Ways to Cut Costs
🔀
Route by
Complexity
↓ 60–80%
📦
Compress
Context
↓ 30–50%
🌙
Cheap Model for
Background
~$0.10/day
💾
Prompt
Caching
↓ up to 90%
🛑
Cap Max
Iterations
No runaways
Smart-routed agent — 93% of premium performance
$7–19/wk
Less than Netflix. Runs 24/7.