2026-01-10
4 min
Batch vs Live: A Practical Rulebook to Cut LLM Costs by 50%
We all know OpenAI's Batch API offers a 50% discount. So why aren't you using it? Here is a brutal reality check on when to wait up to 24 hours and when to pay full price.
2026-01-06
5 min
RAG Cost Breakdown: Vector DB and Context Overhead
Why a RAG app can cost $3,400/month instead of $300. The breakdown: vector DB read units, context stuffing, and model selection, plus practical fixes for each.