Blog
Notes from the stack.
Notes on AI infrastructure, GPU economics, agent ops, and the people building on inference.ai.
· LLM cost · AI FinOps
How to Take Control of AI Spend: A Practical Guide to LLM Cost Optimization
Why AI bills spiral and how to fix it — visibility, budget caps, model routing, and failover. A practical guide to LLM cost optimization.
Read article →· AI agents · infrastructure
How to Deploy an AI Agent: A Practical Infrastructure Guide
Where AI agents break in production — and how to host one that stays on. A practical guide to deploying agents the right way.
Read article →· cloud GPU · AI infrastructure
How to Choose the Right Cloud GPU for AI Workloads: A 2026 Comparison Guide
Compare cloud GPUs for AI training and inference — B300 to RTX 4090, hourly vs reserved pricing, and how to pick without overpaying.
Read article →