AI Cost Trends: Where We're Headed
A pragmatic look at AI cost trends in early February 2026, plus what to do about them.
Optimization coverage in this archive spans 6 posts from Aug 2017 to Feb 2026 and deals with structural tradeoffs: coupling, failure boundaries, and long-term change cost. The strongest adjacent threads are performance, cost, and ai. Recurring title motifs include ai, cost, go, and trends.
A pragmatic look at AI cost trends in early February 2026, plus what to do about them.
Price-per-token is the least useful number on your AI bill. Real cost benchmarking starts with your workload, not a provider's pricing page.
Caching LLM responses is the highest-leverage optimization most teams are not doing. Here is how I implement it in Go, with real patterns for keys, invalidation, and safety.
Most Postgres performance problems are indexing problems. The rest are vacuum problems. Here's how to find and fix both.
Practical patterns for squeezing performance out of Go services — profiling, allocation control, bounded concurrency, and HTTP/DB tuning from real production work.
The repeatable process I use at the fintech startup to diagnose and fix database performance problems instead of throwing random indexes at the wall.