LLM Prompt Caching in Go: Cut Costs Without Breaking Things
Caching LLM responses is the highest-leverage optimization most teams are not doing. Here is how I implement it in Go, with real patterns for keys, invalidation, and safety.
Cache-aside, write-through, invalidation strategies, and the failure modes that will wake you up at night. With Go examples.
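To ground what follows, here is a minimal sketch of the cache-aside pattern applied to LLM calls: derive a stable key from the model name and prompt, check the cache, and only call the model on a miss. This is illustrative, not the article's final implementation; the in-memory map, the `Get` signature, and the null-byte key separator are all assumptions for the sketch.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
	"sync"
)

// cacheKey derives a stable, fixed-length key from model + prompt.
// Hashing avoids putting raw prompt text into storage keys.
// (Illustrative: the separator and key scheme are assumptions.)
func cacheKey(model, prompt string) string {
	h := sha256.Sum256([]byte(model + "\x00" + prompt))
	return hex.EncodeToString(h[:])
}

// Cache is a minimal in-memory cache-aside store.
type Cache struct {
	mu sync.RWMutex
	m  map[string]string
}

func NewCache() *Cache { return &Cache{m: make(map[string]string)} }

// Get implements the "look aside" step: check the cache first;
// on a miss, call generate (the LLM call) and store the result.
// The bool return reports whether this was a cache hit.
func (c *Cache) Get(model, prompt string, generate func() (string, error)) (string, bool, error) {
	key := cacheKey(model, prompt)

	c.mu.RLock()
	v, ok := c.m[key]
	c.mu.RUnlock()
	if ok {
		return v, true, nil // hit: no LLM call made
	}

	out, err := generate() // miss: call the model
	if err != nil {
		return "", false, err // never cache errors
	}

	c.mu.Lock()
	c.m[key] = out
	c.mu.Unlock()
	return out, false, nil
}

func main() {
	cache := NewCache()
	gen := func() (string, error) { return "Paris", nil }

	first, hit, _ := cache.Get("gpt-4o", "Capital of France?", gen)
	fmt.Println(first, hit) // first call is a miss

	second, hit, _ := cache.Get("gpt-4o", "Capital of France?", gen)
	fmt.Println(second, hit) // identical call is a hit
}
```

In production you would swap the map for Redis or similar and add TTLs, but the shape stays the same: key derivation, read, generate-on-miss, write.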