<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Law Zava</title><link>https://lawzava.com/</link><description>Systems engineering, AI infrastructure, and technical leadership at scale.</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Fri, 06 Mar 2026 23:38:36 +0000</lastBuildDate><atom:link href="https://lawzava.com/index.xml" rel="self" type="application/rss+xml"/><item><title>AI Startup Landscape 2026</title><link>https://lawzava.com/blog/2026-03-02-ai-startup-landscape/</link><pubDate>Mon, 02 Mar 2026 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2026-03-02-ai-startup-landscape/</guid><description>By early March 2026, the AI startup market looks less like a gold rush and more like a durable industry with clear pressure points. This post lays out where leverage sits, what buyers reward, and what durable execution looks like now.</description></item><item><title>AI Security: Evolving Threats and Defenses</title><link>https://lawzava.com/blog/2026-02-23-ai-security-evolution/</link><pubDate>Mon, 23 Feb 2026 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2026-02-23-ai-security-evolution/</guid><description>As of late February 2026, AI security is defined by adaptive attacks and layered, operational defenses.</description></item><item><title>AI Team Structures That Work</title><link>https://lawzava.com/blog/2026-02-16-ai-team-structures/</link><pubDate>Mon, 16 Feb 2026 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2026-02-16-ai-team-structures/</guid><description>As of mid-February 2026, AI team structures have stabilized into a few workable patterns. This guide explains the models, tradeoffs, and roles that hold up in practice.</description></item><item><title>AI Cost Trends: Where We're Headed</title><link>https://lawzava.com/blog/2026-02-09-ai-cost-trends/</link><pubDate>Mon, 09 Feb 2026 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2026-02-09-ai-cost-trends/</guid><description>A pragmatic look at AI cost trends in early February 2026, plus what to do about them.</description></item><item><title>AI Regulation Is Here. Stop Acting Surprised.</title><link>https://lawzava.com/blog/2026-02-02-ai-regulation-reality/</link><pubDate>Mon, 02 Feb 2026 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2026-02-02-ai-regulation-reality/</guid><description>Regulation isn&amp;amp;rsquo;t a future problem anymore. It&amp;amp;rsquo;s showing up in procurement, security reviews, and internal sign-off. The teams that treat compliance as engineering will ship faster than the ones scrambling to bolt it on.</description></item><item><title>AI-Native Architecture Patterns 2026</title><link>https://lawzava.com/blog/2026-01-26-ai-native-architecture-2026/</link><pubDate>Mon, 26 Jan 2026 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2026-01-26-ai-native-architecture-2026/</guid><description>As of late January 2026, AI-native architecture is a stable discipline with repeatable patterns for delivery, safety, and change management.</description></item><item><title>Building Reliable AI Agents in Go</title><link>https://lawzava.com/blog/2026-01-19-ai-agent-reliability/</link><pubDate>Mon, 19 Jan 2026 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2026-01-19-ai-agent-reliability/</guid><description>Reliable agents aren&amp;amp;rsquo;t prompted into existence. They&amp;amp;rsquo;re engineered &amp;amp;ndash; with bounded tools, validation at every step, explicit recovery paths, and the same discipline you&amp;amp;rsquo;d apply to any production system. Here&amp;amp;rsquo;s how I build them in Go.</description></item><item><title>AI Video Applications in Practice</title><link>https://lawzava.com/blog/2026-01-12-ai-video-applications/</link><pubDate>Mon, 12 Jan 2026 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2026-01-12-ai-video-applications/</guid><description>Video AI is practical for scoped workflows. This post covers what works, how to design for reliability, and where human review still matters.</description></item><item><title>What I Actually Expect from AI in 2026</title><link>https://lawzava.com/blog/2026-01-05-ai-predictions-2026/</link><pubDate>Mon, 05 Jan 2026 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2026-01-05-ai-predictions-2026/</guid><description>Less hype, more plumbing. Agents get real but stay bounded. Routing beats monolithic models. Governance lands on the critical path. And the teams that win will be the ones that treat AI like software, not magic.</description></item><item><title>2025: The Year AI Stopped Being Special</title><link>https://lawzava.com/blog/2025-12-22-year-in-review-2025/</link><pubDate>Mon, 22 Dec 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-12-22-year-in-review-2025/</guid><description>A year-end look at what actually happened in AI &amp;amp;ndash; not the hype, but the operational shift. The novelty phase is over. The infrastructure phase has begun.</description></item><item><title>AI in 2025: The Year It Became Boring (Finally)</title><link>https://lawzava.com/blog/2025-12-08-ai-2025-reflections/</link><pubDate>Mon, 08 Dec 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-12-08-ai-2025-reflections/</guid><description>The most important thing that happened to AI in 2025 wasn&amp;amp;rsquo;t a model release. It was the shift from &amp;amp;lsquo;what can it do&amp;amp;rsquo; to &amp;amp;lsquo;how do we run it.&amp;amp;rsquo; That&amp;amp;rsquo;s progress.</description></item><item><title>Scaling AI in the Enterprise Is a Management Problem</title><link>https://lawzava.com/blog/2025-11-24-ai-enterprise-scale/</link><pubDate>Mon, 24 Nov 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-11-24-ai-enterprise-scale/</guid><description>The technology works. The pilots work. What doesn&amp;amp;rsquo;t work is going from five demos to fifty production features without an operating model. That&amp;amp;rsquo;s not an AI problem &amp;amp;ndash; it&amp;amp;rsquo;s a management problem.</description></item><item><title>AI Incidents Don't Look Like Outages. That's the Problem.</title><link>https://lawzava.com/blog/2025-11-10-ai-incident-management/</link><pubDate>Mon, 10 Nov 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-11-10-ai-incident-management/</guid><description>Your AI system can return 200 OK and still be wrong, unsafe, or confidently hallucinating. Here&amp;amp;rsquo;s how to detect, contain, and learn from AI incidents &amp;amp;ndash; drawing from the same IR principles that work for traditional systems.</description></item><item><title>AI Technical Debt Is Eating Your Team Alive (And You Can't Even See It)</title><link>https://lawzava.com/blog/2025-10-27-ai-technical-debt/</link><pubDate>Mon, 27 Oct 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-10-27-ai-technical-debt/</guid><description>AI debt doesn&amp;amp;rsquo;t look like normal tech debt. It hides in prompts nobody owns, evals nobody runs, and data pipelines nobody watches. By the time you notice, every change feels dangerous.</description></item><item><title>AI Doesn't Make Your Team Faster. Shared Infrastructure Does.</title><link>https://lawzava.com/blog/2025-10-13-ai-team-productivity/</link><pubDate>Mon, 13 Oct 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-10-13-ai-team-productivity/</guid><description>Individual AI speedups are a distraction. The real gains come from treating AI as team infrastructure &amp;amp;ndash; embedded in docs, decisions, and onboarding.</description></item><item><title>Measuring AI ROI Without Lying to Yourself</title><link>https://lawzava.com/blog/2025-09-29-ai-roi-measurement/</link><pubDate>Mon, 29 Sep 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-09-29-ai-roi-measurement/</guid><description>Most AI ROI calculations are fantasy. Here&amp;amp;rsquo;s how to measure honestly: pick one workflow, capture the full cost, tie benefits to outcomes the business already tracks, and report a range instead of a single number.</description></item><item><title>AI Privacy Is a Plumbing Problem, Not a Policy Problem</title><link>https://lawzava.com/blog/2025-09-15-ai-data-privacy/</link><pubDate>Mon, 15 Sep 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-09-15-ai-data-privacy/</guid><description>Privacy in AI systems fails in the implementation details &amp;amp;ndash; what gets logged, who can replay prompts, how long artifacts linger. Treat it as infrastructure, not a compliance checkbox.</description></item><item><title>AI Pair Programming: It's a Junior Dev, Not a Wizard</title><link>https://lawzava.com/blog/2025-09-01-ai-pair-programming/</link><pubDate>Mon, 01 Sep 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-09-01-ai-pair-programming/</guid><description>AI coding assistants are useful when you treat them like a fast, literal junior teammate. Give them constraints, review their output, and stop expecting architectural insight.</description></item><item><title>Running AI Locally: A Practical Guide for Teams Who Care About Control</title><link>https://lawzava.com/blog/2025-08-18-local-ai-development/</link><pubDate>Mon, 18 Aug 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-08-18-local-ai-development/</guid><description>Local AI is no longer a hobby project. Here&amp;amp;rsquo;s how to set it up properly: provider abstraction, versioned models, evaluation harnesses, and cloud fallback for when local isn&amp;amp;rsquo;t enough.</description></item><item><title>AI Workflow Automation: Decisions Are Cheap, Actions Are Expensive</title><link>https://lawzava.com/blog/2025-08-04-ai-workflow-automation/</link><pubDate>Mon, 04 Aug 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-08-04-ai-workflow-automation/</guid><description>The trick to AI workflow automation is simple: let the model decide, let deterministic code act, and never confuse the two.</description></item><item><title>AI Docs That Don't Lie to Your Users</title><link>https://lawzava.com/blog/2025-07-21-ai-documentation-systems/</link><pubDate>Mon, 21 Jul 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-07-21-ai-documentation-systems/</guid><description>Most AI documentation systems retrieve the wrong version, hallucinate details, and never admit uncertainty. Here&amp;amp;rsquo;s how to build one that actually helps.</description></item><item><title>Your AI Metrics Are Measuring the Wrong Thing</title><link>https://lawzava.com/blog/2025-07-07-ai-product-metrics/</link><pubDate>Mon, 07 Jul 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-07-07-ai-product-metrics/</guid><description>Engagement metrics tell you people clicked. They tell you nothing about whether your AI feature actually helped anyone do anything.</description></item><item><title>Stop Fine-Tuning Models You Haven't Bothered to Prompt Properly</title><link>https://lawzava.com/blog/2025-06-23-fine-tuning-when-why/</link><pubDate>Mon, 23 Jun 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-06-23-fine-tuning-when-why/</guid><description>Fine-tuning is the goto move for teams who skipped the basics. Most of the time, better prompts and proper retrieval solve the actual problem.</description></item><item><title>AI Customer Support That Doesn't Make People Hate You</title><link>https://lawzava.com/blog/2025-06-09-ai-customer-support/</link><pubDate>Mon, 09 Jun 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-06-09-ai-customer-support/</guid><description>Most AI support systems are built to deflect tickets. The ones that actually work are built around escalation, grounding, and the simple idea that customers aren&amp;amp;rsquo;t idiots.</description></item><item><title>Your AI Pipeline Is Just ETL With Extra Steps (And That's Fine)</title><link>https://lawzava.com/blog/2025-05-26-ai-data-pipelines/</link><pubDate>Mon, 26 May 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-05-26-ai-data-pipelines/</guid><description>AI data pipelines aren&amp;amp;rsquo;t some new paradigm. They&amp;amp;rsquo;re ETL with a retrieval layer bolted on. The discipline that makes them work is the same discipline that has always made pipelines work: detect change, chunk intelligently, keep indexes fresh.</description></item><item><title>Agent Orchestration: Four Patterns, Honest Tradeoffs</title><link>https://lawzava.com/blog/2025-05-12-ai-agent-orchestration/</link><pubDate>Mon, 12 May 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-05-12-ai-agent-orchestration/</guid><description>Multi-agent systems aren&amp;amp;rsquo;t magic. They&amp;amp;rsquo;re distributed systems with all the usual coordination headaches. Here are the four patterns I&amp;amp;rsquo;ve seen work, and when each one falls apart.</description></item><item><title>AI Security: Same Principles, New Attack Surface</title><link>https://lawzava.com/blog/2025-04-28-ai-security-2025/</link><pubDate>Mon, 28 Apr 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-04-28-ai-security-2025/</guid><description>AI systems are exposed APIs with real blast radius. The threats are injection, leakage, and tool misuse. The defenses are the same ones we&amp;amp;rsquo;ve always needed &amp;amp;ndash; just applied to a new surface.</description></item><item><title>Testing AI Where It Actually Runs</title><link>https://lawzava.com/blog/2025-04-14-ai-testing-production/</link><pubDate>Mon, 14 Apr 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-04-14-ai-testing-production/</guid><description>Offline evals are necessary but not sufficient. Here&amp;amp;rsquo;s how I test AI features in production with shadow mode, canaries, and rollback automation &amp;amp;ndash; with Go code.</description></item><item><title>Your AI System Looks Healthy. It Is Not.</title><link>https://lawzava.com/blog/2025-03-31-ai-observability-deep/</link><pubDate>Mon, 31 Mar 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-03-31-ai-observability-deep/</guid><description>Traditional monitoring will tell you your AI service is up. It won&amp;amp;rsquo;t tell you it&amp;amp;rsquo;s returning confident garbage. Here&amp;amp;rsquo;s what observability actually looks like for AI.</description></item><item><title>MCP in Practice: Building Tool Servers in Go</title><link>https://lawzava.com/blog/2025-03-17-mcp-model-context-protocol/</link><pubDate>Mon, 17 Mar 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-03-17-mcp-model-context-protocol/</guid><description>Model Context Protocol promises to standardize how AI talks to tools. I built an MCP server in Go to see if the promise holds up. Here&amp;amp;rsquo;s what I found.</description></item><item><title>AI Governance That Does Not Suck</title><link>https://lawzava.com/blog/2025-03-03-ai-governance-practice/</link><pubDate>Mon, 03 Mar 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-03-03-ai-governance-practice/</guid><description>Governance that blocks delivery is broken. Governance that makes &amp;amp;lsquo;yes&amp;amp;rsquo; safe and fast is a competitive advantage. Here&amp;amp;rsquo;s how to build the second kind.</description></item><item><title>Video Understanding AI: What Actually Works</title><link>https://lawzava.com/blog/2025-02-17-video-understanding-ai/</link><pubDate>Mon, 17 Feb 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-02-17-video-understanding-ai/</guid><description>I pointed a video understanding pipeline at 200 hours of meeting recordings. The results taught me more about pipeline design than about meetings.</description></item><item><title>AI Code Review Is Mostly Noise</title><link>https://lawzava.com/blog/2025-02-03-ai-code-review/</link><pubDate>Mon, 03 Feb 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-02-03-ai-code-review/</guid><description>I&amp;amp;rsquo;ve been running AI code review on real PRs for months. It catches some real bugs. It also generates a staggering amount of useless commentary.</description></item><item><title>Reasoning Models in Production: A Practical Guide</title><link>https://lawzava.com/blog/2025-01-20-reasoning-models-production/</link><pubDate>Mon, 20 Jan 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-01-20-reasoning-models-production/</guid><description>Reasoning models are powerful but expensive and slow. Here&amp;amp;rsquo;s how I integrate them in Go services with routing, async patterns, and cost controls that actually work.</description></item><item><title>AI in 2025: The Year Discipline Wins</title><link>https://lawzava.com/blog/2025-01-06-ai-trends-2025/</link><pubDate>Mon, 06 Jan 2025 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2025-01-06-ai-trends-2025/</guid><description>The AI hype cycle is over. 2025 is about the teams who can make this stuff actually work in production &amp;amp;ndash; repeatably, measurably, and without burning money.</description></item><item><title>2025 Will Reward the Boring Teams</title><link>https://lawzava.com/blog/2024-12-23-preparing-for-2025/</link><pubDate>Mon, 23 Dec 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-12-23-preparing-for-2025/</guid><description>The AI advantage in 2025 goes to teams that ship measurable workflows, not teams that chase capabilities. The gap is discipline, not technology.</description></item><item><title>2024: The Year AI Got Boring (In a Good Way)</title><link>https://lawzava.com/blog/2024-12-16-year-in-review-2024/</link><pubDate>Mon, 16 Dec 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-12-16-year-in-review-2024/</guid><description>2024 was the year AI stopped being exciting and started being useful. The demo phase ended. The production phase began. Discipline won.</description></item><item><title>Your AI Infrastructure Is Not Special</title><link>https://lawzava.com/blog/2024-12-09-ai-infrastructure-scale/</link><pubDate>Mon, 09 Dec 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-12-09-ai-infrastructure-scale/</guid><description>AI infrastructure at scale is just infrastructure. The same boring patterns &amp;amp;ndash; gateways, caching, circuit breakers, budget enforcement &amp;amp;ndash; solve the same boring problems.</description></item><item><title>Your AI Team Problem Is Not Technical</title><link>https://lawzava.com/blog/2024-12-02-building-ai-teams/</link><pubDate>Mon, 02 Dec 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-12-02-building-ai-teams/</guid><description>Most AI team failures come from unclear ownership and weak evaluation, not missing talent. Structure and discipline beat hiring sprees.</description></item><item><title>Picking an AI Model for Production (Late 2024)</title><link>https://lawzava.com/blog/2024-11-25-ai-model-comparison-2024/</link><pubDate>Mon, 25 Nov 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-11-25-ai-model-comparison-2024/</guid><description>There&amp;amp;rsquo;s no best model. There&amp;amp;rsquo;s the model that fits your workload, latency budget, cost constraint, and ops tolerance. Here&amp;amp;rsquo;s how to compare them.</description></item><item><title>AI Safety Is Just Production Engineering</title><link>https://lawzava.com/blog/2024-11-11-ai-safety-production/</link><pubDate>Mon, 11 Nov 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-11-11-ai-safety-production/</guid><description>AI safety in production isn&amp;amp;rsquo;t a research problem. It&amp;amp;rsquo;s defense in depth, the same way cyber defense works &amp;amp;ndash; layered controls, assumed breach, observable boundaries.</description></item><item><title>Agent Patterns That Survive Production</title><link>https://lawzava.com/blog/2024-10-28-advanced-agent-patterns/</link><pubDate>Mon, 28 Oct 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-10-28-advanced-agent-patterns/</guid><description>Single-prompt agents break on real tasks. Plan-execute-replan, orchestrated specialists, structured memory, and explicit recovery &amp;amp;ndash; in Go &amp;amp;ndash; are what actually works.</description></item><item><title>AI Cost Benchmarking: What Your Bill Actually Tells You</title><link>https://lawzava.com/blog/2024-10-14-ai-cost-benchmarking/</link><pubDate>Mon, 14 Oct 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-10-14-ai-cost-benchmarking/</guid><description>Price-per-token is the least useful number on your AI bill. Real cost benchmarking starts with your workload, not a provider&amp;amp;rsquo;s pricing page.</description></item><item><title>RAG Retrieval That Actually Works</title><link>https://lawzava.com/blog/2024-09-30-retrieval-strategies-rag/</link><pubDate>Mon, 30 Sep 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-09-30-retrieval-strategies-rag/</guid><description>Most RAG failures are retrieval failures. Fixing them requires hybrid search, smarter chunking, query expansion, and reranking &amp;amp;ndash; measured independently from generation.</description></item><item><title>Let AI Write Your First Draft, Not Your Docs</title><link>https://lawzava.com/blog/2024-09-16-technical-documentation-ai/</link><pubDate>Mon, 16 Sep 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-09-16-technical-documentation-ai/</guid><description>AI is a decent drafting assistant for technical docs. It&amp;amp;rsquo;s a terrible replacement for ownership.</description></item><item><title>AI-Assisted Code Migration: What Actually Works</title><link>https://lawzava.com/blog/2024-09-02-ai-code-migration/</link><pubDate>Mon, 02 Sep 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-09-02-ai-code-migration/</guid><description>I used LLMs to help migrate a 200K-line Go codebase. The mechanical parts went fast. Everything else was still hard.</description></item><item><title>How I Actually Test LLM Features</title><link>https://lawzava.com/blog/2024-08-19-llm-testing-strategies/</link><pubDate>Mon, 19 Aug 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-08-19-llm-testing-strategies/</guid><description>LLM outputs are non-deterministic. That doesn&amp;amp;rsquo;t mean you can&amp;amp;rsquo;t test them rigorously. Here&amp;amp;rsquo;s the layered testing approach I use in production.</description></item><item><title>The Best Model Is the Smallest One That Works</title><link>https://lawzava.com/blog/2024-08-05-small-models-big-impact/</link><pubDate>Mon, 05 Aug 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-08-05-small-models-big-impact/</guid><description>Everyone reaches for GPT-4 by default. Most production tasks don&amp;amp;rsquo;t need it. Small models are faster, cheaper, and often better when the task is well-defined.</description></item><item><title>Stop Stuffing Your Context Window</title><link>https://lawzava.com/blog/2024-07-22-context-window-strategies/</link><pubDate>Mon, 22 Jul 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-07-22-context-window-strategies/</guid><description>Bigger context windows aren&amp;amp;rsquo;t an excuse to stop thinking about what goes into them. Most teams are paying for irrelevant tokens and wondering why quality degrades.</description></item><item><title>Function Calling Patterns That Survive Production</title><link>https://lawzava.com/blog/2024-07-08-function-calling-patterns/</link><pubDate>Mon, 08 Jul 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-07-08-function-calling-patterns/</guid><description>Function calling is how LLMs touch real systems. Treat tools like APIs, arguments like untrusted input, and permissions like the model is an intern with root access.</description></item><item><title>Claude 3.5 Sonnet: The Mid-Tier Model That Changes the Math</title><link>https://lawzava.com/blog/2024-06-24-claude-35-sonnet-analysis/</link><pubDate>Mon, 24 Jun 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-06-24-claude-35-sonnet-analysis/</guid><description>Anthropic&amp;amp;rsquo;s Claude 3.5 Sonnet isn&amp;amp;rsquo;t just another model release. It&amp;amp;rsquo;s a bet that the mid-tier can handle what you&amp;amp;rsquo;ve been paying top-tier prices for.</description></item><item><title>AI Compliance Without the Theater</title><link>https://lawzava.com/blog/2024-06-10-ai-compliance-enterprise/</link><pubDate>Mon, 10 Jun 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-06-10-ai-compliance-enterprise/</guid><description>Compliance doesn&amp;amp;rsquo;t have to slow you down. But you have to build it into the system from day one, not bolt it on after the demo impresses the board.</description></item><item><title>Why Your Enterprise AI Pilot Is Stuck</title><link>https://lawzava.com/blog/2024-06-03-enterprise-ai-adoption/</link><pubDate>Mon, 03 Jun 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-06-03-enterprise-ai-adoption/</guid><description>Most enterprise AI projects die between the demo and production. The blockers aren&amp;amp;rsquo;t technical &amp;amp;ndash; they&amp;amp;rsquo;re organizational. Here&amp;amp;rsquo;s what I keep seeing.</description></item><item><title>Building Voice AI That People Actually Use</title><link>https://lawzava.com/blog/2024-05-27-building-voice-ai/</link><pubDate>Mon, 27 May 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-05-27-building-voice-ai/</guid><description>Voice AI is ready to ship. The hard parts are latency, interruptions, and knowing when voice is the wrong interface. Here&amp;amp;rsquo;s how I approach it.</description></item><item><title>GPT-4o Changed the Interface, Not the Hard Part</title><link>https://lawzava.com/blog/2024-05-13-gpt4o-realtime-ai/</link><pubDate>Mon, 13 May 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-05-13-gpt4o-realtime-ai/</guid><description>OpenAI shipped a model that sees, hears, and talks back in real time. The demos look magical. The architecture implications are where it gets interesting.</description></item><item><title>Structured Output from LLMs: A Go Implementation Guide</title><link>https://lawzava.com/blog/2024-04-29-structured-output-patterns/</link><pubDate>Mon, 29 Apr 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-04-29-structured-output-patterns/</guid><description>LLMs generate text, not data structures. Here are the patterns I use in Go to get reliable, validated JSON out of models &amp;amp;ndash; with schemas, retries, and repair loops.</description></item><item><title>Most AI Developer Tools Are Not Worth Adopting Yet</title><link>https://lawzava.com/blog/2024-04-15-ai-developer-tooling/</link><pubDate>Mon, 15 Apr 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-04-15-ai-developer-tooling/</guid><description>The AI tooling landscape is exploding. Most of it adds complexity without removing real friction. Here is how I decide what earns a spot in the stack.</description></item><item><title>Agentic Workflows: From Demo Magic to Production Reality</title><link>https://lawzava.com/blog/2024-04-01-agentic-workflows-production/</link><pubDate>Mon, 01 Apr 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-04-01-agentic-workflows-production/</guid><description>AI agents that can take actions are fundamentally different from chatbots. The engineering bar must match the blast radius.</description></item><item><title>LLM Prompt Caching in Go: Cut Costs Without Breaking Things</title><link>https://lawzava.com/blog/2024-03-25-prompt-caching-strategies/</link><pubDate>Mon, 25 Mar 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-03-25-prompt-caching-strategies/</guid><description>Caching LLM responses is the highest-leverage optimization most teams are not doing. Here is how I implement it in Go, with real patterns for keys, invalidation, and safety.</description></item><item><title>Why I Run Multiple Models in Production</title><link>https://lawzava.com/blog/2024-03-18-multi-model-strategies/</link><pubDate>Mon, 18 Mar 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-03-18-multi-model-strategies/</guid><description>Betting on a single model provider is like having a single database with no failover. Here is why multi-model is the only sane production strategy.</description></item><item><title>Claude 3 First Impressions: Three Models, One Decision Framework</title><link>https://lawzava.com/blog/2024-03-04-claude-3-first-look/</link><pubDate>Mon, 04 Mar 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-03-04-claude-3-first-look/</guid><description>Anthropic shipped three models instead of one. That is actually the most interesting part of the release.</description></item><item><title>LLM Evaluation: Stop Shipping on Vibes</title><link>https://lawzava.com/blog/2024-02-19-evaluating-llm-applications/</link><pubDate>Mon, 19 Feb 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-02-19-evaluating-llm-applications/</guid><description>Your LLM feature looks great in demos and breaks in production. Here is how to build an evaluation loop that catches regressions before your users do.</description></item><item><title>Architecting AI-Native Applications (Without the Delusion)</title><link>https://lawzava.com/blog/2024-02-05-ai-native-architecture/</link><pubDate>Mon, 05 Feb 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-02-05-ai-native-architecture/</guid><description>The architecture of an AI-native app is fundamentally different from bolting a model onto a CRUD app. Here is how I structure them &amp;amp;ndash; with code, layers, and hard-won opinions.</description></item><item><title>Stop Paying OpenAI to Test Your Prompts</title><link>https://lawzava.com/blog/2024-01-22-local-llms-development/</link><pubDate>Mon, 22 Jan 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-01-22-local-llms-development/</guid><description>Local LLMs are finally good enough for development. Use them for iteration, keep the API bills for production.</description></item><item><title>AI Engineering Is Its Own Discipline Now</title><link>https://lawzava.com/blog/2024-01-08-ai-engineering-discipline/</link><pubDate>Mon, 08 Jan 2024 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2024-01-08-ai-engineering-discipline/</guid><description>AI engineering is not ML research with a product hat. It is the discipline of making models behave in production &amp;amp;ndash; and it demands its own skill set.</description></item><item><title>2023: The Year Everything Changed (and I Barely Kept Up)</title><link>https://lawzava.com/blog/2023-12-25-year-in-review-2023/</link><pubDate>Mon, 25 Dec 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-12-25-year-in-review-2023/</guid><description>A personal look back at 2023 &amp;amp;ndash; watching AI reshape the industry in real time, and figuring out what matters next.</description></item><item><title>Your AI Infrastructure Is Not Ready for Scale. Neither Is Mine.</title><link>https://lawzava.com/blog/2023-12-18-ai-infrastructure-scale/</link><pubDate>Mon, 18 Dec 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-12-18-ai-infrastructure-scale/</guid><description>The GPU shortage is real, rate limits are a production constraint, and your AI demo is going to collapse under real traffic. Some annoyed thoughts on infrastructure realism.</description></item><item><title>Multimodal AI: Five Use Cases That Actually Work (and Three That Do Not)</title><link>https://lawzava.com/blog/2023-12-11-multimodal-ai-applications/</link><pubDate>Mon, 11 Dec 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-12-11-multimodal-ai-applications/</guid><description>GPT-4V is out and everyone is building vision features. After testing it across real workflows, here is what ships well and what falls apart.</description></item><item><title>Two Weeks With the Assistants API: What I Like, What I Hate</title><link>https://lawzava.com/blog/2023-12-04-building-with-assistants-api/</link><pubDate>Mon, 04 Dec 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-12-04-building-with-assistants-api/</guid><description>I built three things with the Assistants API. One shipped, one got scrapped, and one taught me where the API&amp;amp;rsquo;s limits really are.</description></item><item><title>OpenAI DevDay Happened and I Have Opinions</title><link>https://lawzava.com/blog/2023-11-27-openai-devday-review/</link><pubDate>Mon, 27 Nov 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-11-27-openai-devday-review/</guid><description>OpenAI DevDay was not just a product launch. It was a platform play that changes the build-vs-buy calculus for every team shipping AI features.</description></item><item><title>I Tracked My AI-Assisted Coding for Three Months. Here Are the Numbers.</title><link>https://lawzava.com/blog/2023-11-13-ai-developer-productivity/</link><pubDate>Mon, 13 Nov 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-11-13-ai-developer-productivity/</guid><description>After three months of tracking Copilot and GPT-4 usage across real projects, the productivity picture is messier than the marketing suggests.</description></item><item><title>LLM Security: A Field Guide for People Who Ship Things</title><link>https://lawzava.com/blog/2023-10-30-llm-security-considerations/</link><pubDate>Mon, 30 Oct 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-10-30-llm-security-considerations/</guid><description>LLMs introduce security failure modes that most teams are not defending against. Prompt injection, data leakage, tool abuse, and cost attacks are real and exploitable today.</description></item><item><title>Responsible AI Is Just Risk Management. Treat It That Way.</title><link>https://lawzava.com/blog/2023-10-16-responsible-ai-development/</link><pubDate>Mon, 16 Oct 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-10-16-responsible-ai-development/</guid><description>Responsible AI is not an ethics committee. It is operational risk management, and teams that treat it otherwise are building liabilities.</description></item><item><title>AI Technical Debt Is Eating Your Codebase (You Just Cannot See It Yet)</title><link>https://lawzava.com/blog/2023-10-02-ai-technical-debt/</link><pubDate>Mon, 02 Oct 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-10-02-ai-technical-debt/</guid><description>AI features create a new species of technical debt that hides in prompts, data pipelines, and model versions. By the time you notice it, the cleanup bill is brutal.</description></item><item><title>Agent Architecture Patterns That Actually Work in Production</title><link>https://lawzava.com/blog/2023-09-18-agent-architecture-patterns/</link><pubDate>Mon, 18 Sep 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-09-18-agent-architecture-patterns/</guid><description>Most agent demos are impressive. Most agent production systems are not. Here is what separates the two.</description></item><item><title>Stop Starting With the Model: AI Product Strategy That Works</title><link>https://lawzava.com/blog/2023-09-04-ai-product-strategy/</link><pubDate>Mon, 04 Sep 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-09-04-ai-product-strategy/</guid><description>Every roadmap I&amp;amp;rsquo;ve seen this quarter has an AI feature. Most of them start with the wrong question. Start with the user problem, not the model.</description></item><item><title>LLM Observability: Your Existing Monitoring Is Not Enough</title><link>https://lawzava.com/blog/2023-08-21-llm-observability/</link><pubDate>Mon, 21 Aug 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-08-21-llm-observability/</guid><description>Traditional monitoring tells you the service is up. It doesn&amp;amp;rsquo;t tell you the model started confidently returning garbage last Tuesday. Here&amp;amp;rsquo;s how to actually observe LLM systems.</description></item><item><title>What I Learned Building AI Features Into a Fintech Product</title><link>https://lawzava.com/blog/2023-08-07-building-ai-features/</link><pubDate>Mon, 07 Aug 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-08-07-building-ai-features/</guid><description>Building AI features at a fintech infrastructure company taught me that the hard part isn&amp;amp;rsquo;t the model. It&amp;amp;rsquo;s defining quality, handling failures gracefully, and resisting the urge to ship a demo as a product.</description></item><item><title>Your LLM Bill Is Your Own Fault</title><link>https://lawzava.com/blog/2023-07-24-ai-cost-optimization/</link><pubDate>Mon, 24 Jul 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-07-24-ai-cost-optimization/</guid><description>Everyone&amp;amp;rsquo;s complaining about LLM costs. Almost nobody has done the basics: caching, model routing, or even measuring what they&amp;amp;rsquo;re spending per feature.</description></item><item><title>Embedding Models Compared: What Actually Matters for Retrieval</title><link>https://lawzava.com/blog/2023-07-10-embedding-models-deep-dive/</link><pubDate>Mon, 10 Jul 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-07-10-embedding-models-deep-dive/</guid><description>I benchmarked five embedding models against the same retrieval task. The results challenged some of my assumptions about model size, cost, and quality.</description></item><item><title>Most AI Startups Are Wrappers. That's the Problem.</title><link>https://lawzava.com/blog/2023-07-03-ai-startup-landscape/</link><pubDate>Mon, 03 Jul 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-07-03-ai-startup-landscape/</guid><description>Everyone has an AI startup now. Having been through two accelerators and founded two companies, I can tell you: most of these will not survive the year.</description></item><item><title>Building Semantic Search in Go: From Embeddings to Production</title><link>https://lawzava.com/blog/2023-06-26-semantic-search-implementation/</link><pubDate>Mon, 26 Jun 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-06-26-semantic-search-implementation/</guid><description>A hands-on walkthrough of building semantic search with Go, OpenAI embeddings, and pgvector. Includes chunking strategies, hybrid retrieval, and the gotchas I hit along the way.</description></item><item><title>Restructuring Engineering Orgs After Layoffs</title><link>https://lawzava.com/blog/2023-06-12-engineering-org-restructuring/</link><pubDate>Mon, 12 Jun 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-06-12-engineering-org-restructuring/</guid><description>Most post-layoff reorgs fail because they reorganize boxes instead of addressing the actual gaps. Here&amp;amp;rsquo;s what I&amp;amp;rsquo;ve seen work this year.</description></item><item><title>AI Code Review: What It Actually Catches (And What It Misses)</title><link>https://lawzava.com/blog/2023-05-29-ai-code-review/</link><pubDate>Mon, 29 May 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-05-29-ai-code-review/</guid><description>After three months of using AI-assisted code review across multiple projects, here&amp;amp;rsquo;s what actually works and what&amp;amp;rsquo;s just noise.</description></item><item><title>Fine-Tuning vs. Prompting: A Decision Framework</title><link>https://lawzava.com/blog/2023-05-15-fine-tuning-vs-prompting/</link><pubDate>Mon, 15 May 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-05-15-fine-tuning-vs-prompting/</guid><description>Most teams should exhaust prompting before they even think about fine-tuning. Here&amp;amp;rsquo;s how to decide which lever to pull.</description></item><item><title>LangChain Is the New ORM: Convenient Until It Is Not</title><link>https://lawzava.com/blog/2023-05-01-langchain-ai-frameworks/</link><pubDate>Mon, 01 May 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-05-01-langchain-ai-frameworks/</guid><description>LangChain promises to simplify LLM development. Instead it adds abstraction layers you will fight against the moment your use case gets real.</description></item><item><title>RAG Patterns That Actually Work in Production</title><link>https://lawzava.com/blog/2023-04-17-rag-architecture-patterns/</link><pubDate>Mon, 17 Apr 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-04-17-rag-architecture-patterns/</guid><description>RAG is the default architecture for grounding LLMs in private data. Here are the patterns that survive real traffic, with Go examples from production systems.</description></item><item><title>Vector Databases: What They Actually Are and When You Need One</title><link>https://lawzava.com/blog/2023-04-03-vector-databases-explained/</link><pubDate>Mon, 03 Apr 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-04-03-vector-databases-explained/</guid><description>A practical guide to vector databases &amp;amp;ndash; what they store, how similarity search works, and the architectural decisions that matter in production.</description></item><item><title>Claude vs GPT: A User's Honest Take</title><link>https://lawzava.com/blog/2023-03-27-claude-responsible-ai/</link><pubDate>Mon, 27 Mar 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-03-27-claude-responsible-ai/</guid><description>Anthropic&amp;amp;rsquo;s Claude takes a different approach to AI safety. Here is how it compares to GPT in practice, from someone using both daily.</description></item><item><title>AI Safety Is Just Security Engineering With Extra Steps</title><link>https://lawzava.com/blog/2023-03-20-ai-safety-for-engineers/</link><pubDate>Mon, 20 Mar 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-03-20-ai-safety-for-engineers/</guid><description>AI safety is not a philosophy problem for engineers. It is reliability, security, and accountability applied to a new kind of system.</description></item><item><title>My First Week Building with GPT-4</title><link>https://lawzava.com/blog/2023-03-06-building-with-gpt4/</link><pubDate>Mon, 06 Mar 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-03-06-building-with-gpt4/</guid><description>GPT-4 landed and everything changed. What I learned in the first week of building with it, and the architecture decisions that followed.</description></item><item><title>Leading Engineering Teams When Nobody Knows What Is Next</title><link>https://lawzava.com/blog/2023-02-20-engineering-leadership-uncertainty/</link><pubDate>Mon, 20 Feb 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-02-20-engineering-leadership-uncertainty/</guid><description>Uncertainty is not new for startups, but 2023 brought it to every engineering org. Here is what actually helps.</description></item><item><title>Prompt Engineering Is Not Engineering</title><link>https://lawzava.com/blog/2023-02-06-prompt-engineering-fundamentals/</link><pubDate>Mon, 06 Feb 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-02-06-prompt-engineering-fundamentals/</guid><description>The term &amp;amp;lsquo;prompt engineering&amp;amp;rsquo; oversells what is essentially clear writing. It is a useful skill, not a discipline.</description></item><item><title>LLM Integration Patterns That Actually Survive Production</title><link>https://lawzava.com/blog/2023-01-23-llm-integration-patterns/</link><pubDate>Mon, 23 Jan 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-01-23-llm-integration-patterns/</guid><description>Practical patterns for integrating LLMs into real applications &amp;amp;ndash; prompt management, structured outputs, caching, fallbacks, and tool use &amp;amp;ndash; with Go examples.</description></item><item><title>AI in Production Is Just Engineering. Treat It That Way.</title><link>https://lawzava.com/blog/2023-01-09-ai-in-production/</link><pubDate>Mon, 09 Jan 2023 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2023-01-09-ai-in-production/</guid><description>ChatGPT changed expectations overnight, but shipping AI features that actually work is an engineering problem, not a model problem.</description></item><item><title>2022: The Year the Music Stopped</title><link>https://lawzava.com/blog/2022-12-26-year-in-review-2022/</link><pubDate>Mon, 26 Dec 2022 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2022-12-26-year-in-review-2022/</guid><description>A personal look back at 2022: building through the downturn, watching ChatGPT arrive, and what the year taught me about building things that last.</description></item><item><title>Your Cloud Bill Is Not a Mystery</title><link>https://lawzava.com/blog/2022-12-19-infrastructure-cost-optimization/</link><pubDate>Mon, 19 Dec 2022 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2022-12-19-infrastructure-cost-optimization/</guid><description>Most cloud cost problems are visibility problems. Fix tagging, kill idle resources, right-size what remains, and make cost a regular engineering conversation.</description></item><item><title>Resilient Teams Are Boring Teams</title><link>https://lawzava.com/blog/2022-12-12-building-resilient-teams/</link><pubDate>Mon, 12 Dec 2022 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2022-12-12-building-resilient-teams/</guid><description>The engineering teams that survived 2022 best were not the ones with the most talent. They were the ones with the least drama.</description></item><item><title>Five Days With ChatGPT</title><link>https://lawzava.com/blog/2022-12-05-chatgpt-future-of-coding/</link><pubDate>Mon, 05 Dec 2022 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2022-12-05-chatgpt-future-of-coding/</guid><description>First impressions of ChatGPT from a working engineer. It is not a search engine, it is not a colleague, and it is definitely not a replacement. But it is something.</description></item><item><title>My Honest Take on GitHub Copilot After Six Months</title><link>https://lawzava.com/blog/2022-11-28-ai-code-assistants-evolution/</link><pubDate>Mon, 28 Nov 2022 00:00:00 +0000</pubDate><guid>https://lawzava.com/blog/2022-11-28-ai-code-assistants-evolution/</guid><description>Six months with Copilot in real projects. What it actually helps with, where it quietly makes things worse, and why the productivity claims are overblown.</description></item></channel></rss>