Machine Learning

Definition

Machine Learning coverage in this archive spans 4 posts from Feb 2018 to Jul 2023 and treats machine learning as a production discipline: evaluation loops, tool boundaries, escalation paths, and cost control. The strongest adjacent threads are ai, embeddings, and go. Recurring title motifs include embedding, models, compared, and retrieval.

Key claims

The archive repeatedly argues that machine learning only creates leverage when it is wired into an existing workflow.
The consistent theme from 2018 to 2023 is disciplined execution over hype cycles.
This topic repeatedly intersects with ai, embeddings, and go, so design choices here rarely stand alone.

Practical checklist

Define quality gates up front: eval sets, guardrails, and explicit rollback criteria.
Start with the newest post to calibrate current constraints, then backtrack to older entries for first principles.
When boundary questions appear, cross-read ai and embeddings before committing implementation details.

Failure modes

Shipping agent behavior without hard boundaries for tools, data access, and approvals.
Optimizing for model novelty while ignoring reliability, latency, or cost drift.
Applying guidance from 2018 to 2023 without revisiting assumptions as context changed.

References

Embedding Models Compared: What Actually Matters for Retrieval

Jul 2023

I benchmarked five embedding models against the same retrieval task. The results challenged some of my assumptions about model size, cost, and quality.

Fine-Tuning vs. Prompting: A Decision Framework

May 2023

Most teams should exhaust prompting before they even think about fine-tuning. Here's how to decide which lever to pull.

Most Teams Are Not Ready for MLOps

Mar 2021

MLOps is real, but most teams buying MLOps tooling cannot even version their training data. Fix the basics first.