Writing · Cuecoder

Writing · 8 entries

From the lab notebook.

Engineering notes from shipping AI in production.

2026-06-13 3 min

How to Pick the Right LLM for Your Agent

Not every agent needs the most capable model. Here's a practical framework for matching model to task — and why the answer almost always changes over time.

modelsagentsarchitecture

→

2026-06-09 3 min

MCP Explained: The Protocol That's Quietly Changing AI Tooling

Model Context Protocol (MCP) is becoming the USB-C of AI integrations. Here's what it is, why it matters, and what it means for how you build agents.

mcpagentstooling

→

2026-06-05 3 min

Why Your AI Agent Should Be Model-Agnostic

Locking your agent to a single model provider is the fastest way to accumulate technical debt. Here's how to build the abstraction right.

agentsarchitecturemodels

→

2026-06-01 2 min

What Is an AI Agent, Actually?

Everyone is shipping 'agents' — but most are just chatbots with a system prompt. Here's the real distinction and why it matters.

agentsfundamentals

→

2026-05-14 2 min

Evals are the product. Everything else is a side effect.

After two years shipping LLM features in production, the only artifact worth trusting is a well-designed eval suite. Here is how Cuecoder structures them.

evalsinfra

→

2026-04-29 2 min

Stop tuning prompts. Start tuning context.

Prompt engineering plateaus fast. The real lever is what you put in front of the model — retrieval, structured tools, and dynamic memory.

retrievalagents

→

2026-03-22 3 min

Building agents without frameworks

LangGraph, Crew, AutoGen — all good demos, none have shipped what production needed. Here's the 200-line loop quietly serving production traffic.

agents

→

2026-02-20 2 min

The 1M context window is a trap (for now)

Bigger context windows are exciting and useless without a retrieval strategy. A pragmatic guide to picking what to put in.

retrieval

→