Agentic Builders

Issue #1 · 3 min read · By Ben

Quiet day on the build beat: LangGraph nudges its event protocol forward while MCP quietly buys its first ad spend.

Mornin'. So somebody finally pointed an MCP-driven agent at a real ad budget and let it run a full video campaign for dentsu, no UI, no human clicking through Criteo's planner. We've been pitching MCP as "the USB-C of agents" for a year and it mostly meant my Claude could finally read my Notion. Now it's spending money. Sleep well.

-Ben

In today's newsletter:

LangGraph's v3 streaming gets sharper
MCP buys its first ad campaign
A 32-point rubric for agent autonomy

FRAMEWORK RELEASES

LangGraph 1.2.0a6 tightens up v3 event streaming

via GitHub

If you've ever tried to debug a multi-agent graph by squinting at a wall of half-typed event payloads, this alpha is for you.

LangGraph dropped 1.2.0a6, an alpha that forwards kwargs cleanly through the stream_events(version='v3') protocol. It also bumps the notebook dependency from 7.5.2 to 7.5.6, which is the kind of housekeeping nobody notices until it breaks.

The v3 stream is the spine of how LangGraph surfaces what a workflow is actually doing in real time: every node entry, every tool call, every interim token. If you're piping those events into a tracer, a UI, or your own observability layer, the kwarg forwarding fix means fewer "where did my metadata go" tickets.

Pre-release tagged on the langgraph repo, so pin explicitly if you pull it
Improvement targets stream_events(version='v3'), not the older v1 / v2 streams
Notebook bump is harmless but worth noting if you lock dev deps
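
For anyone who hasn't hit this class of bug, here is a minimal pure-Python sketch of what "forwarding kwargs through a stream" means. It does not call LangGraph at all: inner_stream, stream_events_wrapper, and the metadata kwarg are hypothetical names made up for illustration, and the real v3 protocol may be shaped quite differently.

```python
from typing import Any, Dict, Iterator


def inner_stream(payload: str, **kwargs: Any) -> Iterator[Dict[str, Any]]:
    """Hypothetical low-level stream: emits one event per word of the payload."""
    for index, token in enumerate(payload.split()):
        # Whatever the caller passed as kwargs is attached to every event.
        yield {"index": index, "token": token, "extra": dict(kwargs)}


def stream_events_wrapper(
    payload: str, *, version: str = "v3", **kwargs: Any
) -> Iterator[Dict[str, Any]]:
    """Hypothetical wrapper: tags each event with a version and forwards kwargs.

    Leaving **kwargs out of the inner call below is the kind of mistake that
    makes caller-supplied metadata silently disappear downstream.
    """
    for event in inner_stream(payload, **kwargs):
        yield {"version": version, **event}


# The metadata kwarg is an assumption for this demo, not a LangGraph parameter.
events = list(
    stream_events_wrapper("hello agent world", metadata={"run_id": "demo"})
)
```

If the wrapper forwards correctly, every event in events carries the caller's metadata, which is what a tracer or UI downstream would be relying on.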

Why it matters: Event streaming is where agentic frameworks earn their keep in production. If you can't see what the graph is doing, you can't ship it.

MCP IN THE WILD

An AI agent ran a full Criteo campaign for dentsu over MCP

MCP just graduated from "cool way to give Claude my filesystem" to "cool way to spend a media budget."

Criteo and dentsu say they ran the first fully orchestrated MCP ad campaign: an AI agent handled audience definition, launch, and optimization end-to-end through secure MCP API calls, with no human logging into a UI. What used to be a multi-day media planning workflow collapsed into a single conversational thread.

The target metric was completed views, so this isn't a synthetic demo. Real budget, real inventory, real performance reporting. That's a different bar than "agent successfully booked a flight on a sandbox."

Agent talked to Criteo's stack purely over MCP, no browser automation hacks
Replaces the traditional planner-driven workflow rather than just augmenting it
First public case of MCP carrying a revenue-generating business process

Why it matters: If you're building MCP servers, the design conversation just shifted from "what tools do we expose" to "what happens when an agent uses them with a credit card attached."
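
One way to think about the "credit card attached" problem is a hard spend cap that sits in front of any tool that moves money. The sketch below is plain Python with no MCP SDK involved: SpendGuard, BudgetExceeded, and launch_campaign_tool are hypothetical names, and nothing here reflects how Criteo or dentsu actually built their integration.

```python
from dataclasses import dataclass


class BudgetExceeded(Exception):
    """Raised when a tool call would push spend past the configured cap."""


@dataclass
class SpendGuard:
    """Hypothetical per-session cap, tracked in integer cents."""

    limit_cents: int
    spent_cents: int = 0

    def authorize(self, amount_cents: int) -> None:
        if amount_cents <= 0:
            raise ValueError("amount must be positive")
        if self.spent_cents + amount_cents > self.limit_cents:
            raise BudgetExceeded(
                f"{amount_cents} would exceed the remaining "
                f"{self.limit_cents - self.spent_cents} cents"
            )
        # Only record the spend once the request has passed the check.
        self.spent_cents += amount_cents


def launch_campaign_tool(guard: SpendGuard, budget_cents: int) -> dict:
    """Hypothetical tool handler: authorize first, then pretend to launch."""
    guard.authorize(budget_cents)
    return {"status": "launched", "budget_cents": budget_cents}


guard = SpendGuard(limit_cents=100_000)
first = launch_campaign_tool(guard, 60_000)

try:
    launch_campaign_tool(guard, 50_000)  # 60,000 + 50,000 is over the cap
    blocked = False
except BudgetExceeded:
    blocked = True
```

The point of the design is that the cap lives in the server, not in the prompt, so the second call fails no matter how persuasive the agent is.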

RUBRIC WATCH

A 32-point scoring system for "how autonomous is this agent, really"

Every team has a hand-rolled rubric for "is this thing actually agentic or is it a for-loop with vibes." Now there's a shared one.

The agentic.ai guide got a refresh on May 4 with a 32-point framework spanning eight dimensions: action capability, autonomy, planning, adaptation, state continuity, reliability, interoperability, and safety. Score 0 to 32, then map to one of six named levels from Reactive Tool up to Strategic Agent.

They've already run 272 tools through the framework, which gives you a reference set to argue with when your director asks where your shiny new orchestrator sits.

Eight dimensions, four points each, deliberately boring math
Six named levels so you stop reinventing tier names in every doc
Explicit principle: higher autonomy is not the goal, fit-to-task is
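
The "deliberately boring math" fits in a few lines. This sketch assumes each of the eight dimensions is scored as an integer from 0 to 4 and the total is a plain sum. The example scores are invented, and the cutoffs for the six named levels are left out because they aren't given here.

```python
from typing import Dict

# Dimension names as listed in the write-up above.
DIMENSIONS = (
    "action capability",
    "autonomy",
    "planning",
    "adaptation",
    "state continuity",
    "reliability",
    "interoperability",
    "safety",
)
MAX_PER_DIMENSION = 4  # assumption: integer scores from 0 to 4 per dimension


def total_score(scores: Dict[str, int]) -> int:
    """Validate one score per dimension and return the 0 to 32 total."""
    missing = set(DIMENSIONS) - set(scores)
    unknown = set(scores) - set(DIMENSIONS)
    if missing or unknown:
        raise ValueError(f"missing={sorted(missing)} unknown={sorted(unknown)}")
    for name, value in scores.items():
        if not 0 <= value <= MAX_PER_DIMENSION:
            raise ValueError(f"{name} must be between 0 and {MAX_PER_DIMENSION}")
    return sum(scores.values())


# Invented scores for an imaginary orchestrator, purely for illustration.
example_scores = {
    "action capability": 3,
    "autonomy": 2,
    "planning": 2,
    "adaptation": 1,
    "state continuity": 3,
    "reliability": 2,
    "interoperability": 4,
    "safety": 3,
}
example_total = total_score(example_scores)
max_total = len(DIMENSIONS) * MAX_PER_DIMENSION
```

Mapping the total onto a named level would be one more lookup once you have the guide's actual cutoffs in hand.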

Why it matters: Common vocabulary for agent capability beats yet another internal Notion page, especially when procurement starts asking what "Level 4" means.

WHAT ELSE IS SHIPPING

CrewAI 1.14.5a2 - pre-release with six async fixes, including the long-standing shared LLM stop-words mutation that bit anyone running multiple agents off one client.
langchain-anthropic 1.4.3 - adds an httpx finalizer to plug resource leaks, and retargets deprecations toward create_agent.
langchain-classic 1.0.5 - maintenance release syncing pyyaml, pytz, and notebook versions across the monorepo. Boring, but the kind of boring that prevents a 2 a.m. dependency resolver fight.

Also from TinyIdeas Media

Agentic Business

Agentic Quality