2026-05-10 — Yesterday I Learned
Give It the Destination, Not the Directions
Four tools and a single idea: when you hand an agent the outcome instead of the steps, you get judgment instead of a script — and that difference compounds.
Turn-by-turn directions tell every move. Telling the destination trusts the GPS. One approach takes over; the other lets the tool do its job.
Two Prompts, Same Task
Process-first
1. First check history. 2. Second look up policy. 3. Third compare. 4. Fourth write reply.
Scripted. Brittle. Grows with every edge case.
Outcome-first
Resolve the issue end-to-end. Success means a decision is made from available data, allowed actions are completed, and the final answer includes X, Y, Z. If evidence is missing, ask for it.
Shorter. Maintainable. Compounds as the library grows.
The outcome-first version gives the model more room to reason. As a prompt library grows, that maintainability compounds — fewer updates when adjacent steps change, because the outcome stays stable even when the path shifts.
Browser
The Browser as a Coworker
Claude Code can connect directly to a browser — not just to check rendered output during implementation, but to run repeatable business-flow validation the way a QA coworker would. The Prompt Warrior tutorial walks through three ways to wire this up.
The immediate use case I want to try: automated validation of promo item entries and market submission flows at DataCamp. These are repetitive, multi-step browser interactions — exactly the kind of work where a browser-connected agent can replace manual spot-checking and catch regressions before they ship.
The Web, Extracted
markdown.new opens as a blank editor. Paste, capture, or extract from any page — then feed structured Markdown directly into agents or build workflows.
Live web surfaces and DataCamp pages become source material the moment you export them. The bridge between a rendered page and a structured build prompt is shorter than it looks.
Videos Are Just HTML Files
Nous Research's Hermes Agent now has an official HyperFrames skill from HeyGen. Because video output is HTML, any agent that already understands HTML can write, edit, and render video the same way it writes web pages.
hermes skills install hyperframes
Document
PDF → Walkthrough Video
Turn any document into an animated step-by-step explainer from a single prompt.
Codebase
Repo → Launch Reel
Point the agent at a codebase and generate a polished video introduction of the project.
Branding
Animated Title Cards
Generate branded, motion-ready title sequences for any video project.
Social
Social Overlays
Dynamic lower-thirds, captions, and callout overlays sized for each platform.
Tutorial
Captioned Talking-Head Videos
Auto-caption any talking-head recording and style it for social or embedded use.
Audio
Audio-Reactive Visuals
Motion graphics that respond to an audio track — all rendered as HTML, all editable as code.
References