Ryan Brandt · January 29, 2026 · 7 min read
The Mechanics of AI-First Engineering
Everyone's talking about 90% AI-written code. Here's what it actually looks like day to day.
AI, Claude Code, Engineering, Productivity, Automation

Ryan Brandt · January 28, 2026 · 10 min read
An Enlarged Intimate Supplement to His Memory
How I built a local context lake that pulls from every conversation source automatically, and why Vannevar Bush's 1945 vision of the Memex finally makes sense.
CRM, AI Agents, Automation, Productivity

Ryan Brandt · December 8, 2025 · 16 min read
AgentMail: Email Infrastructure for the Agentic Era
A deep dive into AgentMail's thesis that agents will become first-class internet users, with email as their primary communication protocol. Based on technical discussions with co-founder Adi Singh.
AI Agents, Email Infrastructure, Agent-to-Agent, YC, Deep Dive

Ryan Brandt · November 12, 2025 · 8 min read
Cursor: The Everything App
I wanted to interview at OpenAI but didn't know anyone there. So I built an agent in Cursor to optimize my cold outreach. That worked. Then I kept building. Now Cursor runs my entire life.
AI, Cursor, Automation, Productivity

Ryan Brandt · October 28, 2025 · 22 min read
Testing LangSmith's Insights Agent: 87.92% Coverage in 35 Minutes
We spent 20 hours with domain experts manually annotating 207 production agent traces to understand failure patterns. Then we tested if LangSmith's Insights Agent could automate this process. It found 87.92% of our failure patterns in 35 minutes.
AI, Evals, LangSmith, Testing, Agent Engineering

Ryan Brandt · October 13, 2025 · 14 min read
The Unknown Unknowns Problem in AI Evaluation
Why automated tests miss the failures that matter most, and how manual error analysis discovers the bugs you never imagined existed.
AI, Evals, Testing, Error Analysis, Engineering

Ryan Brandt · October 10, 2025 · 7 min read
The $500 AI That Just Beat Gemini at Abstract Reasoning
Samsung's 7-million parameter model outperforms giants on ARC-AGI 2. As the lead contributor to that benchmark, here's why this matters and what it means for the future of AI.
AI, Machine Learning, Reasoning, Efficiency, Research

Ryan Brandt · October 8, 2025 · 13 min read
How to Actually Evaluate Your LLM (And Stop Guessing)
A methodological walkthrough using a hypothetical customer service bot to show how to move from vibes-based evaluation to systematic, measurable improvements.
AI, Evals, LLM, Product Design, Engineering

Ryan Brandt · July 29, 2025 · 5 min read
Prompting 101: How to Make a Good Prompt
A practical guide to writing clear, effective prompts that get consistent results from LLMs.
Prompts, AI Development, LLM, Tutorial

Ryan Brandt · July 25, 2025 · 8 min read
The Most Valuable Part of Evals Cannot Be Automated
A simple, non-technical guide to fixing AI agents by analyzing what went wrong, measuring the impact, and improving systematically.
Evals, AI Development, Agentic Workflows, Debugging Agents

Ryan Brandt · July 22, 2025 · 7 min read
Application-Centric Evals: Stop Playing Whack-a-Mole
How to ship something people trust, come back to, and pay for. Inspired by Hamel Husain and Shreya Shankar's course.
Evals, AI Development, LLM, Product

Ryan Brandt · July 3, 2025 · 9 min read
How MCP actually works and why FastMCP is the easiest way to use it
Breaking down how the Model Context Protocol works, why it's structured the way it is, and why FastMCP is the best way to implement it in practice.
MCP, AI Development, Protocol, FastMCP, AI Agents, LangChain

Ryan Brandt · January 20, 2025 · 18 min read
Building High-Quality LLM Judges: A Data-Driven Approach with Claude Code
How we achieved 82% recall with only a 2% generalization gap through 10 iterations of systematic prompt engineering in a single afternoon.
AI, Evals, LLM, Prompt Engineering, Claude Code