Claude Code's underused /goal command for autonomous agent loops
Claude Code's /goal command runs an autonomous loop: take an action, check the goal condition, and repeat until it is met, replacing step-by-step prompting.
LLM Evals: Everything You Need to Know
A sharp-opinion FAQ on LLM evals: skip generic metrics and tooling, do error analysis on your own traces, build a failure taxonomy, then write targeted evalu...
Reflexion: Language Agents with Verbal Reinforcement Learning
Reflexion lets an LLM agent learn from failures without fine-tuning, writing a verbal post-mortem and feeding it back as context, lifting GPT-4 to 91% on Hum...
Supacode: terminal-first agent harness manager for macOS
Supacode is a terminal-first session manager for agent harnesses, built on libGhostty for macOS, that manages many AI coding agents and worktrees at once.
Claude Code Live Artifacts: auto session summary to share
A short clip claims Claude Code now auto-generates a Live Artifact summarizing code and PRs at the end of a session to share with teammates.
Pagecast: publish HTML or Markdown from your coding agent
CLI tool that lets a coding agent like Claude Code or Codex publish HTML or Markdown to your own free Cloudflare Pages account with one confirmation.
Eloquent: Google's free on-device AI voice dictation app
Eloquent is Google's free voice dictation app that turns rambling speech into clean text, running on-device so it stays private and works offline.
Loop Engineering: Run Coding Agents in Iterative Loops
A short teaser for loop engineering, the idea that AI coding agents work best when run in an iterative loop rather than being expected to deliver one-shot results.
Mega-ASR: open-source ASR built for noisy real-world audio
Open-source (Apache-2.0) foundation speech-recognition model built for messy audio, claiming up to ~30% WER gains over SOTA where other models collapse.
Local Qwen isn't a worse Opus, it's a different tool
Local Qwen 27B isn't near-Opus for coding, but it pays off on privacy-bound analysis where cloud models are contractually off-limits.
ralphex: The Extended Ralph Loop for Claude Code and Codex
Open-source MIT orchestrator that drives coding agents through a structured plan, running each task in a fresh session plus multi-agent code review.
RFC 10008: The HTTP QUERY Method
RFC 10008 defines the HTTP QUERY method: a body-based request that is safe and idempotent like GET, so caches and retries work for large queries.
Anthropic launches Claude Fable 5, a safeguarded Mythos-class model
Anthropic launched Claude Fable 5, a Mythos-class model with request-screening safeguards, built for long-running autonomous work beyond coding.
Agentic Loops: Self-Prompting AI Workflows and How They Work
Loop engineering is an agentic workflow where you set one goal and let agents self-prompt through discovery, planning, parallel execution, and verification.
Claude Code Dynamic Workflows run hundreds of parallel agents
Claude Code Dynamic Workflows runs hundreds of parallel sub-agents in one session, compressing multi-day coding into a day at much higher token cost.
Architecting on Cloudflare
An independent book on Cloudflare's Developer Platform covering architectural decision frameworks, trade-off analysis, and migration playbooks from AWS, Azur...
Claude Code Workflows: script subagents in loops and sequences
Claude Code Workflows lets a workflow.js script invoke subagents in loops, in sequence, or in parallel, with structured output schemas to chain agents reliably.
Undocumented Claude Code Config Fields Found in the Source Code
Dozens of undocumented Claude Code config fields hidden in the npm source: command-rewriting hooks, persistent agent memory, and a plain-English auto-approva...
Introducing Gemma 4 12B: a unified, encoder-free multimodal model
Google's Gemma 4 12B is an open Apache 2.0 model that handles text, vision, and audio with no separate encoders, fitting in 16 GB of memory while nearing 26B benchmarks.
Lathe: LLM-generated hands-on technical tutorials you build yourself
Lathe is a Go CLI and LLM skill system that generates multi-part hands-on tutorials on demand, then serves them locally so you do the work by hand.