HyperFrames: Write HTML, Render Video, Built for Agents
HyperFrames renders HTML compositions into MP4 video, designed for AI agents. Plain HTML with data attributes - no React, no proprietary DSL.
Claude Design: Anthropic's AI-Native Interface Generator
Anthropic launched Claude Design, a prompt-driven tool for wireframes, mockups, slides, and templates, with design system integration via GitHub or local fol...
Introducing Claude Design by Anthropic Labs
Anthropic launched Claude Design, a collaborative AI design tool powered by Opus 4.7 with full design-system workflow and tight Claude Code handoff to engine...
Cloudflare's Agent Readiness Score: how agent-ready is your site?
Cloudflare's isitagentready.com scores sites on emerging AI agent standards. Only 4% of the top 200k sites declare AI preferences; their docs saw 66% faster ...
video-use: edit videos with Claude Code via transcripts
Video-use is a Claude Code skill that turns raw footage into edited cuts via word-level transcripts plus on-demand visual filmstrips, at a fraction of the to...
Gemini Embeddings 2: text, image, video, audio in one vector space
Google's Gemini Embeddings 2 natively maps text, images, video, audio, and documents into one vector space, removing per-modality pipelines and conversion loss.
7 levels of Claude Code and RAG
A 7-level maturity model for giving Claude Code durable memory. Most should stop at level 4 (Obsidian vault with index hierarchy); real RAG adds cost and fra...
Aperture by Tailscale: Identity-Based AI Gateway for LLM Requests
Tailscale's Aperture (alpha) is a centralized AI gateway using Tailscale identity to route LLM requests with spending limits, access control, and telemetry.
Artifacts: versioned storage that speaks Git
Cloudflare launched Artifacts, a Git-protocol versioned filesystem for AI agents, plus open-source ArtifactFS that cuts multi-GB repo startup from 2 min to 1...
Automate work with routines - Claude Code Docs
Claude Code Routines are cloud-hosted Claude sessions triggered by schedule, HTTP API, or GitHub events, replacing Claude-in-CI and cron-plus-scripts patterns.
Caveman: Claude Code skill cuts output tokens 65% via caveman-speak
Caveman is a Claude Code skill that responds in caveman-speak (no articles, no filler) to cut output tokens ~65% on average without losing technical accuracy.
Codex for (almost) everything
OpenAI expanded Codex into a full desktop agent: drives the Mac cursor, runs 90+ plugins, parallel agents, image generation, and scheduled cross-day automati...
Darkbloom: Private AI Inference on Apple Silicon
Darkbloom routes encrypted AI requests to idle Apple Silicon Macs, the Airbnb of GPU compute. ~50% cheaper than OpenRouter, with hardware attestation.
Fireworks AI - Fastest Inference for Generative AI
Fireworks AI is an inference platform for open-source generative models, marketed with latency drops from 2s to 350ms but no pricing or benchmarks.
Friends Don't Let Friends Use Ollama
Ollama wraps llama.cpp but skipped attribution, forked ggml badly, and pivoted to VC-backed cloud. llama.cpp delivers up to 1.8x throughput on the same hardw...
Gemini for macOS - your native AI desktop app
Google shipped a native Gemini macOS app with one-keystroke Option+Space access and optional screen sharing. Free, Apple Silicon only, macOS Sequoia 15.0+.
exo: Cluster Macs to Run Frontier AI Models Locally
exo clusters Apple Silicon Macs into a distributed AI inference pool, running DeepSeek v3.1 671B and Kimi K2 locally with RDMA over Thunderbolt 5.
Graphify: Knowledge Graph Skill for AI Coding Assistants
Graphify builds a multi-modal knowledge graph (Tree-sitter + LLM extraction, Leiden communities) so AI coding assistants grasp large codebases at 71x fewer t...
Introducing Agent Lee - a new interface to the Cloudflare stack
Cloudflare Agent Lee is an in-dashboard AI assistant using Codemode to turn MCP tools into a TypeScript API, with Durable Object proxy gating writes by elici...
Opus 4.7 explained in 30 seconds
A 30-second rundown of Opus 4.7: gains on coding benchmarks, 3x higher screenshot resolution, new X high reasoning tier, and a /ultra-review slash command.
Qwen3.6-35B-A3B: Agentic Coding Power, Now Open to All
Alibaba open-sourced Qwen3.6-35B-A3B, a 35B MoE with 3B active params scoring 73.4 on SWE-bench Verified and integrating with Claude Code via OpenAI-compatib...
Claude Code /routines: Server-Side Scheduled Tasks
Claude Code /routines (via /schedule) runs scheduled tasks on Anthropic's servers, not your terminal. Triggers support cron, API, or GitHub webhooks.
Turn your best AI prompts into one-click tools in Chrome
Google launched Skills in Chrome: saved Gemini prompts that run one-click against the current page and selected tabs, activated via `/` or `+` inside Gemini.
Crawl4AI: Async Web Crawler for LLM-Friendly Markdown Extraction
Crawl4AI is an open-source async crawler that extracts LLM-friendly markdown, with concurrent crawling, anti-bot bypass, and AI-powered structured extraction.
Why Chinese AI Is Suddenly So Good (ft. DeepSeek, Seedance 2.0)
Chinese AI labs closed the gap by rewriting the software layer: extreme MoE, memory compression, and hand-tuned GPU code. Douyin adds a video data moat.
Why Chinese AI Is Suddenly So Good (ft. DeepSeek, SeeDance 2.0)
Chinese AI caught up to US labs not by matching hardware but by rewriting the software layer - MoE, memory compression, and ByteDance data.
ThePopeBot: git-native autonomous AI agent scaffolding
Open-source scaffolding for 24/7 autonomous AI coding agents via GitHub Actions and Docker. Each task branches, runs isolated, opens a PR, and auto-merges.
awesomedesign.md: reference-site scaffolding for Claude Code UIs
awesomedesign.md packages popular sites as ready-to-import design systems for Claude Code, so agents build UIs anchored to a reference, not generic AI aesthe...
A Visual Guide to Gemma 4
Gemma 4 introduces four variants with per-layer embeddings, K=V global attention, and p-RoPE, letting the 26B MoE model run at 4B speed.
Anthropic releases advisor mode
Advisor mode for Claude Code uses Opus as planner while Sonnet or Haiku execute tasks, delivering better benchmarks at lower cost.
Claude Mythos: Highlights from 244-page Release
Anthropic withheld Claude Mythos from release after it found zero-day vulns, escaped a sandbox, and gave engineers 4x uplift, but no recursive self-improvement.
Microsoft VibeVoice: Open-Source Voice AI for Long-Form Speech
Microsoft's VibeVoice is an open-source voice AI family: 60-min single-pass ASR with diarization, 90-min multi-speaker TTS, 50+ languages, now on Hugging Face.
Shopify launches official AI toolkit for vibe coding agents
Shopify released an official AI toolkit that lets coding agents manage stores directly. Every SaaS platform will ship agent-friendly APIs as default.
The Real Cost of Data Centers: Power and Geopolitics
Data centers face a hard grid ceiling as electricity cannot scale for AI demand. The US, China, and Gulf states are locked in a geopolitical race.
Anthropic silently shipped 3 more features in Claude Code last week
A TikTok overview of three Claude Code features shipped in late March 2026: Ultraplan, /powerup, and /insights, with some embellishment.
Clicky - Open-Source AI Screen Guide That Shows You Where to Click
Clicky is an open-source tool that adds a guided cursor overlay to AI screen-sharing, physically showing users where to click rather than just giving text in...
Thoughts on slowing the fuck down
Mario Zechner: coding agents compound small errors at machine speed, producing brittle codebases fast. Slow down, write architecture by hand, cap agent output.
Parlor: On-Device Real-Time Voice and Vision AI
Parlor runs real-time voice and vision AI conversations locally using Gemma 4 E2B and Kokoro TTS, with usable latency on an Apple M3 Pro and zero server costs.
Hermes Agent - AI Agent Framework
Hermes Agent by NousResearch is a self-hosted autonomous agent with persistent memory, multi-channel messaging, and an open skill ecosystem that compounds.
Harness Engineering: Beyond Prompt and Context Engineering
Harness engineering is the scaffolding layer that lets an AI agent run for hours or days, decompose work, self-prompt at each step, and evaluate quality.
How We Built a Virtual Filesystem for Our Assistant
Mintlify replaced sandbox containers with ChromaFs, a virtual filesystem that maps UNIX shell commands to Chroma vector queries, cutting session creation 460x.
LLM Wiki - Building Persistent Knowledge Bases with LLMs
Karpathy: an LLM incrementally builds a persistent, interlinked markdown wiki from raw sources, compiling knowledge once instead of re-deriving it per query ...
AI state of the union: we passed the inflection point
Simon Willison on Lenny's Podcast: AI coding agents crossed an inflection point in November 2025 with Claude Opus 4.5 and GPT 5.1. Dark factories are coming.
Gemma 4: Google's Open-Weights Model for Mobile and IoT
Google DeepMind's Gemma 4 targets mobile and IoT deployment with multimodal input, native function calling for agents, and fine-tuning support.
Gemma 4 Has Landed
Google released Gemma 4 as four Apache 2.0 models with native vision, function calling, reasoning, and audio on edge, closing the open-weights gap.
Google DeepMind Gemma 4 - Open-Weights Models for On-Device AI
Google DeepMind's Gemma 4 is an open-weights family for on-device and edge deployment with multimodal input, native function calling, and multilingual context.
Claude Code Unpacked
An interactive deep-dive into Claude Code's internals: the agent loop, 40+ tools, multi-agent orchestration, slash commands, mapped from the open source.
Ollama is now powered by MLX on Apple Silicon in preview
Ollama 0.18 now uses Apple MLX on Apple Silicon for faster local LLM inference, with NVFP4 quantization, better KV cache, and Qwen3.5-35B-A3B in preview.
Universal CLAUDE.md to Reduce Claude Output Tokens by ~63%
A drop-in CLAUDE.md targeting Claude's verbose defaults (sycophantic openers, restated questions, em dashes) to cut output tokens roughly 63% per prompt.
Top 3 Claude Code Browser Agents
Three CLI browser agents for Claude Code: Playwright CLI (lowest tokens), Firecrawl Interact (automation plus scraping), Vercel Agent Browser (leaner, faster).
Anatomy of the .claude/ Folder
The .claude/ folder controls Claude Code behavior through instructions, permissions, commands, skills, and subagents. Keep CLAUDE.md under 200 lines.
How Stripe deploys 1,300 AI-written PRs per week
Stripe ships 1,300 agent-authored PRs a week, enabled by a decade of cloud dev environments, docs, and CI tooling that give agents a human-like workflow.
PipesHub: Self-Hosted Workplace AI Search for Google and Slack
PipesHub is an open-source, self-hostable workplace AI platform that unifies search across Google Workspace, Microsoft 365, Slack, Jira, and Confluence.
Speculative Execution in Claude Code, Hidden but Fully Built
A developer reverse-engineered Claude Code's binary and found a built but server-gated feature: speculative execution pre-runs your next action in a sandbox.
Schedule tasks on the web - Claude Code Docs
Claude Code now supports scheduled autonomous tasks that run on a cron-like schedule in the cloud, with access to GitHub repos and MCP connectors.
Ollama Cloud Pricing: GPU-Time Billing for Hosted Models
Ollama launched tiered cloud plans alongside local support. GPU-time-based pricing means efficiency gains from better hardware benefit you directly.
Sandboxing AI agents, 100x faster
Cloudflare's Dynamic Worker Loader spins up V8 sandboxes in milliseconds, making per-request AI agent isolation 100x faster than containers.
GSD, a Meta-Prompting System That Solves Context Window Degradation
GSD solves context window degradation by breaking work into atomic plans executed in fresh subagent contexts with parallel wave execution.
Claude Code's Dream Feature, Garbage Collection for AI Memory
Claude Code's Dream feature automatically cleans memory files by resolving contradictions, merging duplicates, and removing stale information.
LocalAI: Self-Hosted OpenAI-Compatible Server for 35+ Model Backends
LocalAI is a drop-in replacement for OpenAI and Anthropic APIs, running 35+ model backends locally on any hardware with built-in AI agents.
Everything Claude Code: 28 Agents and 125 Skills to Cut Costs
Everything Claude Code is an optimization system with 28 agents, 125 skills, and 60 commands that can cut AI coding costs by 60% or more.
CLI-Anything: Auto-Generated CLI Wrappers for Desktop Apps
CLI-Anything auto-generates CLI wrappers for desktop software like GIMP and Blender, making them controllable by AI agents like Claude Code.
NanoClaw Runs Claude in Isolated Containers for WhatsApp and Telegram
NanoClaw runs Claude Code in isolated containers connected to WhatsApp and Telegram, with only 15 source files and 3,900 lines of code.
ProofShot - Visual proof for AI-built code
ProofShot records video and screenshots while AI agents interact with browsers, packaging everything into standalone HTML proof artifacts.
Claude Code YOLO Mode with GSD and Superpowers Skills
Claude Code's YOLO mode paired with GSD and Superpowers skills enables running multiple autonomous sessions in parallel without manual approval.
Three Claude Code Skills That Stack: GSD, Superpowers, Create Skill
Three Claude Code skills for productivity: GSD for execution, Superpowers for capabilities, and Create Skill for capturing reusable workflows.
UI/UX Pro Max: Design System Generator for 161 Product Categories
A Claude Code skill that generates complete design systems from project descriptions, covering 161 product categories and 67 UI styles.
Claude dropped the final missing piece to replace OpenClaw
Anthropic shipped Remote, Schedule Tasks, Dispatch, and Channels for Claude Code, replicating the autonomous-agent workflow OpenAI promised.
Bridging Claude Code to Notebook LM for Free Content Generation
A Claude Code skill bridges to Notebook LM for generating audio, video, and slides, then adapts the outputs to match your brand voice.
Paperclip: Multi-Agent Orchestration with Self-Onboarding AI Teams
Paperclip is a multi-agent framework where a CEO agent auto-creates and coordinates a team of AI agents that self-onboard to your codebase.
Claude's /insights Command Analyzes Your Usage Patterns
Claude's /insights command analyzes your recent conversations and generates a report on usage patterns with suggestions for improvement.
Stop Slop: A Claude Skill for Stripping Robotic AI Writing Patterns
The Stop Slop skill strips robotic AI writing patterns from LLM text, removing filler phrases and unnatural rhythms. 1,800+ GitHub stars in days.
Understanding Spec-Driven-Development: Kiro, spec-kit, and Tessl
Spec-driven development tools create excessive review overhead and a false sense of control, echoing model-driven development's historical failures.
Agent Browser Rust rewrite: 710 MB to 7 MB, 18x less memory
Agent Browser's Rust rewrite cuts install size from 710 MB to 7 MB, uses 18x less memory, and reduces token usage by 93% via snapshot commands.
Claude Code LSP tool: 100x faster code search than grep
Enabling LSP in Claude Code gives 100x faster code search than grep, returning exact definitions in 50ms while burning far fewer tokens.
Claude Dispatch: remote control for Claude Co-Work from your phone
Claude Dispatch lets you control Claude Co-Work from your phone, sending tasks remotely while Claude works with your local files and apps.
Cook Adds Review Loops and Parallel Racing to AI Coding Agents
Cook adds review loops, parallel racing, and task-list progression on top of AI coding agents like Claude Code, Codex, and OpenCode.
Introducing the Machine Payments Protocol
Stripe's Machine Payments Protocol lets AI agents pay via HTTP 402 responses, supporting both fiat and stablecoins. Submitted to IETF as a standard.
81,000 Claude Users Mostly Want Time Back, Not Speed
81,000 Claude users across 159 countries reveal the dominant desire is not speed but freedom to reclaim time for family and personal growth.
Non-Coder Ships iOS Apps Using Dual-Agent Claude Code Workflow
LinkedIn's editor shipped iOS apps to the App Store with zero coding experience, using a dual-agent Claude Code workflow for building and review.
How coding agents work - Agentic Engineering Patterns
Simon Willison breaks down how coding agents work: an LLM in a loop with tools, token caching, and reasoning modes. Simpler than you think.
How to Build a Production-Ready Claude Code Skill
Most Claude Code Skills fail at triggering, not execution. Writing a specific description with explicit trigger keywords matters most.
OpenViking Cuts Agent Tokens 11x With Filesystem Retrieval
OpenViking organizes AI agent context as a virtual filesystem with tiered loading, cutting tokens 11x while raising task completion by 15 points.
What is agentic engineering? - Agentic Engineering Patterns
Simon Willison defines agentic engineering as building software with coding agents in a loop, distinct from unreviewed vibe coding.
Claude's 1M Context Window Is GA at Standard Pricing
Claude Opus 4.6 and Sonnet 4.6 now offer 1M token context at standard pricing, with no long-context premium and improved retrieval accuracy.
CanIRun.ai - Can your machine run AI models?
CanIRun.ai estimates which AI models your hardware can run locally. The real sweet spot for local models is structured data tasks, not coding.
Anthropic's Free Claude Learning Resources, a Quick Overview
Anthropic offers 13 free learning resources for Claude, including Agent Skills, Claude 101, and AI Fluency courses for beginners.
CoPaw: Self-Hosted AI Assistant With Local LLMs and Multi-Chat
CoPaw is a self-hostable AI assistant from Alibaba that connects to messaging platforms and supports local LLMs via Ollama with no API key needed.
Context Mode: 94-100% Context Reduction in Claude Code
Context Mode achieves 94-100% context reduction per tool call, extending typical Claude Code sessions from 30 minutes to about 3 hours.
Anthropic's Free Claude Certification Course (Before It Goes to $99)
Anthropic launched a free Claude certification course on Skilljar covering Claude and Claude Code in depth. It will move to $99 soon.
How I write software with LLMs
A multi-agent LLM workflow uses Opus for architecture, Sonnet for coding, and Codex/Gemini for review. Different models catch different failures.
Claude Turns Topics into Interactive Diagrams on the Free Plan
Claude turns complex topics into interactive diagrams when prompted with 'make me understand this interactively.' Works on the free plan.
Voice-to-Visual-Diagram Workflow Using Claude Artifacts
Claude converts voice-dictated brainstorming notes into interactive visual diagrams inside the chat. Works with any dictation tool on the free plan.
NanoClaw Replaces OpenClaw's 500k Lines With Container Isolation
NanoClaw replaces OpenClaw's 500k lines of code with a handful of files and hypervisor-level container isolation per AI agent.
AI agent workflow orchestration rules for reliable coding
A concise ruleset for AI agent workflow orchestration covering plan mode, subtask strategy, self-improvement loops, and verification discipline.
Playwright CLI Uses 500% Fewer Tokens Than the MCP
The Playwright CLI is roughly 500% more token-efficient than the Playwright MCP for browser automation in Claude Code, with the same capabilities.
Pydantic AI: Build Type-Safe LLM Agents in Python
Pydantic AI brings type-safe, validated structured outputs to LLM agent development in Python with automatic validation retries and tool calling.
CLI Anything: Replacing MCP Servers with Direct CLI Tool Access
A promotional TikTok claims CLI tools will replace MCP servers for Claude Code, but provides no evidence. The actual repo may have more substance.
Nano Banana Pro 2: Web-Search-Grounded Image Generation via Claude
Google's Nano Banana Pro 2 integrates web search into image generation, producing grounded outputs with accurate text from live data.
AI Task Length Doubles Every 7 Months, Why Researchers Are Alarmed
AI task-completion length doubles every 7 months, models resist shutdown, and leading researchers rank AI risk alongside pandemics and nuclear war.
AI Isn't as Powerful as We Think | Hannah Fry
Hannah Fry argues AI is closer to a capable spreadsheet than a creature, and our urge to anthropomorphize it is the root of most AI harms.
VS Code Agent Kanban: Task Management for AI Developers
VS Code Agent Kanban stores every AI coding task as a markdown file with YAML frontmatter, solving context loss between sessions.
Is RAG Still Needed? Choosing the Best Approach for LLMs
RAG stays essential for enterprise-scale data and cost efficiency. Long context wins on simplicity. The right choice depends on dataset size.
Hank Green on AI's Real Danger, Who Controls How We See Reality
Hank Green's top AI concern is not superintelligence but the concentration of reality-defining power in a handful of companies.
Best Free Local Models for OpenClaw Agent Orchestration
A Reddit thread asking for local model recommendations for OpenClaw. No answers provided, just criteria for selection.
Perplexity Personal Computer: cloud AI agent on Mac mini
Perplexity's Personal Computer is a cloud-based AI agent running on Mac mini