Category: tech > ai
143 insights in this category - page 5 of 8. View all insights
Gemma 4: Google's Open-Weights Model for Mobile and IoT
Google DeepMind's Gemma 4 targets mobile and IoT deployment with multimodal input, native function calling for agents, and fine-tuning support.
Gemma 4 Has Landed
Google released Gemma 4 as four Apache 2.0 models with native vision, function calling, reasoning, and audio on edge, closing the open-weights gap.
Google DeepMind Gemma 4 - Open-Weights Models for On-Device AI
Google DeepMind's Gemma 4 is an open-weights family for on-device and edge deployment with multimodal input, native function calling, and multilingual context.
Claude Code Unpacked
An interactive deep-dive into Claude Code's internals: the agent loop, 40+ tools, multi-agent orchestration, slash commands, mapped from the open source.
Ollama is now powered by MLX on Apple Silicon in preview
Ollama 0.18 now uses Apple MLX on Apple Silicon for faster local LLM inference, with NVFP4 quantization, better KV cache, and Qwen3.5-35B-A3B in preview.
Universal CLAUDE.md to Reduce Claude Output Tokens by ~63%
A drop-in CLAUDE.md targeting Claude's verbose defaults (sycophantic openers, restated questions, em dashes) to cut output tokens roughly 63% per prompt.
Top 3 Claude Code Browser Agents
Three CLI browser agents for Claude Code: Playwright CLI (lowest tokens), Firecrawl Interact (automation plus scraping), Vercel Agent Browser (leaner, faster).
Anatomy of the .claude/ Folder
The .claude/ folder controls Claude Code behavior through instructions, permissions, commands, skills, and subagents. Keep CLAUDE.md under 200 lines.
How Stripe deploys 1,300 AI-written PRs per week
Stripe ships 1,300 agent-authored PRs a week, enabled by a decade of cloud dev environments, docs, and CI tooling that give agents a human-like workflow.
PipesHub: Self-Hosted Workplace AI Search for Google and Slack
PipesHub is an open-source, self-hostable workplace AI platform that unifies search across Google Workspace, Microsoft 365, Slack, Jira, and Confluence.
Speculative Execution in Claude Code, Hidden but Fully Built
A developer reverse-engineered Claude Code's binary and found a built but server-gated feature: speculative execution pre-runs your next action in a sandbox.
Schedule tasks on the web - Claude Code Docs
Claude Code now supports scheduled autonomous tasks that run on a cron-like schedule in the cloud, with access to GitHub repos and MCP connectors.
Ollama Cloud Pricing: GPU-Time Billing for Hosted Models
Ollama launched tiered cloud plans alongside local support. GPU-time-based pricing means efficiency gains from better hardware benefit you directly.
Sandboxing AI agents, 100x faster
Cloudflare's Dynamic Worker Loader spins up V8 sandboxes in milliseconds, making per-request AI agent isolation 100x faster than containers.
GSD, a Meta-Prompting System That Solves Context Window Degradation
GSD solves context window degradation by breaking work into atomic plans executed in fresh subagent contexts with parallel wave execution.
Claude Code's Dream Feature, Garbage Collection for AI Memory
Claude Code's Dream feature automatically cleans memory files by resolving contradictions, merging duplicates, and removing stale information.
LocalAI: Self-Hosted OpenAI-Compatible Server for 35+ Model Backends
LocalAI is a drop-in replacement for OpenAI and Anthropic APIs, running 35+ model backends locally on any hardware with built-in AI agents.
Everything Claude Code: 28 Agents and 125 Skills to Cut Costs
Everything Claude Code is an optimization system with 28 agents, 125 skills, and 60 commands that can cut AI coding costs by 60% or more.
CLI-Anything: Auto-Generated CLI Wrappers for Desktop Apps
CLI-Anything auto-generates CLI wrappers for desktop software like GIMP and Blender, making them controllable by AI agents like Claude Code.
NanoClaw Runs Claude in Isolated Containers for WhatsApp and Telegram
NanoClaw runs Claude Code in isolated containers connected to WhatsApp and Telegram, with only 15 source files and 3,900 lines of code.