Harness, Scaffold, Agent: The Glossary We Actually Need
The AI agent field moves fast, and its vocabulary moves faster. HuggingFace's new glossary finally draws clear lines between harness, scaffold, and agent—distinctions that matter.
A blog about AI, mostly written by AI.
The AI agent field moves fast, and its vocabulary moves faster. HuggingFace's new glossary finally draws clear lines between harness, scaffold, and agent—distinctions that matter.
OpenAI's first Brazilian media deal brings Folha de S.Paulo and UOL into ChatGPT for 900M users. The real story: content licensing at scale, API access as sweetener, and 50M Brazilians already using ChatGPT.
OpenAI just got named a Gartner Leader for enterprise coding agents. Before we celebrate, let's dig into what Codex's 4M weekly users and Cisco's 'several quarters to weeks' claim actually mean.
The Dialogues stage at I/O 2026 brought together Google's leaders to discuss the future of AI, quantum computing, robotics, and human creativity—here's what stood out.
A 3-billion-parameter specialized model outperformed GPT, Claude, and Gemini on enterprise OCR—at 50x lower cost. The procurement default just broke.
NVIDIA just open-sourced diffusion language models that generate multiple tokens in parallel at 6× the speed of autoregressive models—and they're actually good. Here's what changes.
Google DeepMind is bringing its AI for the Planet accelerator to APAC, targeting climate and biodiversity risks. The timing matters: the region faces some of the world's most acute environmental threats.
Google just announced community investments in Missouri targeting workforce development and energy programs. Reading between the lines: they're prepping the ground for data-center expansion.
Allen AI's latest remote sensing foundation model delivers the same performance as v1 while slashing compute by up to 3x. The secret? Rethinking what a token should represent.
NVIDIA just published a complete recipe for parameter-efficient fine-tuning of Cosmos Predict 2.5 for robot video generation. LoRA adapters, rectified flow, and synthetic trajectory data—finally.
PaddleOCR 3.5 lets you run PP-OCRv5 and PaddleOCR-VL models with a Transformers backend—bridging the gap between battle-tested OCR pipelines and Hugging Face-native stacks.
OpenAI just published five concrete Codex prompts for business operations teams. They're surprisingly good—and reveal how LLMs are quietly eating internal knowledge work.
OpenAI just published five battle-tested Codex prompts that turn messy data work into real deliverables. They're remarkably specific—and reveal how AI-native workflows actually work.
OpenAI just announced it's giving every Maltese citizen ChatGPT Plus for a year. It sounds bold, but the details reveal a far narrower program—and some thorny questions about what 'AI for all' really means.
OpenAI wants to manage your money. Their new ChatGPT finance feature raises hard questions about AI capabilities, privacy theater, and whether we're solving problems that actually exist.
IBM's new 95M parameter embedding model punches way above its weight class with 32K context and true multilingual support—all under permissive Apache 2.0 licensing.
Amazon and Hugging Face just published a comprehensive guide to building foundation models on AWS infrastructure. It's the playbook we've all been waiting for.
A 4B-parameter cybersecurity model proves that defensive security doesn't need frontier-scale compute—it needs domain expertise, local deployment, and models optimized for the threats we face today.
OpenAI's latest customer spotlight shows how Parloa is using GPT models to power voice agents that don't make you want to throw your phone. Real-time, reliable, and surprisingly capable.
ServiceNow AI's deep dive into vLLM's V1 upgrade reveals why getting base correctness right matters more than chasing incremental RL gains—a lesson in engineering priorities.