Frontier AI Watch

MODELS & TOOLS

Models & Tools

Coverage of model releases, developer tooling, workflows, and product adoption.

News on models, agents, developer tools, workflows, and adoption.

Published

Top stories from trusted sources.

Open source AI illustration
InfoQ AI, ML and Data EngineeringPublished

Arm Open-Sources Metis, an AI Security Framework Outperforming Traditional SAST Tools

Arm has open-sourced Metis, an agentic AI security framework designed to autonomously uncover complex software vulnerabilities. Unlike traditional pattern-based tools, Metis applies semantic reasoning to analyze cross-component dependencies and provides clear,…

Open original
AI agents illustration
WIRED AIPublished

Hands-On With Gemini Spark: I Gave It Access to My Life and It Friend-Zoned My Boyfriend

Google’s new AI agent combed through my emails, documents, and calendar to plan a birthday party and still didn’t clock the person most important to me.

Open original
AI research illustration
InfoQ AI, ML and Data EngineeringPublished

Presentation: Building Evals for AI Adoption: From Principles to Practice

Mallika Rao discusses the hidden risk of evaluation debt in production AI systems, drawing on her experience at Twitter, Walmart, and Netflix. She explains why traditional metrics fail modern architectures, breaks down a five-layer evaluation stack spanning…

Open original
AI infrastructure illustration
InfoQ AI, ML and Data EngineeringPublished

AI-Assisted Migration Tool Helps Teams Move from ingress-nginx to Higress in Minutes

The Cloud Native Computing Foundation has highlighted a new AI-assisted migration approach that enabled engineers to migrate 60 ingress-nginx resources to Higress in roughly 30 minutes, demonstrating how artificial intelligence is increasingly being applied to…

Open original
AI agents illustration
OpenAI NewsPublished

How Braintrust turns customer requests into code with Codex

How Braintrust engineers use Codex with GPT-5.5 to run experiments and code faster.

Open original
AI agents illustration
InfoQ AI, ML and Data EngineeringPublished

GitHub Slashes Agent Workflow Token Spend up to 62% with Daily Audits and MCP Pruning

GitHub reports cutting token costs in agentic CI workflows by up to 62% by pruning unused MCP tools, swapping some MCP calls for gh CLI, and running daily “auditor” and “optimizer” agents. A token-usage.jsonl artefact and an Effective Tokens metric help track…

Open original
AI models illustration
OpenAI NewsPublished

Strengthening societal resilience with Rosalind Biodefense

OpenAI launches Rosalind Biodefense, expanding trusted access to GPT-Rosalind for vetted developers and U.S. government partners advancing biodefense, public health, and pandemic preparedness through frontier AI.

Open original