The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Graph-of-Agents: A Graph-based Framework for Multi-Agent LLM Collaboration

Graph-of-Agents proposes graph-based multi-agent LLM orchestration with node sampling and decoupled communication to improve task performance over Mixture-of-Agents.

Sukwon Yun·19 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SeekerGym: A Benchmark for Reliable Information Seeking

SeekerGym benchmark evaluates completeness and trustworthiness of information retrieved by deep research AI agents, addressing information gaps and bias.

Remy Kim·19 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads

HIVEMIND applies OS scheduling primitives (admission control, AIMD backpressure, circuit breaking) to coordinate concurrent LLM coding agents sharing rate-limited API endpoints.

Justice Owusu Agyemang·19 days ago

NVIDIA Dev Blog· INFRA

Full-Stack Optimizations for Agentic Inference with NVIDIA Dynamo

Coding agents are starting to write production code at scale. Stripe’s agents generate 1,300+ PRs per week. Ramp attributes 30% of merged PRs to agents.... Coding agents are starting to write production code at scale. Stripe’s agents generate 1,300+ PRs per week. Ramp attributes 30% of merged PRs to agents. Spotify reports 650+ agent-generated PRs per month. Tools like Claude Code and Codex make hundreds of API calls per coding session, each carrying the full conversation history. Behind every one of these workflows is an inference stack under… Source

Ishan Dhanani·20 days ago

NVIDIA Dev Blog· INFRA

Build a More Secure, Always-On Local AI Agent with OpenClaw and NVIDIA NemoClaw

Agents are evolving from question-and-answer systems into long-running autonomous assistants that read files, call APIs, and drive multi-step workflows.... Source

Patrick Moorhead·20 days ago

NVIDIA Dev Blog· INFRA

How to Build Vision AI Pipelines Using NVIDIA DeepStream Coding Agents

Developing real-time vision AI applications presents a significant challenge for developers, often demanding intricate data pipelines, countless lines of code,... Developing real-time vision AI applications presents a significant challenge for developers, often demanding intricate data pipelines, countless lines of code, and lengthy development cycles. NVIDIA DeepStream 9 removes these development barriers using coding agents, such as Claude Code or Cursor, to help you easily create deployable, optimized code that brings your vision AI applications to… Source

Debraj Sinha·21 days ago

Anthropic· FRONTIER

Introducing Claude Opus 4.7

Anthropic releases Claude Opus 4.7 with improved coding, agents, vision, and multi-step reasoning capabilities.

Anthropic·21 days ago

Hugging Face· INFRA

Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents

Hugging Face·22 days ago

Hugging Face· INFRA

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

Hugging Face·22 days ago

OpenAI· FRONTIER

The next evolution of the Agents SDK

OpenAI updates Agents SDK with native sandbox execution and model-native harness for secure, long-running agent development.

OpenAI·22 days ago

Latent Space· ANALYST

Notion’s Token Town: 5 Rebuilds, 100+ Tools, MCP vs CLIs and the Software Factory Future — Simon Last & Sarah Sachs of Notion

Notion ships knowledge work AI agents via 5 rebuilds, 100+ tools, and MCP integration after internal infrastructure overhaul.

Latent Space·23 days ago

Import AI· ANALYST

Import AI 453: Breaking AI agents; MirrorCode; and ten views on gradual disempowerment

Import AI 453 examines agent vulnerabilities, MirrorCode tool, and disempowerment strategies for AI systems.

Jack Clark·24 days ago

Latent Space· ANALYST

Moonlake: Causal World Models should be Multimodal, Interactive, and Efficient — with Chris Manning and Fan-yun Sun

We cap out our World Models coverage with one of the most exciting new approaches - long running, multiplayer, interactive world models built with agents bootstrapped from game engines!

Latent Space·1 month ago

OpenAI· FRONTIER

Gradient Labs gives every bank customer an AI account manager

Gradient Labs deploys GPT-4.1 and GPT-5.4 mini/nano agents for automated banking support with low-latency agentic workflows.

OpenAI·1 month ago

Mistral AI· FRONTIER

Spaces: A CLI Built for Humans and Agents

Mistral releases Spaces, a CLI tool designed for both human developers and autonomous agents.

Mistral AI·1 month ago

Mistral AI· FRONTIER

Two users, one CLI: people and agents

Mistral AI shares design philosophy for CLI tools supporting both human users and AI agents, emphasizing unified tooling that improves developer experience.

Mistral AI·1 month ago

NVIDIA Dev Blog· INFRA

Building NVIDIA Nemotron 3 Agents for Reasoning, Multimodal RAG, Voice, and Safety

Agentic AI is an ecosystem where specialized models work together to handle planning, reasoning, retrieval, and safety guardrailing. As these systems scale,... Agentic AI is an ecosystem where specialized models work together to handle planning, reasoning, retrieval, and safety guardrailing. As these systems scale, developers need models that can understand real-world multimodal data, converse naturally with users globally, and operate safely across languages and modalities. At GTC 2026, NVIDIA introduced a new generation of NVIDIA Nemotron models… Source

Chintan Patel·1 month ago

Hugging Face· INFRA

A New Framework for Evaluating Voice Agents (EVA)

Hugging Face·1 month ago

Mistral AI· FRONTIER

Speaking of Voxtral

Mistral open-sources Voxtral, a fast, adaptable TTS model for voice agents with real-time synthesis.

Mistral AI·2 months ago

OpenAI· FRONTIER

How we monitor internal coding agents for misalignment

OpenAI uses chain-of-thought monitoring to detect misalignment risks in internal coding agents via real-world deployment analysis.

OpenAI·2 months ago

NVIDIA Dev Blog· INFRA

How to Build Deep Agents for Enterprise Search with NVIDIA AI-Q and LangChain

While consumer AI offers powerful capabilities, workplace tools often suffer from disjointed data and limited context. Built with LangChain, the NVIDIA AI-Q... While consumer AI offers powerful capabilities, workplace tools often suffer from disjointed data and limited context. Built with LangChain, the NVIDIA AI-Q blueprint is an open source template that bridges this gap. LangChain recently introduced an enterprise agent platform built with NVIDIA AI to support scalable, production-ready agent development. This tutorial, available as an NVIDIA… Source

Sean Lopp·2 months ago

NVIDIA Dev Blog· INFRA

Building the AI Grid with NVIDIA: Orchestrating Intelligence Everywhere

AI-native services are exposing a new bottleneck in AI infrastructure: As millions of users, agents, and devices demand access to intelligence, the challenge is... AI-native services are exposing a new bottleneck in AI infrastructure: As millions of users, agents, and devices demand access to intelligence, the challenge is shifting from peak training throughput to delivering deterministic inference at scale—predictable latency, jitter, and sustainable token economics. NVIDIA announced at GTC 2026 that telcos and distributed cloud providers are… Source

Sree Sankar·2 months ago

NVIDIA Dev Blog· INFRA

Introducing NVIDIA BlueField-4-Powered CMX Context Memory Storage Platform for the Next Frontier of AI

AI‑native organizations increasingly face scaling challenges as agentic AI workflows drive context windows to millions of tokens and models scale toward... AI‑native organizations increasingly face scaling challenges as agentic AI workflows drive context windows to millions of tokens and models scale toward trillions of parameters. These systems rely on agentic long‑term memory for context that persists across turns, tools, and sessions so agents can build on prior reasoning instead of starting from scratch on every request. Source

Moshe Anschel·2 months ago

NVIDIA Dev Blog· INFRA

Scaling Autonomous AI Agents and Workloads with NVIDIA DGX Spark

Autonomous AI agents are driving the next wave of AI innovation. These agents must often manage long-running tasks that use multiple communication channels and... Autonomous AI agents are driving the next wave of AI innovation. These agents must often manage long-running tasks that use multiple communication channels and background subprocesses simultaneously to explore options, test solutions, and generate optimal results. This places extreme demands on local compute. NVIDIA DGX Spark provides the performance necessary for autonomous agents to execute… Source

Allen Bourgoyne·2 months ago

NVIDIA Dev Blog· INFRA

Run Autonomous, Self-Evolving Agents More Safely with NVIDIA OpenShell

AI has evolved from assistants following your directions to agents that act independently. Called claws, these agents can take a goal, figure out how to achieve... AI has evolved from assistants following your directions to agents that act independently. Called claws, these agents can take a goal, figure out how to achieve it, and execute indefinitely—while leaving you out of the loop. The more capable claws become, the harder they are to trust. And their self-evolving autonomy changes everything about the environment in which they operate. Source

Ali Golshan·2 months ago

OpenAI· FRONTIER

Designing AI agents to resist prompt injection

ChatGPT agents defend against prompt injection via action constraints and sensitive data protection in agent workflows.

OpenAI·2 months ago

NVIDIA Dev Blog· INFRA

How to Minimize Game Runtime Inference Costs with Coding Agents

NVIDIA ACE is a suite of technologies for building AI agents for gaming. ACE provides ready-to-integrate cloud and on-device AI models for every part of in-game... NVIDIA ACE is a suite of technologies for building AI agents for gaming. ACE provides ready-to-integrate cloud and on-device AI models for every part of in-game characters, from speech to intelligence to animation. To run these models alongside the game engine efficiently, the NVIDIA In-Game Inferencing (NVIGI) SDK includes a set of performant libraries that developers can integrate into C++… Source

Brandon Rowlett·2 months ago

NVIDIA Dev Blog· INFRA

Develop Native Multimodal Agents with Qwen3.5 VLM Using NVIDIA GPU-Accelerated Endpoints

Alibaba has introduced the new open source Qwen3.5 series built for native multimodal agents. The first model in this series is a ~400B parameter native... Alibaba has introduced the new open source Qwen3.5 series built for native multimodal agents. The first model in this series is a ~400B parameter native vision-language model (VLM) with reasoning built with a hybrid architecture of mixture of experts (MoE) and Gated Delta Networks. Qwen3.5 can understand and navigate user interfaces, which improves on the previous generation of VLMs. Qwen3.5… Source

Anu Srivastava·2 months ago

OpenAI· FRONTIER

OpenAI and Amazon announce strategic partnership

OpenAI and Amazon expand partnership: Frontier platform, custom models, and enterprise agents on AWS.

OpenAI·2 months ago

OpenAI· FRONTIER

Introducing the Stateful Runtime Environment for Agents in Amazon Bedrock

Amazon Bedrock adds stateful runtime for persistent orchestration and memory in multi-step AI agent workflows.

OpenAI·2 months ago

← Front Page30 matches

← Newer Older →