Topic

Agents

Every story matching this topic across titles and summaries, newest first.

OpenClaw and Claude can put your AI-generated podcasts in Spotify

Save to Spotify is a new command-line tool designed specifically for AI agents like OpenClaw, Claude Code, or OpenAI Codex. If you're the kind of person who collects research on a topic, then feeds it through their AI of choice to create audio summaries and personal podcasts, this lets you save them right alongside the latest episode of The Vergecast and Welcome to Night Vale on Spotify. To set it up, you need to download and install the Save to Spotify CLI from GitHub. Then you just prompt your AI agent as normal, but tack on "and save to Spotify," and it should show up right in your podcast...

Terrence O’Brien·2 hours ago

OpenAI· FRONTIERNew

Parloa builds service agents customers want to talk to

Parloa uses OpenAI models to build voice-driven customer service agents with simulation and real-time deployment capabilities for enterprises.

OpenAI·5 hours ago

Agents

OpenClaw and Claude can put your AI-generated podcasts in Spotify

Parloa builds service agents customers want to talk to

LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

Executable World Models for ARC-AGI-3 in the Era of Coding Agents

Anthropic's Claude Managed Agents can now "dream," sort of

Self-Induced Outcome Potential: Turn-Level Credit Assignment for Agents without Verifiers

Anthropic’s new finance AI agents feel like a bigger move than just “better chat”

SAP bets $1.16B on 18-month-old German AI lab and says yes to NemoClaw

OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

MOSAIC-Bench: Measuring Compositional Vulnerability Induction in Coding Agents

Agents for financial services

How to Build In-Vehicle AI Agents with NVIDIA: From Cloud to Car

Contextual Multi-Objective Optimization: Rethinking Objectives in Frontier AI Systems

Building for the Rising Complexity of Agentic Systems with Extreme Co-Design

ProgramBench: Can we really rebuild huge binaries from scratch? (doesn't look like it)

CopilotKit raises $27M to help devs deploy app-native AI agents

OpenAI and PwC collaborate to reimagine the office of the CFO

FlexSQL: Flexible Exploration and Execution Make Better Text-to-SQL Agents

AIs and Humans with Agency

ORPilot: A Production-Oriented Agentic LLM-for-OR Tool for Optimization Modeling

Hybrid Inspection and Task-Based Access Control in Zero-Trust Agentic AI

Remote Action Generation: Remote Control with Minimal Communication

DataEvolver: Let Your Data Build and Improve Itself via Goal-Driven Loop Agents

Runtime Evaluation of Procedural Content Generation in an Endless Runner Game Using Autonomous Agents

Talk is Cheap, Communication is Hard: Dynamic Grounding Failures and Repair in Multi-Agent Negotiation

GRAVITY: Architecture-Agnostic Structured Anchoring for Long-Horizon Conversational Memory

Can Coding Agents Reproduce Findings in Computational Materials Science?

Learning How and What to Memorize: Cognition-Inspired Two-Stage Optimization for Evolving Memory

[AINews] Agents for Everything Else: Codex for Knowledge Work, Claude for Creative Work

Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists

Stripe introduces Link, a digital wallet that autonomous AI agents can use, too

Agent-Agnostic Evaluation of SQL Accuracy in Production Text-to-SQL Systems

Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception

Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents

Automating GPU Kernel Translation with AI Agents: cuTile Python to cuTile.jl

A Pattern Language for Resilient Visual Agents

Exploring Interaction Paradigms for LLM Agents in Scientific Visualization

D3-Gym: Constructing Real-World Verifiable Environments for Data-Driven Discovery

Language Models Refine Mechanical Linkage Designs Through Symbolic Reflection and Modular Optimisation

GUI Agents with Reinforcement Learning: Toward Digital Inhabitants

From Unstructured Recall to Schema-Grounded Memory: Reliable AI Memory via Iterative, Schema-Aware Extraction

Graph World Models: Concepts, Taxonomy, and Future Directions

Building Persona-Based Agents On Demand: Tailoring Multi-Agent Workflows to User Needs

Modeling Clinical Concern Trajectories in Language Model Agents

KellyBench: A Benchmark for Long-Horizon Sequential Decision Making

[Open Source] We built a local code search MCP for Claude Code that uses ~98% fewer tokens than grep+read

How to be better than 99% of Claude Code users while doing less, imo:

Absolutely blown away by the utility of the Claude Word add-in

ClawGym: A Scalable Framework for Building Effective Claw Agents

Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations

FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

Remote agents in Vibe. Powered by Mistral Medium 3.5.

Coby Adcock’s Scout AI raises $100 million to train its models for war. We visited its bootcamp

DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios

An Interview with OpenAI CEO Sam Altman and AWS CEO Matt Garman About Bedrock Managed Agents

ADEMA: A Knowledge-State Orchestration Architecture for Long-Horizon Knowledge Synthesis with LLMAgents

From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

Toward Scalable Terminal Task Synthesis via Skill Graphs

Scalable Inference Architectures for Compound AI Systems: A Production Deployment Study

Think Before You Act -- A Neurocognitive Governance Model for Autonomous AI Agents

Modeling Human-Like Color Naming Behavior in Context

Red Hat’s OpenClaw maintainer just made enterprise Claw deployments a lot safer

SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents

From CRUD to Autonomous Agents: Formal Validation and Zero-Trust Security for Semantic Gateways in AI-Native Enterprise Systems

Automated Adversarial Collaboration for Advancing Theory Building in the Cognitive Sciences

OpenAI models, Codex, and Managed Agents come to AWS

The Chameleon's Limit: Investigating Persona Collapse and Homogenization in Large Language Models

Can Current Agents Close the Discovery-to-Application Gap? A Case Study in Minecraft

Governing What You Cannot Observe: Adaptive Runtime Governance for Autonomous AI Agents

AgentWard: A Lifecycle Security Architecture for Autonomous AI Agents

Skill Retrieval Augmentation for Agentic AI

How do you test AI agents in production? The unpredictability is overwhelming.[D]

China vetoes Meta’s $2B Manus deal after months-long probe

OpenAI could be making a phone with AI agents replacing apps

Join the new AI Agents Vibe Coding Course from Google and Kaggle

Choco automates food distribution with AI agents

What is the best coding agent (CLI) like Claude Code for Local Development

Structural Enforcement of Goal Integrity in AI Agents via Separation-of-Powers Architecture