The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Recursive Agent Harnesses

Recursive language models (RLMs) showed that recursion over model calls is an effective strategy for long-context reasoning, and production coding agents have begun to write code that spawns subagents at scale, most recently in Anthropic's dynamic workflows. We name and study the pattern between these two lines of work, where the recursive unit is a full agent harness with filesystem tools, code execution, and planning rather than a model call with no tools. We call this the Recursive Agent Harness (RAH) and frame it as harness recursion, the code-first extension to the model recursion of RLM...

Elias Lumer · 12 days ago

Recursive Agent Harnesses

The Stable Recovery Manifold: Geometric Principles Governing Recoverability in Continual Learning

Operads for compositional reasoning in LLMs

Aerial Wildfire Suppression Planning with a Hybrid CNN-Cellular Automata Fire Model

From Tokens to Faces: Investigating Discrete Speech Representations for 3D Facial Animation

Valid Inference with Synthetic Data via Task Exchangeability

Generative Modeling of Bach-Style Symbolic Music: A Comparative Study of Autoregressive, Latent-Variable, and Adversarial Approaches

Beyond Uniform Tokens: Adaptive Compression for Time Series Language Models

Beyond Runtime Enforcement: Shield Synthesis as Defensibility Analysis for Adversarial Networks

Amazon’s data centers used 2.5 billion gallons of water last year

Majority-of-Three is Optimal

One Polluted Page Is Enough: Evaluating Web Content Pollution in Generative Recommenders

AgentBeats: Agentifying Agent Assessment for Openness, Standardization, and Reproducibility

Reasoning as Pattern Matching: Shared Mechanisms in Human and LLM Everyday Reasoning

Distribution-Agnostic Robust Trajectory Optimization via Chance-Constrained Reinforcement Learning

Multi-Agent Reinforcement Learning from Delayed Marketplace Feedback for Objective-Weight Adaptation in Three-Sided Dispatch

Beyond the Commitment Boundary: Probing Epiphenomenal Chain-of-Thought in Large Reasoning Models

EpiBench: Verifiable Evaluation of AI Agents on Epigenomics Analysis

Reward Modeling for Multi-Agent Orchestration

Multiagent Protocols with Aggregated Confidence Signals

Simplex-Constrained Sparse Bagging: Transitioning from Uniform Priors to Sparse Posteriors in Ensemble Learning

The Tone of Awareness: Topic, Sentiment, and Toxicity Maps During Mental Health Month on TikTok

EvTexture++: Event-Driven Texture Enhancement for Video Super-Resolution

LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories

Learning with Simulators: No Regret in a Computationally Bounded World

ArogyaSutra: A Multi-Agent Framework for Multimodal Medical Reasoning in Indic Languages

Existence Precedes Value: Joint Modeling of Observational Existence and Evolving States in Time Series Forecasting

Adjusted Cup-Product Neural Layer

A Three-Layer Framework for AI in Scientific Discovery

A2D2: Fine-Tuning Any-Length Discrete Diffusion for Adaptive Decoding
