The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

These LLMs are the best at resisting Russian propaganda

Estonian government benchmark shows how dozens of models combat Russia's "strategic narratives."

Kyle Orland ·20 days ago

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

We talk with the VendingBench authors on evaling Claudes from Haiku to Mythos, and how they build leading, and lasting, frontier evals from scratch.

Latent Space·20 days ago

Ars Technica AI· PRESS

Elon Musk tries again to escape FTC audits of X data handling

Musk can't be trusted to protect X user privacy, public commenters warn FTC.

Ashley Belanger ·20 days ago

TechCrunch AI· PRESS

Meta steals a tactic from Tesla and builds data centers in tents

Meta may have one found one way to slash its massive data center bill: tents.

Tim De Chant·20 days ago

TechCrunch AI· PRESS

Apple approves Poke as the first AI agent on its Messages for Business platform

Poke, the startup that lets people use AI agents through simple text messages, has become the first AI agent approved for Apple’s Messages for Business platform.

Sarah Perez·20 days ago

Hugging Face· INFRA

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Hugging Face·20 days ago

The Verge AI· PRESS

Kevin O’Leary agrees to downsize massive Utah data center

Kevin O'Leary agreed to halve the size of his planned 40,000-acre data center in Utah amid mounting pressure from residents and activists, as reported earlier by local affiliate ABC4. The Shark Tank star sent a letter to Utah Senate President J. Stuart Adams on Thursday, saying that he will remove 19,430 acres from the project, located in and around the Locomotive Springs Waterfowl Management Area. The change comes just days after Adams called on O'Leary to slash the size of his Project Stratos data center by 75 percent, which would reduce it to about 10,000 acres. Adams also asked O'Leary to...

Emma Roth·20 days ago

The Archive

These LLMs are the best at resisting Russian propaganda

Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

Elon Musk tries again to escape FTC audits of X data handling

Meta steals a tactic from Tesla and builds data centers in tents

Apple approves Poke as the first AI agent on its Messages for Business platform

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Kevin O’Leary agrees to downsize massive Utah data center

TailLoR: Protecting Principal Components in Parameter-Efficient Continual Learning

HANDOFF: Humanoid Agentic Task-Space Whole-Body Control via Distilled Complementary Teachers

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

TempoVLA: Learning Speed-Controllable Vision-Language-Action Policies

Regret Minimization with Adaptive Opponents in Repeated Games

Operation-Guided Progressive Human-to-AI Text Transformation Benchmark for Multi-Granularity AI-Text Detection

DNQ: Deep Nash Q-Network for Partially Observable n-Player Games

Pretraining Recurrent Networks without Recurrence

RREDCoT: Segment-Level Reward Redistribution for Reasoning Models

Self-Augmenting Retrieval for Diffusion Language Models

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training

How abundant are good interpolators?

Goedel-Architect: Streamlining Formal Theorem Proving with Blueprint Generation and Refinement

You Only Index Once: Cross-Layer Sparse Attention with Shared Routing

Human Adults and LLMs as Scientists: Who Benefits from Active Exploration?

Benchmark Everything Everywhere All at Once

Will the Agent Recuse Itself? Measuring LLM-Agent Compliance with In-Band Access-Deny Signals

Event Detection for Parameter-to-KPI Dependency Learning for AI-RAN

In-Context Multiple Instance Learning

Scaffold, Not Vocabulary? A Controlled, Two-Tier, Pre-Registered Study of a Popperian Code-Generation Skill

Vortex: Efficient and Programmable Sparse Attention Serving for AI Agents

Agent Memory: Characterization and System Implications of Stateful Long-Horizon Workloads