The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

r/singularity· COMMUNITY

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA)

Link: x.com

u/Scared_Bluebird_7243·2 days ago·113 pts / 43 comm

r/LocalLLaMA· COMMUNITY

Supercharging LLM inference on Google TPUs: Achieving 3X speedups with diffusion-style speculative decoding- Google Developers Blog

Google demonstrates 3X LLM inference speedup on TPUs using diffusion-style speculative decoding technique.

u/eternviking·2 days ago·41 pts / 11 comm

TechCrunch AI· PRESS

PayPal says it’s ‘becoming a technology company again.’ That means AI.

PayPal is pitching an AI-led turnaround, tying automation and restructuring to $1.5B in savings as it cuts jobs and works to modernize its tech stack.

Sarah Perez·2 days ago

r/ClaudeAI· COMMUNITY

"Stream ended without a final message" in Claude Design

User reports 'Stream ended without a final message' error in Claude Design, a feature for sketching animations.

u/mazthepa·2 days ago·20 pts / 39 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Raising the Ceiling: Better Empirical Fixation Densities for Saliency Benchmarking

Proposes improved empirical fixation density estimation methods beyond fixed-bandwidth Gaussian KDE for saliency benchmarking and per-image model evaluation.

Susmit Agrawal·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

QKVShare: Quantized KV-Cache Handoff for Multi-Agent On-Device LLMs

QKVShare framework for quantized KV-cache handoff between multi-agent LLMs on edge devices; token-level mixed-precision allocation reduces memory vs. full-precision transfer.

Pratik Honavar·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Deco: Extending Personal Physical Objects into Pervasive AI Companion through a Dual-Embodiment Framework

Dual-Embodiment Companion Framework extends AI capabilities to personal physical objects (plush toys); formative study derives design principles for emotional continuity.

Zhihan Jiang·2 days ago

r/LocalLLaMA· COMMUNITY

ProgramBench: Can we really rebuild huge binaries from scratch? (doesn't look like it)

ProgramBench: 200-task evaluation showing agents struggle to rebuild large binaries from scratch without cheating vulnerabilities.

u/klieret·2 days ago·41 pts / 18 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models

DMGD proposes training-free dataset distillation using diffusion models with semantic-distribution matching guidance.

Qichao Wang·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Spatiotemporal Convolutions on EEG signal -- A Representation Learning Perspective on Efficient and Explainable EEG Classification with Convolutional Neural Nets

Study compares 2D spatiotemporal convolutions vs. concatenated 1D convolutions for EEG signal classification with CNNs.

Laurits Dixen·2 days ago

TechCrunch AI· PRESS

Etsy launches its app within ChatGPT as it continues its AI push

Etsy's new native app within ChatGPT aims to be a conversational shopping experience for users.

Lauren Forristal·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

EvoLM: Self-Evolving Language Models through Co-Evolved Discriminative Rubrics

EvoLM enables self-improvement in language models using co-evolved discriminative rubrics without external reward supervision.

Shuyue Stella Li·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

On Adaptivity in Zeroth-Order Optimization

MEAZO: memory-efficient adaptive zeroth-order optimizer for LLM fine-tuning, outperforms ZO-Adam with scalar-only tracking.

Hassan Dbouk·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Memory-Efficient Continual Learning with CLIP Models

Distributionally robust continual learning method for CLIP models using dynamic per-class loss reweighting with small memory buffers.

Ryan King·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Quantifying the human visual exposome with vision language models

Vision language models quantify semantic richness of personal visual environments to predict mental health outcomes from 2674 participant photos.

Christian Rominger·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Correct Is Not Enough: Training Reasoning Planners with Executor-Grounded Rewards

TraceLift: planner-executor framework trains LLM reasoning traces on executor-grounded rewards, not just final-answer correctness.

Tianyang Han·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MCJudgeBench: A Benchmark for Constraint-Level Judge Evaluation in Multi-Constraint Instruction Following

MCJudgeBench: benchmark for constraint-level evaluation of LLM judges in multi-constraint instruction following with per-constraint gold labels.

Jaeyun Lee·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Mechanical Conscience: A Mathematical Framework for Dependability of Machine Intelligenc

Mathematical framework for dependability of distributed collaborative intelligence systems where locally correct decisions compose into unsafe global behaviors.

Munkhdegerekh Batzorig·2 days ago

r/OpenAI· COMMUNITY

Chatgpt shows his love of goblins

Anecdotal Reddit post about ChatGPT's conversational behavior; no technical substance or news value.

u/batrix03·2 days ago·50 pts / 10 comm

r/LocalLLaMA· COMMUNITY

<thinking></thinking>

Incomplete post with no content.

u/Comfortable-Rock-498·2 days ago·52 pts / 14 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

SOAR: Real-Time Joint Optimization of Order Allocation and Robot Scheduling in Robotic Mobile Fulfillment Systems

SOAR: real-time joint optimization of order allocation and robot scheduling for robotic mobile fulfillment warehouse systems.

Yibang Tang·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Complex Equation Learner: Rational Symbolic Regression with Gradient Descent in Complex Domain

Complex-valued gradient descent for symbolic regression enables discovery of equations with singularities and domain constraints like division and logarithms.

Sergei Garmaev·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

On Computing Total Variation Distance Between Mixtures of Product Distributions

Randomized algorithm approximates total variation distance between mixtures of product distributions with polynomial-time complexity bounds.

Weiming Feng·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

TRACE: A Metrologically-Grounded Engineering Framework for Trustworthy Agentic AI Systems in Operationally Critical Domains

TRACE: engineering framework for trustworthy agentic AI in critical domains combining reference architecture, trust metrics, and bounded human supervision.

Serhii Zabolotnii·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Domain Incremental Continual Learning Benchmark for ICU Time Series Model Transportability

Domain incremental learning benchmark for ICU time-series model transfer across hospitals with domain shift and patient data heterogeneity.

Ryan King·2 days ago

r/Anthropic· COMMUNITY

I literally just started a new chat for a project. The project has 3 Markdown files, around 200 lines each, and after just 4 messages I’ve already hit 75% of my Pro plan usage. Can someone tell me what the hell is going on?

u/richbaro23·2 days ago·10 pts / 30 comm

r/LocalLLaMA· COMMUNITY

Heretic 1.3 released: Reproducible models, integrated benchmarking system, reduced peak VRAM usage, broader model support, and more

Heretic 1.3 adds reproducibility, integrated benchmarking, reduced VRAM, and broader model support for model decensoring.

u/-p-e-w-·2 days ago·54 pts / 10 comm

The Verge AI· PRESS

OpenAI is reportedly launching a phone for ChatGPT

OpenAI's first hardware product might be a phone instead of a mysterious Jony Ive gadget. As reported by MacRumors, supply chain analyst Ming-Chi Kuo shared details about the rumored phone, claiming OpenAI is "fast-tracking" it and aiming to start mass production in early 2027. According to Kuo, the phone will run on a "customized version of the [MediaTek] Dimensity 9600," which is expected to launch this fall and follow up the Dimensity 9500 currently powering phones like the Vivo X300 Pro and the Oppo Find X9 Pro. The custom chip's "headline spec" will be its image signal processor (ISP), w...

Stevie Bonifield·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Reproducing Complex Set-Compositional Information Retrieval

Reproducibility study of neural retrievers on set-compositional queries; introduces LIMIT+ benchmark for constraint-satisfaction information retrieval.

Vincent Degenhart·2 days ago

r/singularity· COMMUNITY

New Boston Dynamics Atlas trick

Boston Dynamics Atlas demonstrates new physical capability; limited technical details available from social media post.

u/Distinct-Question-16·2 days ago·301 pts / 63 comm

← Front Page30 stories

← Newer Older →

The Archive

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA)

Supercharging LLM inference on Google TPUs: Achieving 3X speedups with diffusion-style speculative decoding- Google Developers Blog

PayPal says it’s ‘becoming a technology company again.’ That means AI.

"Stream ended without a final message" in Claude Design

Raising the Ceiling: Better Empirical Fixation Densities for Saliency Benchmarking

QKVShare: Quantized KV-Cache Handoff for Multi-Agent On-Device LLMs

Deco: Extending Personal Physical Objects into Pervasive AI Companion through a Dual-Embodiment Framework

ProgramBench: Can we really rebuild huge binaries from scratch? (doesn't look like it)

DMGD: Train-Free Dataset Distillation with Semantic-Distribution Matching in Diffusion Models

Spatiotemporal Convolutions on EEG signal -- A Representation Learning Perspective on Efficient and Explainable EEG Classification with Convolutional Neural Nets

Etsy launches its app within ChatGPT as it continues its AI push

EvoLM: Self-Evolving Language Models through Co-Evolved Discriminative Rubrics

On Adaptivity in Zeroth-Order Optimization

Memory-Efficient Continual Learning with CLIP Models

Quantifying the human visual exposome with vision language models

Correct Is Not Enough: Training Reasoning Planners with Executor-Grounded Rewards

MCJudgeBench: A Benchmark for Constraint-Level Judge Evaluation in Multi-Constraint Instruction Following

Mechanical Conscience: A Mathematical Framework for Dependability of Machine Intelligenc

Chatgpt shows his love of goblins

&lt;thinking&gt;&lt;/thinking&gt;

SOAR: Real-Time Joint Optimization of Order Allocation and Robot Scheduling in Robotic Mobile Fulfillment Systems

Complex Equation Learner: Rational Symbolic Regression with Gradient Descent in Complex Domain

On Computing Total Variation Distance Between Mixtures of Product Distributions

TRACE: A Metrologically-Grounded Engineering Framework for Trustworthy Agentic AI Systems in Operationally Critical Domains

A Domain Incremental Continual Learning Benchmark for ICU Time Series Model Transportability

hello????

Heretic 1.3 released: Reproducible models, integrated benchmarking system, reduced peak VRAM usage, broader model support, and more

OpenAI is reportedly launching a phone for ChatGPT

Reproducing Complex Set-Compositional Information Retrieval

New Boston Dynamics Atlas trick

<thinking></thinking>