The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

I think Claude’s having a seizure…

Reddit user reports Claude exhibiting erratic behavior; anecdotal observation without technical detail or reproduction steps.

u/david8840·21 hours ago·23 pts / 15 comm

r/LocalLLaMA· COMMUNITY

Most people seem obsessed with token generation speed, but isn’t prefill the real bottleneck? Am I missing something?

Reddit discussion argues prefill latency is underemphasized vs. token generation speed in local LLM benchmarking and optimization focus.

u/wbulot·21 hours ago·40 pts / 35 comm

r/Anthropic· COMMUNITY

Does this mean you'll restore original models?

User complaint about Claude Opus 4.6 performance restrictions and usage limits; expresses concern about competitive loss to alternative tools.

u/hatekhyr·21 hours ago·10 pts / 28 comm

r/LocalLLaMA· COMMUNITY

ZAYA1-8B: Frontier intelligence density, trained on AMD

Zyphra releases ZAYA1-8B, an 8B parameter model optimized for inference efficiency, trained on AMD hardware.

u/carbocation·21 hours ago·60 pts / 31 comm

Hugging Face· INFRA

vLLM V0 to V1: Correctness Before Corrections in RL

Hugging Face·22 hours ago

r/ClaudeAI· COMMUNITY

Claude has a conscience!

Claude declined to optimize a CV for Philip Morris tobacco role, citing ethical concerns about tobacco marketing.

u/Feeling_Function1184·22 hours ago·30 pts / 40 comm

r/Anthropic· COMMUNITY

So we now happy using Toxic air turbines Dario?

Just a reminder that the data centre announced to be used is the one xAI installed a massive amount of toxic gas turbines to power it, which is illegal and deadly to the local area.

u/DisaffectedLShaw·22 hours ago·11 pts / 8 comm

r/singularity· COMMUNITY

Genesis AI's Gene'26.5

Genesis AI claims Gene'26.5 is autonomous; limited details available from social media post.

u/torb·22 hours ago·113 pts / 34 comm

r/Anthropic· COMMUNITY

Anthropic Gets in Bed With SpaceX as the AI Race Turns Weird

Reddit post speculating on Anthropic-SpaceX partnership; lacks concrete details or sourced reporting.

u/wiredmagazine·22 hours ago·24 pts / 12 comm

TechCrunch AI· PRESS

How Elon Musk left OpenAI, according to Greg Brockman

Cutthroat negotiations between startup founders are rarely shared so publicly, especially when a company becomes as world-changing as OpenAI.

Tim Fernholz·22 hours ago

r/ClaudeAI· COMMUNITY

What it means that Elon just rented out all his GPUs to Anthropic

Reddit speculation that Elon/xAI rented GPUs to Anthropic, interpreted as signal of competitive pressure and capacity constraints.

u/ContextCustodian·23 hours ago·34 pts / 30 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Taming Outlier Tokens in Diffusion Transformers

Study identifies outlier tokens in Diffusion Transformers that attract disproportionate attention in image generation, affecting both encoder and denoiser layers.

Xiaoyu Wu·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Implicit Representations of Grammaticality in Language Models

Research shows pretrained language models implicitly distinguish grammaticality from string probability through internal representations, despite surface statistics.

Yingshan Susan Wang·23 hours ago

The Verge AI· PRESS

Mira Murati tells the court that she couldn’t trust Sam Altman’s words

Mira Murati, OpenAI's former CTO, has testified under oath that CEO Sam Altman lied to her about the safety standards for a new AI model. In a video deposition shown during the ongoing Musk v. Altman trial on Wednesday, Murati said Altman falsely stated that OpenAI's legal department determined a new AI model did not need to go through the company's deployment safety board. "As you understand it, was Mr. Altman telling the truth when he made that statement to you?" Murati was asked in the deposition. "No," Murati said. Murat said that during her tenure at OpenAI, Altman made her work more dif...

Jay Peters·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Grokability in five inequalities

Grok AI model discovered five new mathematical inequalities and bounds in convex geometry and combinatorics, verified by human authors.

Paata Ivanisvili·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Almost-Orthogonality in Lp Spaces: A Case Study with Grok

Mathematical analysis refuting Carbery's triangle inequality conjecture for Lp spaces with counterexample and sharp bounds on exponent.

Ziang Chen·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

LongSeeker proposes Context-ReAct paradigm for elastic context management in long-horizon search agents, maintaining trajectory at variable detail levels.

Yijun Lu·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Sharp Capacity Thresholds in Linear Associative Memory: From Winner-Take-All to Listwise Retrieval

Theoretical analysis establishes sharp capacity thresholds for linear associative memory, showing d²∼n log n scaling for top-1 retrieval via phase transition.

Nicholas Barnfield·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Estimating the expected output of wide random MLPs more efficiently than sampling

Method estimates expected outputs of wide random MLPs without sampling by propagating activation distributions via cumulants and Hermite expansions.

Wilson Wu·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Understanding In-Context Learning for Nonlinear Regression with Transformers: Attention as Featurizer

Theoretical framework explains transformers' in-context learning on nonlinear regression by showing attention mechanisms construct polynomial and spline bases.

Alexander Hsu·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MRI-Eval: A Tiered Benchmark for Evaluating LLM Performance on MRI Physics and GE Scanner Operations Knowledge

MRI-Eval benchmark with 1365 items assesses LLM performance on MRI physics and GE scanner operations with tiered difficulty and diagnostic conditions.

Perry E. Radau·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning

Q2RL algorithm extracts Q-functions from behavior cloning for efficient offline-to-online robot learning, preventing policy collapse via distribution mismatch.

Lakshita Dodeja·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Design Conductor 2.0: An agent builds a TurboQuant inference accelerator in 80 hours

Design Conductor 2.0 autonomous agent builds hardware accelerators (TurboQuant) in 80 hours using frontier April 2026 models, demonstrating 80x capability scaling over prior work.

The Verkor Team·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The First Token Knows: Single-Decode Confidence for Hallucination Detection

First-token confidence (phi_first) from single greedy decode detects LLM hallucinations as effectively as multi-sample semantic self-consistency with lower computational cost.

Mina Gabriel·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Geometry-Aware State Space Model: A New Paradigm for Whole-Slide Image Representation

Geometry-Aware State Space Model applies hyperbolic geometry to whole-slide histopathology image analysis via Multiple Instance Learning, improving patch aggregation for gigapixel resolution.

Enhui Chai·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation

SemEval-2026 Task 9 system fine-tunes Gemma 3 (12B/27B) per-language with LoRA and GPT-4o-mini synthetic data augmentation for 22-language polarization detection.

Srikar Kashyap Pulipaka·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Aes3D: Aesthetic Assessment in 3D Gaussian Splatting

Aes3D proposes aesthetic assessment framework for 3D Gaussian Splatting, addressing composition and visual appeal evaluation beyond reconstruction fidelity.

Chuanzhi Xu·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Superposition Is Not Necessary: A Mechanistic Interpretability Analysis of Transformer Representations for Time Series Forecasting

Sparse autoencoders reveal PatchTST uses non-superposed, task-specific representations for time-series forecasting, explaining competitiveness against simple linear models.

Alper Yıldırım·23 hours ago

TechCrunch AI· PRESS

SpaceX may spend up to $119 billion on ‘Terafab’ chip factory in Texas

SpaceX, Elon Musk's space company that also houses his AI company, xAI, is considering spending $55 billion, at least initially, to build a semiconductor factory in Texas, according to a filing with Grimes County.

Ram Iyer·23 hours ago

TechCrunch AI· PRESS

DeepSeek could hit $45B valuation from its first investment round

In just a few weeks of talks, DeepSeek's potential valuation has reportedly soared from $20 billion to $45 billion.

Julie Bort·23 hours ago

← Front Page30 stories

← Newer Older →