I think Claude’s having a seizure…
Reddit user reports Claude exhibiting erratic behavior; anecdotal observation without technical detail or reproduction steps.
Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.
Reddit user reports Claude exhibiting erratic behavior; anecdotal observation without technical detail or reproduction steps.
Reddit discussion argues prefill latency is underemphasized vs. token generation speed in local LLM benchmarking and optimization focus.
User complaint about Claude Opus 4.6 performance restrictions and usage limits; expresses concern about competitive loss to alternative tools.
Zyphra releases ZAYA1-8B, an 8B parameter model optimized for inference efficiency, trained on AMD hardware.
Claude declined to optimize a CV for Philip Morris tobacco role, citing ethical concerns about tobacco marketing.
Just a reminder that the data centre announced to be used is the one xAI installed a massive amount of toxic gas turbines to power it, which is illegal and deadly to the local area.
Genesis AI claims Gene'26.5 is autonomous; limited details available from social media post.
Reddit post speculating on Anthropic-SpaceX partnership; lacks concrete details or sourced reporting.
Cutthroat negotiations between startup founders are rarely shared so publicly, especially when a company becomes as world-changing as OpenAI.
Reddit speculation that Elon/xAI rented GPUs to Anthropic, interpreted as signal of competitive pressure and capacity constraints.
Study identifies outlier tokens in Diffusion Transformers that attract disproportionate attention in image generation, affecting both encoder and denoiser layers.
Research shows pretrained language models implicitly distinguish grammaticality from string probability through internal representations, despite surface statistics.
Mira Murati, OpenAI's former CTO, has testified under oath that CEO Sam Altman lied to her about the safety standards for a new AI model. In a video deposition shown during the ongoing Musk v. Altman trial on Wednesday, Murati said Altman falsely stated that OpenAI's legal department determined a new AI model did not need to go through the company's deployment safety board. "As you understand it, was Mr. Altman telling the truth when he made that statement to you?" Murati was asked in the deposition. "No," Murati said. Murat said that during her tenure at OpenAI, Altman made her work more dif...
Grok AI model discovered five new mathematical inequalities and bounds in convex geometry and combinatorics, verified by human authors.
Mathematical analysis refuting Carbery's triangle inequality conjecture for Lp spaces with counterexample and sharp bounds on exponent.
LongSeeker proposes Context-ReAct paradigm for elastic context management in long-horizon search agents, maintaining trajectory at variable detail levels.
Theoretical analysis establishes sharp capacity thresholds for linear associative memory, showing d²∼n log n scaling for top-1 retrieval via phase transition.
Method estimates expected outputs of wide random MLPs without sampling by propagating activation distributions via cumulants and Hermite expansions.
Theoretical framework explains transformers' in-context learning on nonlinear regression by showing attention mechanisms construct polynomial and spline bases.
MRI-Eval benchmark with 1365 items assesses LLM performance on MRI physics and GE scanner operations with tiered difficulty and diagnostic conditions.
Q2RL algorithm extracts Q-functions from behavior cloning for efficient offline-to-online robot learning, preventing policy collapse via distribution mismatch.
Design Conductor 2.0 autonomous agent builds hardware accelerators (TurboQuant) in 80 hours using frontier April 2026 models, demonstrating 80x capability scaling over prior work.
First-token confidence (phi_first) from single greedy decode detects LLM hallucinations as effectively as multi-sample semantic self-consistency with lower computational cost.
Geometry-Aware State Space Model applies hyperbolic geometry to whole-slide histopathology image analysis via Multiple Instance Learning, improving patch aggregation for gigapixel resolution.
SemEval-2026 Task 9 system fine-tunes Gemma 3 (12B/27B) per-language with LoRA and GPT-4o-mini synthetic data augmentation for 22-language polarization detection.
Aes3D proposes aesthetic assessment framework for 3D Gaussian Splatting, addressing composition and visual appeal evaluation beyond reconstruction fidelity.
Sparse autoencoders reveal PatchTST uses non-superposed, task-specific representations for time-series forecasting, explaining competitiveness against simple linear models.
SpaceX, Elon Musk's space company that also houses his AI company, xAI, is considering spending $55 billion, at least initially, to build a semiconductor factory in Texas, according to a filing with Grimes County.
In just a few weeks of talks, DeepSeek's potential valuation has reportedly soared from $20 billion to $45 billion.