GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests
New results suggest Mythos' cyber threat isn't "a breakthrough specific to one model."
DRSA: decoupled relation alignment method for extending graph foundation models to multi-domain heterogeneous graphs.
Companies are taking control of their own data to tailor AI for their needs. The challenge lies in balancing ownership with the safe, trusted flow of high‑quality data needed to power reliable insights. This conversation from MIT Technology Review’s EmTech AI conference examines how AI factories unlock new levels of scale, sustainability, and governance—positioning data…
Anthropic gates new Claude capabilities (Ultraplan, Ultrareview, Cloud Security) behind paid Cloud plans rather than open releases, fragmenting the skill ecosystem and limiting composability.
Combinatorial Complex Weisfeiler-Lehman test extending WL expressiveness framework to unified topological structures.
Apple's support app includes claude.md files, indicating internal Claude integration or documentation.
Decentralized MCMC algorithm for constrained sampling via proximal stochastic gradient Langevin dynamics with convergence guarantees.
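The core primitive behind constrained Langevin samplers of this kind is a gradient-plus-noise step followed by a proximal (projection) step that keeps iterates in the feasible set. A minimal single-node sketch, assuming a standard Gaussian target and a box constraint (the functions `grad_log_density` and `prox_box` are illustrative stand-ins, not the paper's algorithm):

```python
import numpy as np

rng = np.random.default_rng(0)

def grad_log_density(x):
    # Toy target: standard Gaussian, whose log-density gradient is -x.
    return -x

def prox_box(x, lo=-1.0, hi=1.0):
    # Proximal/projection step enforcing a box constraint on the sample.
    return np.clip(x, lo, hi)

def proximal_sgld(x0, steps=5000, eta=1e-2):
    x = x0.copy()
    for _ in range(steps):
        noise = np.sqrt(2 * eta) * rng.standard_normal(x.shape)
        x = x + eta * grad_log_density(x) + noise  # unconstrained Langevin move
        x = prox_box(x)                            # pull back into the constraint set
    return x

sample = proximal_sgld(np.zeros(3))
```

The decentralized version replaces the single gradient with stochastic gradients exchanged across nodes; the prox step is what carries the constraint guarantee.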
ICASSP 2025 challenge entry using diffuse RIR generation and quality filtering to improve speaker distance estimation models.
Compositional graph embedding framework using Aitchison geometry for interpretable node representations as simplex-valued mixtures.
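Aitchison geometry treats a node representation as a point on the probability simplex and works in log-ratio coordinates. A minimal sketch of the standard centered log-ratio (CLR) map and its inverse, as generic background rather than the paper's specific embedding method:

```python
import numpy as np

def clr(p, eps=1e-12):
    # Centered log-ratio: maps a composition on the simplex to R^D,
    # where Aitchison geometry becomes ordinary Euclidean geometry.
    logp = np.log(p + eps)
    return logp - logp.mean()

def clr_inv(z):
    # Inverse CLR: exponentiate and renormalize (closure) back onto the simplex.
    e = np.exp(z - z.max())
    return e / e.sum()

p = np.array([0.5, 0.3, 0.2])   # a mixture-style node representation
z = clr(p)                      # Euclidean coordinates for learning
p_back = clr_inv(z)             # recover the interpretable mixture
```

CLR coordinates sum to zero by construction, which is what lets simplex-valued mixtures be manipulated with ordinary linear algebra.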
Reddit speculation on status of rumored OpenAI-Jony Ive hardware/product collaboration; no new information.
PFlash: speculative prefill technique achieves 10x speedup on 128K context with quantized 27B models on RTX 3090, open-source C++/CUDA implementation.
Deep kernel learning with transformer embeddings stratifies glaucoma patient risk from sparse EHR data; medical ML application without LLM/frontier AI component.
FinSafetyBench: bilingual red-teaming benchmark (14 subcategories) for evaluating LLM refusal of financial crimes and ethics violations grounded in real cases.
MemCoE: cognition-inspired two-stage memory optimization for LLM agents to learn personalized long-term user preferences within context windows.
FedKPer addresses generalization/personalization in medical federated learning via knowledge personalization; healthcare ML infrastructure without LLM focus.
Persona-induced latent variable model for adaptive user querying under budget constraints; ML methodology tangential to frontier LLM research.
ML-Bench&Guard: policy-grounded multilingual safety benchmark (14 languages) aligning LLMs with region-specific regulations and cultural context.
Reddit user reports severe hallucinations and task non-compliance in Claude Opus 4.7 on May 1st; anecdotal complaint without reproduction details.
Developer demo of generative game engine using Gemini 3 for spell generation with 6-player multiplayer physics simulation.
The Pentagon has struck deals with OpenAI, Google, Microsoft, Amazon, Nvidia, Elon Musk's xAI, and the startup Reflection, allowing the department to use their AI tools in classified settings, according to an announcement on Friday. At the same time, the Defense Department has excluded Anthropic - which it previously used for classified work - after declaring it a supply-chain risk. OpenAI and xAI had already reached agreements with the Pentagon for the "lawful" use of their AI systems. A report from The Information suggests Google has struck a similar a...
Intel releases AutoRound, a low-bit quantization algorithm optimized for CPU/XPU/CUDA with vLLM and Transformers compatibility.
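AutoRound improves on plain round-to-nearest (RTN) quantization by tuning rounding offsets and clipping ranges with signed gradients; the details are in Intel's release. As background, the RTN baseline it competes against can be sketched as (a generic illustration, not AutoRound itself):

```python
import numpy as np

def rtn_quantize(w, bits=4):
    # Per-tensor asymmetric round-to-nearest quantization baseline.
    qmax = 2 ** bits - 1
    scale = (w.max() - w.min()) / qmax          # step size between levels
    zero = np.round(-w.min() / scale)           # integer zero-point
    q = np.clip(np.round(w / scale + zero), 0, qmax)
    return (q - zero) * scale                   # dequantized weights

w = np.linspace(-1.0, 1.0, 16)
w_hat = rtn_quantize(w, bits=4)                 # error bounded by ~one step
```

Learned-rounding methods like AutoRound keep this storage format but choose the rounding direction per weight to minimize layer output error rather than per-weight error.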
Obfuscated Natural Number Game benchmarks LLM prover architectural reasoning vs. pattern matching; evaluates formal theorem-proving capabilities beyond saturation.
Elon Musk spent the better part of three days on the witness stand this week in his lawsuit against OpenAI, and it’s already getting messy. Emails, texts, and his own tweets are surfacing in court, and there are plenty more witnesses to come. Musk’s argument against OpenAI? By converting the company to a for-profit model, Sam Altman betrayed the “nonprofit for the […]
MathArena: continuously-maintained evaluation platform aggregating mathematics benchmarks to track LLM progress; successor to static math benchmarks.
Augmented Lagrangian Multiplier Network stabilizes state-wise constraint enforcement in RL; safety optimization methodology without LLM specificity.
InpaintSLat: training-free 3D inpainting via initial noise optimization in latent diffusion; computer vision task orthogonal to LLM/frontier AI focus.
Formalizes Phase-Latency Isomorphism showing spiking sparse distributed memory and transformers share five functional operations with cosine similarity retrieval.
Introduces mini-batch Markov risk measures and multipattern Q-learning with regret bounds for risk-averse finite-horizon MDPs.
Elon Musk is the one who wanted this trial. He has spent months claiming OpenAI "stole a nonprofit," and saying he was the actual driving force behind one of the most important companies currently in tech. All indications are that he won't win his case against the company, but he's fighting it anyway. So you'd think he'd have done better when it was his time to take the stand. Instead, Musk spent much of the week arguing with lawyers ...
AdaMeZO enables Adam-style zeroth-order LLM fine-tuning without storing moment estimates, reducing GPU memory while maintaining convergence.
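The memory saving in MeZO-style methods comes from regenerating the random perturbation from a seed instead of storing it, so a gradient estimate costs only two forward passes. A minimal sketch of that underlying trick on a toy quadratic loss (this shows the plain MeZO/SPSA estimate only, not AdaMeZO's Adam-style adaptive update, and `loss` stands in for an LLM forward pass):

```python
import numpy as np

def loss(w):
    # Toy quadratic standing in for an LLM forward pass; minimum at w = 1.
    return np.sum((w - 1.0) ** 2)

def mezo_step(w, lr=0.01, eps=1e-3, seed=0):
    # Perturbation is regenerated from the seed for both probes,
    # so nothing beyond the weights themselves needs to be stored.
    z = np.random.default_rng(seed).standard_normal(w.shape)
    g = (loss(w + eps * z) - loss(w - eps * z)) / (2 * eps)  # directional derivative
    return w - lr * g * z                                    # update along z

w = np.zeros(4)
for step in range(500):
    w = mezo_step(w, seed=step)   # fresh direction per step
```

AdaMeZO's contribution, per the summary, is recovering Adam-style step-size adaptation on top of this estimate without materializing the first and second moment buffers.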