Exponential families from a single KL identity
Theoretical paper derives unified KL identity for exponential families applicable to softmax, Gaussians, variational inference, and RLHF.
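The summary does not state the identity itself. For reference, the standard KL formula for an exponential family $p_\theta(x) = h(x)\exp(\theta^\top T(x) - A(\theta))$ is the Bregman divergence of the log-partition function $A$ — a well-known fact, though not necessarily the paper's exact statement:

```latex
\mathrm{KL}(p_\theta \,\|\, p_{\theta'}) = A(\theta') - A(\theta) - (\theta' - \theta)^\top \nabla A(\theta)
```

Softmax and Gaussian distributions are both exponential families, which is how a single identity of this form can cover cases from classification heads to variational posteriors.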
Linguistic study on syntactic dependency distance minimization in star-like sentence structures; narrow theoretical interest.
Differential privacy optimization for mean estimation in shuffle model; foundational theory without AI systems application.
DriftBench evaluates constraint adherence across 7 LLMs in iterative ideation; shows models lose fidelity under refinement pressure.
MIFair framework for bias assessment via mutual information; addresses intersectionality and multiclass fairness in ML systems.
TeCoD system improves Text-to-SQL accuracy via template-constrained decoding from query pattern reuse in labeled workloads.
As you can see in the outputs, Mythos can output images.
FedHarmony addresses label correlation drift in federated multi-label learning across heterogeneous client datasets.
Empirical study finds statistical laws in global recipe structures via NER; cultural/linguistic interest, not AI-relevant.
Cost-Aware SGD algorithm for finite-sum objectives with heterogeneous sampling costs; applied to RL with language models.
LLM-based combinatorial optimization for Design Structure Matrix modularization; engineering application without novel AI contribution.
Structure-aware densification improves 3D Gaussian Splatting convergence by distinguishing geometric vs. aliasing errors.
Anthropic just released Opus 4.7 as their most advanced model. I reverted to 4.6 within days. I use Claude for production work -- not chat, not summaries. Real deliverables with real deadlines. Here is what happened. I asked 4.7 to update a Word document. It is a task the previous model handled routinely. The new model produced a plain text markdown file with a .docx extension. Not a degraded document. Not a partially formatted document. A file that was literally not a Word document at all. Delivered with full confidence and zero warning that anything was wrong. When I caught it and ...
Framework reframes clinician AI overrides as preference learning signals for RLHF in clinical decision support, with observable outcomes and expert annotators.
Kernelized advantage estimation reduces RL training overhead for LLM reasoning by replacing value networks with nonparametric statistics.
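The headline gives no details of the estimator. As a generic illustration of the idea only — not this paper's actual method — a kernel-smoothed reward baseline can stand in for a learned value network, with advantages computed against that nonparametric baseline (the Gaussian kernel and embedding inputs here are assumptions):

```python
import numpy as np

def kernel_baseline(query, keys, rewards, bandwidth=1.0):
    # Nadaraya-Watson estimate of the expected reward near `query`:
    # a nonparametric stand-in for a trained value network.
    d2 = ((keys - query) ** 2).sum(axis=1)
    w = np.exp(-d2 / (2 * bandwidth ** 2))
    return (w * rewards).sum() / w.sum()

def advantages(embeds, rewards, bandwidth=1.0):
    # Advantage = observed reward minus the kernel-smoothed baseline
    # computed over neighboring samples in embedding space.
    return np.array([r - kernel_baseline(e, embeds, rewards, bandwidth)
                     for e, r in zip(embeds, rewards)])
```

With identical embeddings the baseline collapses to the mean reward, so the advantages are simply mean-centered rewards — the same behavior a constant value baseline would give.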
Architectural pattern language for vision language agents balances latency/non-determinism of VLMs against real-time enterprise control requirements.
Latent-GRPO stabilizes reinforcement learning in latent reasoning space by addressing probability density and sampling mechanism shifts.
Comparative evaluation of three LLM agent paradigms (domain-specific, computer-use, coding) on scientific visualization tasks across 15 benchmarks.
Dynamic scaled gradient descent prevents optimization collapse during fine-tuning on imbalanced datasets via gradient scaling.
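The summary only names the technique. As a loose sketch of the general idea — inverse-frequency gradient scaling on an imbalanced dataset, not the paper's specific dynamic scheme — per-sample gradients can be reweighted so rare classes are not drowned out:

```python
import numpy as np

def scaled_gradients(grads, class_counts, labels):
    # Scale each sample's gradient inversely with its class frequency,
    # so minority-class updates keep comparable magnitude during fine-tuning.
    weights = class_counts.sum() / (len(class_counts) * class_counts)
    return np.array([w * g for w, g in zip(weights[labels], grads)])
```

With counts of 90 vs. 10, a minority-class sample's gradient is scaled by 5.0 and a majority-class sample's by roughly 0.56, so both classes contribute equally in expectation.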
X is rolling out a rebuilt ads platform powered by AI as it works to grow revenue again.
ITS-Mina applies Harris Hawks optimization and external attention in MLP-based framework for multivariate time series forecasting.
D3-Gym dataset: 565 verifiable tasks from real scientific repositories for evaluating LLM agents on data-driven discovery.
TransVLM formalizes shot transition detection (not binary cut detection) for video editing using vision-language models.
LLM and diffusion models generate procedural trading card game content to improve metagame diversity in Pokémon.
MLLMs fail on circuit-to-Verilog translation due to 'Mirage' phenomenon; visual perturbations cause hallucinated code despite correct diagram interpretation.
TMLR paper introduces Joint Embedding Variational Bayes, a probabilistic framework for non-contrastive representation learning via factorized embedding likelihood.
Despite only having one face, I made testing work. I'm currently wearing a pair of smart glasses called the Even Realities G2. Another two pairs, from Rokid, sit on my desk. A few feet away, I've got the Meta Ray-Ban Display charging alongside their Neural Wristband. In my closet are six pairs of $50 smart sunnies that an overzealous Walmart rep sent me. Those sit next to some Xreal, RayNeo, and Lucyd glasses, plus an old pair of Razer Anzu. Later, I'm calling my optician because I'm hoping to test a pair of the new Ray-Ban Meta Optics, which can supposedly handle my challenging prescription....
StructGP uses differentiable DAG learning on irregular EHR time series for uncertainty-aware clinical forecasting with interpretable causal structure.
Reddit user reports DeepSeek model refusing to acknowledge Taiwan's sovereignty, raises questions about geopolitical bias in AI systems.