The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

LASER: Low-Rank Activation SVD for Efficient Recursion

LASER uses low-rank SVD of activations to understand and optimize recursive model architectures like Tiny Recursive Models.

Ege Çakar·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Region-Affinity Attention for Whole-Slide Breast Cancer Classification in Deep Ultraviolet Imaging

Applies Region-Affinity Attention to whole-slide breast cancer classification in Deep Ultraviolet imaging.

Nagur Shareef Shaik·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Bilinear Input Modulation for Mamba: Koopman Bilinear Forms for Memory Retention and Multiplicative Computation

Proposes bilinear input modulation for Mamba SSMs to improve memory retention and computational expressiveness via Koopman forms.

Hiroki Fujii·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Dynamics of Cognitive Heterogeneity: Investigating Behavioral Biases in Multi-Stage Supply Chains with LLM-Based Simulation

Uses LLM-based multi-agent simulation to study cognitive biases and coordination in supply chain dynamics at scale.

Jiuyun Jiang·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

PAC-Bayes Bounds for Gibbs Posteriors via Singular Learning Theory

Derives non-asymptotic PAC-Bayes generalization bounds for Gibbs posteriors using singular learning theory.

Chenyang Wang·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Cross-Modal Attention Analysis and Optimization in Vision-Language Models: A Study on Visual Reliability

Analyzes text shortcut learning in Vision-Language Models via an adversarial evaluation framework that measures visual-textual trade-offs.

Lijie Zhou·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Continual Safety Alignment via Gradient-Based Sample Selection

Proposes gradient-based sample selection to preserve safety alignment during fine-tuning by identifying high-gradient harmful samples.

Thong Bach·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond the Basics: Leveraging Large Language Model for Fine-Grained Medical Entity Recognition

Evaluates LLM performance on fine-grained medical entity recognition in clinical narratives beyond standard benchmarks.

Nwe Ni Win·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Guardrails in Logit Space: Safety Token Regularization for LLM Alignment

Introduces safety token regularization to preserve alignment properties during domain-specific fine-tuning of LLMs.

Thong Bach·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

DREAM: Dynamic Retinal Enhancement with Adaptive Multi-modal Fusion for Expert Precision Medical Report Generation

Proposes DREAM framework for medical report generation from retinal images using adaptive multi-modal fusion with limited data.

Nagur Shareef Shaik·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CDSA-Net: Collaborative Decoupling of Vascular Structure and Background for High-Fidelity Coronary Digital Subtraction Angiography

Proposes CDSA-Net for coronary digital subtraction angiography by decoupling vascular structure from background noise.

Si Li·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Demystifying the unreasonable effectiveness of online alignment methods

Reconciles the theory-practice gap in online alignment methods by analyzing temperature-zero regret vs. KL-regularized regret criteria.

Enoch Hyunwook Kang·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Calibrating Model-Based Evaluation Metrics for Summarization

Framework for calibrating model-based summarization metrics without reference summaries or human annotations, addressing reliability in automatic evaluation.

Hongye Liu·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Learning to Control Summaries with Score Ranking

Loss function approach for controlling summary generation across multiple quality dimensions while managing trade-offs between completeness and conciseness.

Hongye Liu·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Forecast Sports Outcomes under Efficient Market Hypothesis: Theoretical and Experimental Analysis of Odds-Only and Generalised Linear Models

Odds-only probabilistic models for sports forecasting and market efficiency analysis via betting data conversion methods.

Kaito Goto·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Do LLM-derived graph priors improve multi-agent coordination?

Study of using LLM-generated graph priors to improve agent coordination in multi-agent reinforcement learning without hand-specified topologies.

Nikunj Gupta·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond Overlap Metrics: Rewarding Reasoning and Preferences for Faithful Multi-Role Dialogue Summarization

Reward-based optimization framework combining reasoning traces with preference alignment for faithful multi-role dialogue summarization.

Xiaoyong Mei·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Persona-Based Requirements Engineering for Explainable Multi-Agent Educational Systems: A Scenario Simulator for Clinical Reasoning Training

Requirements engineering methodology using personas to design explainable multi-agent educational systems for clinical training scenarios.

Weibing Zheng·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SynthFix: Adaptive Neuro-Symbolic Code Vulnerability Repair

Hybrid neural-symbolic framework combining LLM-based code synthesis with compiler feedback for automated vulnerability repair via adaptive routing.

Yifan Zhang·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Model and Estimation of the Bitcoin Transaction Fee

Structural economic model of Bitcoin transaction fee formation using mempool queueing data and mechanism design theory.

Daniel Aronoff·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Layer-wise MoE Routing Locality under Shared-Prefix Code Generation: Token-Identity Decomposition and Compile-Equivalent Fork Redundancy

Analysis of Mixture-of-Experts routing locality and KV cache sharing overlap across layers in multi-candidate code generation from shared prefixes.

Shun-ichiro Hayashi·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Decentralised Trust and Security Mechanisms for IoT Networks at the Edge: A Comprehensive Review

Survey of decentralized trust mechanisms for IoT edge networks including federated learning, Zero Trust, and lightweight blockchain.

Khandoker Ashik Uz Zaman·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Cognitive Policy-Driven LLM for Diagnosis and Intervention of Cognitive Distortions in Emotional Support Conversation

New CogBiasESC dataset for training LLMs to detect and address cognitive distortions in emotional support conversations.

Lin Zhong·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Decomposing the Depth Profile of Fine-Tuning

Empirical study measuring representational change depth profiles across 240 fine-tuning runs on 15 transformer and state-space models.

Jayadev Billa·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Intent-aligned Autonomous Spacecraft Guidance via Reasoning Models

Framework for autonomous spacecraft guidance using reasoning models to interpret mission intent and generate safe trajectories.

Yuji Takubo·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

RosettaSearch: Multi-Objective Inference-Time Search for Protein Sequence Design

RosettaSearch: LLM-based inference-time optimization for protein sequence design using RosettaFold3 structure prediction rewards.

Meghana Kshirsagar·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Modeling Multi-Dimensional Cognitive States in Large Language Models under Cognitive Crowding

CognitiveBench: new benchmark measuring LLM performance on four cognitive dimensions (emotion, stance, thinking style, intention) jointly.

Lin Zhong·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CCCL: In-GPU Compression-Coupled Collective Communication

CCCL: GPU-native compression-coupled collective communication library for LLM training that reduces overhead in tensor and expert parallelism.

Chon Lam Lao·2 months ago

Simon Willison· ANALYST

Changes in the system prompt between Claude Opus 4.6 and 4.7

Comparison of system prompt changes between Claude Opus 4.6 and 4.7, analyzed via git history visualization.

Simon Willison·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Virtue of Sparsity in Complexity

Asset pricing study arguing capacity sparsity and factor sparsity are complementary in high-dimensional financial feature discovery.

Nima Afsharhajari·2 months ago

30 stories