Section · The Wire

The Wire

A live dispatch from every source on the network. Chronological, ranked, and refreshed continuously as stories break.

Live feed

Sort

100 stories

OpenAI· FRONTIERNew

Parloa builds service agents customers want to talk to

Parloa uses OpenAI models to build voice-driven customer service agents with simulation and real-time deployment capabilities for enterprises.

OpenAI·5 hours ago

Latent Space· ANALYST

[AINews] Anthropic-SpaceXai's 300MW/$5B/yr deal for Colossus I, ARR growth is 8000% annualized

Anthropic secures 300MW, $5B/year compute deal with SpaceX for Colossus I cluster; ARR growth tracking 8000% annualized.

Latent Space·10 hours ago

r/LocalLLaMA· COMMUNITY

Qwen3.6 27B uncensored heretic v2 Native MTP Preserved is Out Now With KLD 0.0021, 6/100 Refusals and the Full 15 MTPs Preserved and Retained, Available in Safetensors, GGUFs and NVFP4s formats.

Community fine-tune of Qwen 3.6 27B with reduced safety filters released in multiple quantization formats.

u/LLMFan46·13 hours ago·49 pts / 14 comm

r/singularity· COMMUNITY

Subquadratic claims to break LLM scaling limits! 1000x less costs

Subquadratic claims subquadratic attention architecture reducing LLM inference costs by 1000x; ex-DeepMind/Meta team, early access signup required.

u/Immediate_Simple_217·8 hours ago·121 pts / 39 comm

r/LocalLLaMA· COMMUNITY

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

ParoQuant introduces pairwise rotation quantization to reduce inference cost for reasoning LLMs while maintaining output quality.

u/Total-Resort-3120·14 hours ago·61 pts / 10 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Design Conductor 2.0: An agent builds a TurboQuant inference accelerator in 80 hours

Design Conductor 2.0 autonomous agent builds hardware accelerators (TurboQuant) in 80 hours using frontier April 2026 models, demonstrating 80x capability scaling over prior work.

The Verkor Team·22 hours ago

r/LocalLLaMA· COMMUNITY

ZAYA1-8B: Frontier intelligence density, trained on AMD

Zyphra releases ZAYA1-8B, an 8B parameter model optimized for inference efficiency, trained on AMD hardware.

u/carbocation·20 hours ago·60 pts / 31 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Taming Outlier Tokens in Diffusion Transformers

Study identifies outlier tokens in Diffusion Transformers that attract disproportionate attention in image generation, affecting both encoder and denoiser layers.

Xiaoyu Wu·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Superposition Is Not Necessary: A Mechanistic Interpretability Analysis of Transformer Representations for Time Series Forecasting

Sparse autoencoders reveal PatchTST uses non-superposed, task-specific representations for time-series forecasting, explaining competitiveness against simple linear models.

Alper Yıldırım·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Geometry-Aware State Space Model: A New Paradigm for Whole-Slide Image Representation

Geometry-Aware State Space Model applies hyperbolic geometry to whole-slide histopathology image analysis via Multiple Instance Learning, improving patch aggregation for gigapixel resolution.

Enhui Chai·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Rollout Pass-Rate Control: Steering Binary-Reward RL Toward Its Most Informative Regime

Proposes Prefix Sampling to optimize RL training efficiency by maintaining 50% pass rate—the regime maximizing reward signal and entropy in agentic tasks like SWE-bench.

Tianshu Zhu·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Executable World Models for ARC-AGI-3 in the Era of Coding Agents

Coding agent with executable Python world models, verification, and simplicity-bias refactoring solves 25 public ARC-AGI-3 games without task-specific logic.

Sergey Rodionov·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

LongSeeker proposes Context-ReAct paradigm for elastic context management in long-horizon search agents, maintaining trajectory at variable detail levels.

Yijun Lu·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Low-Cost Black-Box Detection of LLM Hallucinations via Dynamical System Prediction

Koopman operator theory applied to LLM embeddings as dynamical system enables low-cost black-box hallucination detection without sampling or external retrieval.

Dan Wilson·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation

SemEval-2026 Task 9 system fine-tunes Gemma 3 (12B/27B) per-language with LoRA and GPT-4o-mini synthetic data augmentation for 22-language polarization detection.

Srikar Kashyap Pulipaka·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Adapting Large Language Models to a Low-Resource Agglutinative Language: A Comparative Study of LoRA and QLoRA for Bashkir

Comparative study of LoRA and QLoRA fine-tuning on Bashkir, a low-resource Turkic language, using models from DistilGPT2 to Qwen2.5-7B.

Mullosharaf K. Arabov·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Aes3D: Aesthetic Assessment in 3D Gaussian Splatting

Aes3D proposes aesthetic assessment framework for 3D Gaussian Splatting, addressing composition and visual appeal evaluation beyond reconstruction fidelity.

Chuanzhi Xu·23 hours ago

Stratechery· ANALYSTNew

An Interview with Joanna Stern About Living With AI

Joanna Stern discusses her book on AI integration and media startup launch in Stratechery interview.

Ben Thompson·6 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Impossibility Triangle of Long-Context Modeling

Theoretical proof that long-context models cannot simultaneously optimize efficiency, compactness, and recall—fundamental trade-off affecting Transformers and SSMs.

Yan Zhou·24 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The First Token Knows: Single-Decode Confidence for Hallucination Detection

First-token confidence (phi_first) from single greedy decode detects LLM hallucinations as effectively as multi-sample semantic self-consistency with lower computational cost.

Mina Gabriel·22 hours ago

r/LocalLLaMA· COMMUNITY

Uploaded Unsloth Qwen3.6-35B-A3B UD XL models with MTP grafted, here are the results

MTP speculative decoding ported to Qwen 3.6 35B shows modest 2.5-6% speedup vs. 2-2.5x on 27B; architecture may limit gains.

u/havenoammo·18 hours ago·45 pts / 16 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Grokability in five inequalities

Grok AI model discovered five new mathematical inequalities and bounds in convex geometry and combinatorics, verified by human authors.

Paata Ivanisvili·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Misaligned by Reward: Socially Undesirable Preferences in LLMs

Reward models fail to capture socially desirable preferences across bias, safety, morality, and ethics—exposing hidden alignment failures in LLM training.

Gayane Ghazaryan·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement

Introduces Concept Field method to detect hallucination and measure novelty in LLM outputs by modeling semantic drift in text corpora using sentence embeddings.

Nicholas S. Kersting·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning

Q2RL algorithm extracts Q-functions from behavior cloning for efficient offline-to-online robot learning, preventing policy collapse via distribution mismatch.

Lakshita Dodeja·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Agentic Vulnerability Reasoning on Windows COM Binaries

SLYP agent discovers Windows COM privilege-escalation race conditions via agentic binary exploration and generates debugger-verified proof-of-concept exploits.

Hwiwon Lee·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Understanding In-Context Learning for Nonlinear Regression with Transformers: Attention as Featurizer

Theoretical framework explains transformers' in-context learning on nonlinear regression by showing attention mechanisms construct polynomial and spline bases.

Alexander Hsu·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Evolving Idea Graphs with Learnable Edits-and-Commits for Multi-Agent Scientific Ideation

Evolving Idea Graphs (EIG), a multi-agent LLM framework using learnable graph edits for scientific ideation with novelty, feasibility, clarity metrics.

Jiangwen Dong·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Piper: Efficient Large-Scale MoE Training via Resource Modeling and Pipelined Hybrid Parallelism

Resource modeling and pipelined hybrid parallelism system for efficient large-scale Mixture-of-Experts training on HPC platforms.

Sajal Dash·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Implicit Representations of Grammaticality in Language Models

Research shows pretrained language models implicitly distinguish grammaticality from string probability through internal representations, despite surface statistics.

Yingshan Susan Wang·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics

Memini: associative memory system with multi-timescale dynamics for continual knowledge updating in deployed LLMs without explicit management.

Andreas Pattichis·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

What Matters in Practical Learned Image Compression

Comprehensive study of learned image compression design choices balancing perceptual quality and runtime, introducing novel techniques for practical human-visual-system-optimized codecs.

Kedar Tatwawadi·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

UFAL-CUNI at SemEval-2026 Task 11: An Efficient Modular Neuro-symbolic Method for Syllogistic Reasoning

Neuro-symbolic system combining LLM parser with automated theorem prover for syllogistic reasoning in SemEval-2026 Task 11.

Ivan Kartáč·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Driver-WM: A Driver-Centric Traffic-Conditioned Latent World Model for In-Cabin Dynamics Rollout

Driver-WM: latent world model for predicting driver reactions during L2/L3 automation transitions using in-cabin behavioral dynamics.

Haozhuang Chi·23 hours ago

r/OpenAI· COMMUNITY

Everyone in the US needs to contact their lawmakers to say no to GUARD Act

Reddit post urging opposition to GUARD Act, which would mandate ID/biometric verification for all AI chatbot access in the US.

u/TaeyeonUchiha·16 hours ago·55 pts / 10 comm·+ covered by others

arXiv (cs.AI/CL/LG)· ACADEMIA

Adaptive Inverted-Index Routing for Granular Mixtures-of-Experts

AIR-MoE uses vector quantization for efficient routing in granular mixture-of-experts, reducing computational overhead of token-to-expert assignment.

Klaus-Rudolf Kladny·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Jacobian-Velocity Bounds for Deployment Risk Under Covariate Drift

Drift-aligned tangent regularization (DTR) bounds deployment risk under covariate shift using Jacobian-velocity theorem and Poincaré inequalities.

Jonathan R. Landers·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Almost-Orthogonality in Lp Spaces: A Case Study with Grok

Mathematical analysis refuting Carbery's triangle inequality conjecture for Lp spaces with counterexample and sharp bounds on exponent.

Ziang Chen·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

LineRides: Line-Guided Reinforcement Learning for Bicycle Robot Stunts

LineRides framework enables bicycle robot to learn complex stunts via line-guided RL without demonstrations, using spatial guidelines and sparse keyframe constraints.

Seungeun Rho·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Pinocchio Dimension: Phenomenality of Experience as the Primary Axis of LLM Psychometric Differences

Psychometric analysis of 50 LLMs identifies phenomenal experience as primary variance axis via Pinocchio dimension.

Hubert Plisiecki·24 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Order Matters: Improving Domain Adaptation by Reordering Data

ORDERED: variance reduction for unsupervised domain adaptation via optimal data reordering during training.

Andrea Napoli·24 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Estimating the expected output of wide random MLPs more efficiently than sampling

Method estimates expected outputs of wide random MLPs without sampling by propagating activation distributions via cumulants and Hermite expansions.

Wilson Wu·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning

Unified theoretical framework for distributional regret bounds in bandits and episodic RL, with UCBVI-style algorithm achieving gap-independent guarantees.

Harin Lee·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Sharp Capacity Thresholds in Linear Associative Memory: From Winner-Take-All to Listwise Retrieval

Theoretical analysis establishes sharp capacity thresholds for linear associative memory, showing d²∼n log n scaling for top-1 retrieval via phase transition.

Nicholas Barnfield·22 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Modular Reinforcement Learning For Cooperative Swarms

Modular multi-agent reinforcement learning approach for cooperative robot swarms with limited communication and local interaction.

Erel Shtossel·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Training-Time Batch Normalization Reshapes Local Partition Geometry in Piecewise-Affine Networks

Theoretical analysis of batch normalization's effect on geometry of piecewise-affine networks during training via hyperplane switching.

Xuan Qi·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Skill Neologisms: Towards Skill-based Continual Learning

Skill neologisms—soft tokens optimized for new capabilities—enable selective LLM skill extension without catastrophic forgetting or context limits.

Antonin Berthon·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Why Expert Alignment Is Hard: Evidence from Subjective Evaluation

Study shows expert alignment in LLMs varies substantially by evaluator and task subjectivity; reveals tacit criteria and temporal inconsistency as core obstacles.

Tzu-Mi Lin·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Manifold Steering Reveals the Shared Geometry of Neural Network Representation and Behavior

Manifold steering interventions causally link neural activation geometry to model behavior via structured representation space.

Daniel Wurgaft·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Automatically Finding and Validating Unexpected Side-Effects of Interventions on Language Models

Automated pipeline for auditing unexpected behavioral side-effects of LLM interventions through contrastive multi-token generation analysis.

Quintin Pope·24 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

EP-GRPO: Entropy-Progress Aligned Group Relative Policy Optimization with Implicit Process Guidance

EP-GRPO fixes credit assignment failures in GRPO-based LLM reasoning via token-level entropy, polarity-aware rewards, and zero-variance collapse mitigation.

Song Yu·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Predictive-Causal Gap: An Impossibility Theorem and Large-Scale Neural Evidence

Empirical study finds predictive neural encoders systematically fail to learn causal representations, achieving 49% causal fidelity despite high prediction accuracy across 2695 configurations.

Kejun Liu·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

On the Hardness of Junking LLMs

Analysis of LLM jailbreak vulnerability without structured prompts reveals robustness gaps in current safety defenses.

Marco Rando·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Think-Aloud Reshapes Automated Cognitive Model Discovery Beyond Behavior

Think-aloud traces improve automated cognitive model discovery beyond behavior-only constraints in risky decision-making tasks.

Hanbo Xie·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Why Geometric Continuity Emerges in Deep Neural Networks: Residual Connections and Rotational Symmetry Breaking

Geometric continuity in deep networks explained by residual connections and symmetry-breaking nonlinearities coordinating weight updates across layers.

Kyungwon Jeong·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Detecting Hallucinations in Large Language Models via Internal Attention Divergence Signals

Single-pass hallucination detection method for LLMs using attention head KL-divergence without sampling, validated across multiple model families.

Gijs van Dijk·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Uno-Orchestra: Parsimonious Agent Routing via Selective Delegation

Uno-Orchestra: unified LLM multi-agent orchestration policy that jointly learns task decomposition and worker selection via RL, benchmarked on 13 suites.

Zhiqing Cui·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Unintended Negative Impacts of Promotional Language in Patent Evaluation

Large-scale USPTO study finds promotional language in patents negatively correlates with approval probability, contrary to science communication norms.

Bingkun Zhao·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models

Proposes detecting structural hallucinations in diffusion models via local intrinsic dimension analysis as instabilities on model-induced manifolds.

Bartlomiej Sobieski·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning

Adaptive policy selection method improves offline-to-online RL by combining off-policy and online evaluation under interaction budgets.

Alper Kamil Bozkurt·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond Semantics: An Evidential Reasoning-Aware Multi-View Learning Framework for Trustworthy Mental Health Prediction

Multi-view evidential reasoning framework for mental health prediction from text with calibrated uncertainty estimation.

Yucheng Ruan·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

How Long Does Infinite Width Last? Signal Propagation in Long-Range Linear Recurrences

Finite-width signal propagation analysis shows when infinite-width approximation breaks down in long linear recurrences.

Mariia Seleznova·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Bayesian Approach for Task-Specific Next-Best-View Selection with Uncertain Geometry

Bayesian framework for active view selection in 3D reconstruction using posterior inference over implicit surfaces.

Jingsen Zhu·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Provable imitation learning for control of instability in partially-observed Vlasov--Poisson equations

Imitation learning for stabilizing Vlasov-Poisson plasma control using sparse macroscopic diagnostics with stability guarantees.

Xiaofan Xia·24 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Look Once, Beam Twice: Camera-Primed Real-Time Double-Directional mmWave Beam Management for Vehicular Connectivity

Vision-based mmWave beam management system for V2X vehicular connectivity using camera sensing and closed-loop learning.

Avhishek Biswas·24 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels

CuBridge: LLM-based framework for generating and reconstructing high-performance CUDA attention kernels with improved correctness and efficiency.

Xing Ma·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Joint Treatment Effect Estimation from Incomplete Healthcare Data: Temporal Causal Normalizing Flows with LLM-driven Evolutionary MNAR Imputation

CausalFlow-T applies DAG-constrained normalizing flows and LLM-driven imputation for treatment effect estimation in incomplete EHR data.

Olivia Jullian Parra·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

On the Wasserstein Gradient Flow Interpretation of Drifting Models

Wasserstein Gradient Flow analysis characterizes Generative Modeling via Drifting (GMD) as fixed-point optimization in probability measure space.

Arthur Gretton·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Adaptivity Under Realizability Constraints: Comparing In-Context and Agentic Learning

Theoretical analysis shows adaptive agentic queries don't outperform fixed in-context queries under ReLU realizability constraints.

Anastasis Kratsios·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Transformed Latent Variable Multi-Output Gaussian Processes

T-LVMOGP framework scales Multi-Output Gaussian Processes to high-dimensional outputs via transformed latent variables.

Xiaoyu Jiang·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Building informative materials datasets beyond targeted objectives

Framework for materials science dataset construction balancing targeted property optimization against preservation of untargeted outcomes via diversity-aware selection.

Rafael Espinosa Castañeda·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Gated Multimodal Learning for Interpretable Property Energy Performance Prediction and Retrofit Scenario Analysis

Gated multimodal model combining EPC tabular data and assessor text to predict building energy efficiency scores.

Yunfei Bai·24 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Direct Product Flow Matching: Decoupling Radial and Angular Dynamics for Few-Shot Adaptation

Flow matching method for few-shot vision-language model adaptation using polar decomposition to decouple radial and angular feature dynamics.

Hongxu Chen·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SoK: Robustness in Large Language Models against Jailbreak Attacks

Systematic review of jailbreak attack and defense methods for LLMs with critique of narrow evaluation metrics like attack success rate.

Feiyue Xu·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Proximal Projection for Doubly Sparse Regularized Models

Doubly sparse regularization exploiting Gaussian graphical model structure for high-dimensional regression.

Jia Wei He·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Hypergraph Generation via Structured Stochastic Diffusion

HEDGE: generative model for hypergraphs using structured stochastic diffusion with two-sided heat operator to preserve higher-order interaction structure.

Christopher Nemeth·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization

Preference-based self-distillation method for on-policy training that moves beyond KL matching via reward regularization to improve reasoning stability.

Xin Yu·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Empirical Study of Pop and Jazz Mix Ratios for Genre-Adaptive Chord Generation

Fine-tuning study on 25M-parameter transformer for jazz chord generation—domain adaptation via pop-to-jazz transfer learning.

Jinju Lee·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Conditional outlier detection for clinical alerting

Data-driven anomaly detection flags unusual patient-management actions in EHR systems to reduce clinical errors.

Milos Hauskrecht·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

DualTCN: A Physics-Constrained Temporal Convolutional Network for 2 Time-Domain Marine CSEM Inversion

DualTCN physics-constrained TCN for marine electromagnetic inversion achieves 25% loss reduction over baselines.

Khaled Ahmed·1 day ago

r/LocalLLaMA· COMMUNITY

Analysis of the 100 most popular hardware setups on Hugging Face

Analysis of 100 most popular hardware configurations for local LLM inference on Hugging Face reveals deployment patterns and infrastructure preferences.

u/clem59480·23 hours ago·42 pts / 15 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Learned Neighbor Trust for Collaborative Deployment in Model-Agnostic Decentralized Learning

Decentralized learning framework where heterogeneous nodes train learned neighbor-trust policies for collaborative inference deployment in IoT.

Michael Lanier·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Graph-SND: Sparse Aggregation for Behavioral Diversity in Multi-Agent Reinforcement Learning

Graph-SND: sparse-graph generalization of System Neural Diversity metric for multi-agent RL, reducing quadratic-time computation to O(|E|) with unbiased estimation.

Shawn Ray·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Physiologically Grounded Driver Behavior Classification: SHAP-Driven Elite Feature Selection and Hybrid Gradient Boosting for Multimodal Physiological Signals

SHAP-based feature selection and hybrid boosting classify driving behaviors from multimodal physiological signals (EEG, EMG, GSR).

Sahar Askari·23 hours ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Position: Embodied AI Requires a Privacy-Utility Trade-off

Position paper argues embodied AI deployment in sensitive environments creates systemic privacy crisis requiring fundamental privacy-utility trade-off design.

Xiaoliang Fan·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

When Relations Break: Analyzing Relation Hallucination in Vision-Language Model Under Rotation and Noise

Study of relation hallucination in vision-language models under rotation and noise perturbations with evaluation of augmentation and preprocessing defenses.

Philip Wootaek Shin·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Reliable Modeling of Distribution Shifts via Displacement-Reshaped Optimal Transport

ReshapeOT improves optimal transport for distribution shifts by reshaping ground metrics using observed sample displacements.

Philip Naumann·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Order-based Rehearsal Learning

First order-based rehearsal learning method for avoiding undesired futures; uses ordinal structures instead of graph estimation.

Yu-Xuan Tao·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Scalable inference of spatial regions and temporal signatures from time series

Spatial regionalization method using minimum description length principle to partition time-evolving domains without pre-specifying region count.

Jiayu Weng·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Delving into Non-Exchangeability for Conformal Prediction in Graph-Structured Multivariate Time Series

Conformal prediction applied to graph-structured time series; addresses non-exchangeability via spectral graph theory for rigorous uncertainty quantification.

Ruichao Guo·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Kinematic Discriminants of Deceleration Behavior Modes in Car-Following: Evidence from NGSIM Trajectory Data

Analysis of car-following deceleration behavior using NGSIM trajectory data identifying gap-closing rate and visual looming discriminants.

Eni Solomon Laughter·1 day ago

r/ClaudeAI· COMMUNITY

the part of using claude code nobody talks about

Engineer reflects on cognitive load and knowledge retention challenges when using Claude for rapid feature development.

u/Consistent-Arm-875·9 hours ago·23 pts / 48 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

On the Influence of the Feature Computation Budget on Per-Instance Algorithm Selection for Black-Box Optimization

Study determines optimal feature computation budget fraction for per-instance algorithm selection in black-box optimization.

Koen van der Blom·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Adaptive Learning Strategies for AoA-Based Outdoor Localization: A Comprehensive Framework

Adaptive deep learning framework for angle-of-arrival based outdoor localization in 5G/6G networks with flexible training strategies.

Bac Trinh-Nguyen·1 day ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Full-chip CMP modelling based on Fully Convolutional Network leveraging White Light Interferometry

Fully convolutional neural network for chemical-mechanical polishing modeling in IC manufacturing using white light interferometry.

Jules Exbrayat·1 day ago

r/ClaudeAI· COMMUNITY

Prompt Injection experience - my first time ever

User documents prompt injection attack against Claude via GetAIPerks website, detailing fake system prompt injection technique and model behavior.

u/netmilk·1 day ago·64 pts / 7 comm

OpenAI· FRONTIER

How frontier enterprises are building an AI advantage

OpenAI B2B Signals research examines how enterprises scale AI adoption and agentic workflows to build competitive advantage.

OpenAI·2 days ago

TechCrunch AI· PRESS

Five architects of the AI economy explain where the wheels are coming off

Earlier this week, five people who touch every layer of the AI supply chain sat down at the Milken Global Conference in Beverly Hills, where they talked with TechCrunch about everything from chip shortages to orbital data centers to the possibility that the whole architecture that undergirds the tech is wrong.

Connie Loizos·11 hours ago

r/OpenAI· COMMUNITY

everybody calm down. i got this.

Reddit post with no substantive content; appears to be placeholder or incomplete submission.

u/imfrom_mars_·9 hours ago·75 pts / 11 comm

r/Anthropic· COMMUNITY

Did you notice any improvements?

Reddit discussion asking whether Claude models show performance improvements; lacks substantive technical detail.

u/erikofantastiko·9 hours ago·11 pts / 10 comm

← Front Page Full archive →