llama.cpp DeepSeek v4 Flash experimental inference
llama.cpp adds experimental DeepSeek v4 Flash support with aggressive 2-bit quantization, achieving 17 tokens/sec on an M3 Max (128 GB of RAM required).
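A minimal sketch of what a run like this looks like with llama.cpp's standard CLI; the model filename is an assumption (the story does not name a released GGUF), and Q2_K is llama.cpp's usual 2-bit K-quant format:

```shell
# Hypothetical model file; flags are standard llama.cpp options.
./llama-cli \
  -m deepseek-v4-flash-Q2_K.gguf \
  -ngl 99 \
  -c 4096 \
  -p "Summarize the trade-offs of 2-bit quantization."
# -ngl 99 offloads all layers to the GPU (Metal on Apple Silicon);
# -c sets the context window.
```

Throughput at this quantization level depends heavily on memory bandwidth, which is why a unified-memory machine like the M3 Max is a natural target.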
Qualitative study of how smart-home AI teams (Amazon Alexa, Google Nest, Microsoft Azure) manage user expectations through design practices.
QACD framework treats conditional-independence tests in causal discovery as defeasible arguments, mitigating cascading errors in finite-sample regimes.
Theoretical characterizations of admissible objective functions for hierarchical clustering, extending Dasgupta and Cohen-Addad frameworks.
First Romanian Grammatical Error Correction corpus (10k sentences) with adapted ERRANT toolkit and neural baselines for low-resource settings.
GraphPlanner: heterogeneous graph memory-augmented router for multi-agent LLM systems with task planning and multi-round cooperation.
Tandem framework pairs large and small LLMs to reduce computational cost of reasoning-intensive inference while maintaining answer quality.
Transformer-based interpretable models for English reading comprehension with adversarial bias correction and attention visualization for educational use.
Llama.cpp benchmarks on Windows 11 vs Lubuntu 26.04 with RTX 5080 show significant OS-level performance variance in local inference.
Reddit user reports account suspension risk from OpenAI after attempting to automate YouTube downloads; anecdotal account of API guardrail enforcement.
Qwen3.6-27B-INT4 achieves 100+ tokens/sec with 256k context on RTX 5090 via vLLM 0.19, with KLD quantization validation.
HGIN jointly infers interaction graphs and predicts dynamics for lattice Hamiltonian systems from trajectory data without assuming homogeneity.
DxChain: clinical LLM agent using memory anchoring, navigation, and verification phases to reduce diagnostic tunnel vision and hallucinations in EHR analysis.
TimingLLM: two-stage retrieval-augmented LLM pipeline predicts post-synthesis timing (WNS/TNS) from Verilog without running EDA tools.
Paper argues companion chatbots must legally separate commercial from non-commercial contexts to protect user autonomy against undisclosed promotional content.
Empirical study shows persona conditioning in LLMs amplifies gender bias differently across English and Hindi in professional narrative generation.
Proposes Partition-of-Unity Gaussian Kolmogorov-Arnold Networks as an alternative to spline activations, with partition-of-unity properties and a kernel interpretation.
User demonstrates PaddleOCR-VL-1.5 multimodal inference via llama.cpp server for end-to-end document digitization with layout and table handling.
Paper documents LLM failure modes in peer review including prompt injection attacks and proposes safeguards for AI-assisted scientific evaluation.
XITE proposes embedding-based cross-lingual data augmentation via interpolation to improve transfer learning in low-resource multilingual settings.
FinGround detects computational and citation hallucinations in financial LLM systems via atomic claim verification before EU AI Act enforcement (Aug 2026).
Talker-T2AV decouples semantic and low-level modeling in autoregressive audio-video generation for improved talking head synthesis coherence.
ComplianceNLP integrates knowledge-graph-augmented RAG to automatically detect regulatory gaps across SEC, MiFID II, Basel III for financial institutions.
AgentEval introduces DAG-based step-level evaluation framework for agentic workflows with error propagation tracking and hierarchical failure taxonomy.
PhysCodeBench benchmarks physics-aware symbolic simulation across 700 samples to evaluate LLM semantic understanding of physical phenomena for robotics/embodied AI.
LLMs applied to modeling daily human behaviors for prediction and generation across personal assistants and recommendation systems.
RouteNLP framework routes queries across tiered LLM models to minimize inference costs while maintaining per-task quality thresholds.
CAPSULE framework provides hard safety constraints for RL exploration in high-dimensional systems using control-theoretic dynamics models.