The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Centering Ecological Goals in Automated Identification of Individual Animals

Analysis of mismatch between automated animal identification method evaluation and real ecological data collection practices in conservation.

Lukas Picek·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

RSRCC: A Remote Sensing Regional Change Comprehension Benchmark Constructed via Retrieval-Augmented Best-of-N Ranking

RSRCC: 126k remote sensing benchmark for localized change question-answering, built via retrieval-augmented best-of-N ranking.

Roie Kazoom·2 months ago

r/LocalLLaMA· COMMUNITY

unsloth Qwen3.6-27B-GGUF

unsloth releases GGUF quantized version of Qwen3.6-27B with embedded weights.

u/jacek2023·2 months ago·494 pts / 103 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

pAI/MSc: ML Theory Research with Humans on the Loop

pAI/MSc: open-source multi-agent system reducing human steering to convert ML theory hypotheses into submission-ready manuscripts.

Mahmoud Abdelmoneum·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Too Sharp, Too Sure: When Calibration Follows Curvature

Study links calibration, curvature, and margins during neural network training; Expected Calibration Error tightly coupled to Hessian properties.

Alessandro Morosini·2 months ago

The Verge AI· PRESS

Now Meta will track what employees do on their computers to train its AI agents

Meta employees' activity at work is now being used to train the company's AI agents. As reported by Reuters, Meta is installing a tool it calls Model Capability Initiative (MCI) on US-based employees' computers that runs in work-related apps and websites, recording mouse movements, clicks, keystrokes, and occasional screenshots. The data from this tool will be used to train the company's AI models to get better at interacting with computers the way humans do, including automating work tasks like those Meta's employees perform on the job. According to Reuters, the data from MCI won't be "used ...

Stevie Bonifield·2 months ago

TechCrunch AI· PRESS

OpenAI teams up with Infosys to bring AI tools to more businesses

Infosys said the integration will be used to help its clients modernize software development, automate workflows and deploy AI systems, initially focusing software engineering, legacy modernization, and DevOps.

Jagmeet Singh·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond ZOH: Advanced Discretization Strategies for Vision Mamba

Compares 6 discretization schemes (ZOH, FOH, BIL, POL, HOH, fourth-order) for Vision Mamba to improve temporal fidelity in dynamic scenes.

Fady Ibrahim·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning

SuperIgor: instruction-following framework where LM generates/refines plans via RL feedback loop, reducing manual annotation needs.

Zoya Volovikova·2 months ago

r/ClaudeAI· COMMUNITY

The "Missing Middle": Why is there no $50/mo Claude tier?

User argues for $40–$60 mid-tier Claude subscription option between $20 Pro and $100 Max plans to address pricing gap.

u/theePharisee·2 months ago·324 pts / 195 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Self-Aware Vector Embeddings for Retrieval-Augmented Generation: A Neuroscience-Inspired Framework for Temporal, Confidence-Weighted, and Relational Knowledge

SmartVector adds temporal, confidence, and relational metadata to embeddings to improve RAG accuracy on versioned queries from 58% baseline.

Naizhong Xu·2 months ago

Ars Technica AI· PRESS

Indian med student rakes in thousands with AI-generated MAGA hottie

"Emily Hart" is a young, AI-created conservative woman who likes to take off her clothes.

Ej Dickson, wired.com ·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Differentially Private Clustered Federated Learning with Privacy-Preserving Initialization and Normality-Driven Aggregation

Clustered federated learning with differential privacy and privacy-preserving initialization reduces data heterogeneity impact on convergence.

Jie Xu·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

An explicit operator explains end-to-end computation in the modern neural networks used for sequence and language modeling

S4D state space models mapped to nonlinear oscillator networks, enabling interpretability of long-range dependency capture via wave-based encoding.

Anif N. Shikder·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Hierarchical MARL-Based Approach for Coordinated Retail P2P Trading and Wholesale Market Participation of DERs

Multi-agent RL framework for distributed energy resources to coordinate P2P retail trading and wholesale market participation.

Patrick Wilk·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Trust, Lies, and Long Memories: Emergent Social Dynamics and Reputation in Multi-Round Avalon with LLM Agents

LLM agents in repeated Avalon deception games develop reputation dynamics and social memory across 188 games, studying emergent multi-round behavior.

Suveen Ellawela·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Evaluating Assurance Cases as Text-Attributed Graphs for Structure and Provenance Analysis

Graph diagnostic framework evaluates assurance case structure and detects AI-generated vs. human-authored compliance documentation via link prediction.

Fariz Ikhwantri·2 months ago

r/Anthropic· COMMUNITY

Anthropic: You would get so much more respect from us with honestly. Stop listening to PR firms and just tell us what you're doing

At one point people thought of you as better than OpenAI and Google. We know AI companies are losing money. \- Just say, "We don't release Mythos because it'd be too expensive." \- Just say "We're going to increase the prices of Pro and Max because we're running out of money" ... all this under-the-radar marketing firm BS just means that you've decided to hemorrhage social capital as well as financial capital. Why would you want to do this?

u/HumbleIncident5464·2 months ago·396 pts / 79 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Ask Only When Needed: Proactive Retrieval from Memory and Skills for Experience-Driven Lifelong Agents

ProactAgent enables lifelong learning agents to proactively retrieve past experience and skills during task interaction rather than passively.

Yuxuan Cai·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Amortized Vine Copulas for High-Dimensional Density and Information Estimation

Vine Denoising Copula amortizes vine-copula modeling via reusable bivariate denoising for tractable high-dimensional density estimation.

Houman Safaai·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Where Reasoning Breaks: Logic-Aware Path Selection by Controlling Logical Connectives in LLMs Reasoning Chains

Logical connectives identified as high-entropy fragility points in LLM multi-step reasoning; intervention via selective path control improves stability.

Seunghyun Park·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

LLM StructCore: Schema-Guided Reasoning Condensation and Deterministic Compilation

Schema-guided reasoning framework for clinical CRF completion via two-stage decomposition handles sparse field prediction with strict output contracts.

Serhii Zabolotnii·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

LayerTracer: A Joint Task-Particle and Vulnerable-Layer Analysis framework for Arbitrary Large Language Model Architectures

LayerTracer framework analyzes hierarchical representations and robustness bottlenecks across diverse LLM architectures including Transformer, GateDeltaNet, and Mamba.

Yuhang Wu·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

On Bayesian Softmax-Gated Mixture-of-Experts Models

Theoretical analysis of Bayesian softmax-gated mixture-of-experts models with focus on posterior distribution asymptotic behavior.

Nicola Bariletto·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Toward Cross-Lingual Quality Classifiers for Multilingual Pretraining Data Selection

Cross-lingual quality classification for multilingual pretraining uses embedding-space consistency to transfer filtering strategies from high-resource to low-resource languages.

Yassine Turki·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Enhancing Research Idea Generation through Combinatorial Innovation and Multi-Agent Iterative Search Strategies

Multi-agent iterative search framework for scientific idea generation combines combinatorial innovation theory with LLM-based planning to reduce repetitive outputs.

Shuai Chen·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Measuring the Machine: Evaluating Generative AI as Pluralist Sociotechical Systems

Conceptual critique of generative AI evaluation paradigms argues benchmarks constitute rather than measure models and calls for descriptive alternatives honoring pluralist contexts.

Rebecca L. Johnson·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Evian: Towards Explainable Visual Instruction-tuning Data Auditing

Evian framework provides fine-grained semantic auditing of vision-language model training data to identify logical fallacies and factual errors beyond coarse-grained quality scores.

Zimu Jia·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Aligning Stuttered-Speech Research with End-User Needs: Scoping Review, Survey, and Guidelines

Scoping review and survey of stuttered-speech research identifies gaps between current systems and end-user needs; recommends grounding speech tech in stakeholder experiences.

Hawau Olamide Toyin·2 months ago

r/LocalLLaMA· COMMUNITY

Qwen3.6-27B released!

Alibaba releases Qwen3.6-27B, open-source 27B dense model with agentic coding surpassing larger models, Apache 2.0 licensed.

u/ResearchCrafty1804·2 months ago·681 pts / 140 comm

← Front Page30 stories

← Newer Older →