Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2
AttnLRP explanation method applied to DNABERT-2 genome language model reveals whether Transformer attention captures relevant genomic patterns versus CNNs.
A-IC3 augments IC3 hardware model checking with learning-guided inductive generalization to accelerate counterexample generalization and clause synthesis.
He said it on the [Dwarkesh Podcast](https://mrkt30.com/anthropic-mythos-triggers-chinas-ai-arms-frenzy/) this week, and I have not been able to stop thinking about it. His argument was not that China is not a threat. It was that cutting them off and treating them as an enemy is probably not the smartest long-term play. His actual words were that victimising them and turning them into an enemy likely is not the best answer. The context here is Huawei targeting 750,000 AI chip shipments this year. It is nowhere near Nvidia's compute, but the direction of travel is clear. And if DeepSeek ends u...
Earlier this month, millions of OpenClaw users woke up to a sweeping mandate: The viral AI agent tool, which this year took the worldwide tech industry by storm, had been severely restricted by Anthropic. Anthropic, like other leading AI labs, was under immense pressure to lessen the strain on its systems and start turning a profit. So if the users wanted its Claude AI to power their popular agents, they'd have to start paying handsomely for the privilege. "Our subscriptions weren't built for the usage patterns of these third-party tools," wrote Boris Cherny, head of Claude Code, on X. "We wa...
GEM: smooth rational activation functions matching ReLU performance with C^2N differentiability for deep networks.
Maggie Appleton on social signaling benefits of public learning via blogging and podcasting.
Framework jointly models annotator-specific NLI predictions and explanations using User Passport mechanism for perspective-aware rationales.
***"C.C., old buddy, why did you write 50 lines of code to ensure a constant wasn't mutable?"*** I love Opus, man. "He" reminds me of an old friend who was absolutely brilliant, but give him too many bong hits and he was off in a rabbit hole talking about UFOs, fifth-dimensional travel and, "Bob Lazar is full of shit, man!" The mods wanted me to provide the 50-line sample that backs up my opening quote (rightfully so). It happened with work code, so I can't copypasta, but that little ditty went something like this: *(insert slow jazz here)* ^(1) import inspect import sys impor...
SAIL: solver-aligned initialization learning improves SCF convergence for molecular geometry by optimizing supervision targets, not extrapolation.
Causal disentanglement paradigm for full-reference image quality assessment decouples degradation and content via intervention on latent representations.
R-DCNN: dilated CNN with resampling for low-power periodic signal denoising and waveform estimation under resource constraints.
GS-Quant: granular semantic quantization framework aligns LLM tokens with graph embeddings for knowledge graph completion via hierarchical discrete codes.
Astronomers are turning to GPUs to find needles in the galactic haystack.
Dask-based distributed product quantization and inverted indexing for large-scale approximate nearest neighbor search.
Multi-task RL discovers task-specific subnetworks for interpretable, adaptive autonomous underwater vehicle control under uncertainty.
Geometric characterization of trajectory matching for clinical dataset condensation reveals supervision signal structure and synthetic data scaling.
Edge deployment and multilingual LMs for Global South: addresses last-mile challenge where multilinguality and hardware constraints intersect.
Transformers fail on unseen symbolic reasoning due to unembedding collapse and token copying difficulty, limiting out-of-distribution generalization.
N-gram models match LSTM/Transformer accuracy on event-log prediction with lower resources and better stability than neural baselines.
A-THENA uses time-aware Transformer encoding for early IoT intrusion detection with temporal packet dynamics.
Verbal Process Supervision uses structured natural-language critique as training-free inference scaling, improving GPT-5 reasoning on GPQA, AIME, and LiveCodeBench.
ASP(Q) handles inconsistent prioritized data with three optimal repair semantics and polynomial-hierarchy query complexity.
Memristor-based reservoir computing reduces parameter overhead for image classification via preprocessing and device dynamics.
Machine Learning interpretability as Non-Functional Requirement remains unverifiable; proposes provenance-based measurement framework.
DryRUN removes dependency on human-provided test cases for LLM code generation by automating test discovery in multi-agent frameworks.
Multivariate Kernel Score for conformal prediction adapts to residual geometry and connects Bayesian to frequentist uncertainty quantification.
Non-English prompts improve LLM reasoning performance; language acts as latent variable modulating internal inference rather than output medium.
Reddit user praise for OpenAI's Image 2.0 capability; no substantive technical details or announcement provided.
Beehiiv is clearly done being just a newsletter platform, judging by today's launch of a new webinar feature, customizable paywalls, and more.
Google demonstrates TPU infrastructure capabilities for scaling AI workloads via video content.