The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Using Embedding Models to Improve Probabilistic Race Prediction

Proposes embedding models to improve race prediction for underrepresented surnames absent from Census data used in BISG.

Noan Dasanaike·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

QDTraj: Exploration of Diverse Trajectory Primitives for Articulated Objects Robotic Manipulation

QDTraj generates diverse trajectory primitives for robot manipulation of articulated objects in open-ended household environments.

Mathilde Kappel·20 days ago

TechCrunch AI· PRESS

Nothing introduces an AI-powered dictation tool

Nothing's new on-device dictation tool supports over 100 languages.

Ivan Mehta·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ArmSSL: Adversarial Robust Black-Box Watermarking for Self-Supervised Learning Pre-trained Encoders

ArmSSL adds black-box watermarking to SSL encoders with robustness against adversarial removal for IP protection.

Yongqi Jiang·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Multi-output Extreme Spatial Model for Complex Aircraft Production Systems

Proposes extreme value modeling for rare failure events in aircraft manufacturing rather than mean-response prediction.

Cheolhei Lee·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Controllable Spoken Dialogue Generation: An LLM-Driven Grading System for K-12 Non-Native English Learners

Framework adapts LLM dialogue generation to K-12 English learner proficiency levels using CSE curriculum grading system.

Haidong Yuan·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

On the Properties of Feature Attribution for Supervised Contrastive Learning

Analyzes feature attribution properties in supervised contrastive learning versus cross-entropy classification approaches.

Leonardo Arrighi·20 days ago

TechCrunch AI· PRESS

DeepSeek previews new AI model that ‘closes the gap’ with frontier models

DeepSeek says both models are more efficient and performant than DeepSeek V3.2 due to architectural improvements, and have almost "closed the gap" with current leading models, both open and closed, on reasoning benchmarks.

Ram Iyer·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

An Integrated Framework for Explainable, Fair, and Observable Hospital Readmission Prediction: Development and Validation on MIMIC-IV

Framework for hospital readmission prediction on MIMIC-IV with explainability (SHAP), fairness evaluation, and deployment reliability.

Isaac Tosin Adisa·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

FeatEHR-LLM: Leveraging Large Language Models for Feature Engineering in Electronic Health Records

FeatEHR-LLM uses LLMs to generate clinically meaningful features from irregular EHR time series while limiting privacy exposure.

Hojjat Karami·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment

RouteLMT learns to route machine translation requests between small and large LLMs based on comparative quality improvement, reducing deployment costs.

Yingfeng Luo·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Aggregate vs. Personalized Judges in Business Idea Evaluation: Evidence from Expert Disagreement

PBIG-DATA dataset with 3K expert scores on LLM-generated business ideas tests whether evaluation judges should model consensus or individual evaluator preferences.

Wataru Hirota·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Different Strokes for Different Folks: Writer Identification for Historical Arabic Manuscripts

Establishes baselines for writer identification in historical Arabic manuscripts using the Muharaf dataset with line-level and page-disjoint protocols.

Hamza A. Abushahla·20 days ago

r/OpenAI· COMMUNITY

Appreciations for work mode in Codex. On track to becoming the first real super app

Reddit user praises Codex work mode, speculates OpenAI building a super app platform.

u/py-net·20 days ago·52 pts / 10 comm

r/singularity· COMMUNITY

This is getting insane (image gen 2)

Reddit post sharing OpenAI image generation samples without technical details, benchmarks, or release announcement.

u/duselkay·20 days ago·103 pts / 30 comm

r/LocalLLaMA· COMMUNITY

Anthropic admits to have made hosted models more stupid, proving the importance of open weight, local models

Anthropic reduced Claude Sonnet 4.6 and Opus 4.6 reasoning effort and pruned session memory for latency, then reverted after user feedback.

u/spaceman_·20 days ago·76 pts / 28 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Measuring and Mitigating Persona Distortions from AI Writing Assistance

Large-scale study (N=14K) shows AI writing assistance distorts perceived writer persona across 29 dimensions including politics, personality, and identity.

Paul Röttger·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Decoding High-Dimensional Finger Motion from EMG Using Riemannian Features and RNNs

End-to-end deep learning framework for continuous finger motion estimation from forearm EMG using Riemannian features and RNNs for prosthetic control.

Martin Colot·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CGC: Compositional Grounded Contrast for Fine-Grained Multi-Image Understanding

CGC framework improves MLLMs' fine-grained multi-image understanding by addressing spatial hallucination and attention leakage through compositional grounding.

Lihao Zheng·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Deep Learning for Model Calibration in Simulation of Itaconic Acid Production

Compares deep learning strategies (DDL and conditional flow matching) for kinetic parameter estimation in itaconic acid fermentation simulation.

Daria Fokina·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

FedSPDnet: Geometry-Aware Federated Deep Learning with SPDnet

FedSPDnet introduces geometry-preserving federated learning aggregation strategies for symmetric positive definite matrices with Stiefel constraints.

Thibault Pautrel·20 days ago

r/ClaudeAI· COMMUNITY

Claude limits no longer round to the nearest hour

Claude's usage limits no longer reset on hourly boundaries, preventing strategic timing exploits.

u/Shipposting_Duck·20 days ago·66 pts / 11 comm

The Verge AI· PRESS

Elon Musk and Sam Altman’s court showdown will dish the dirt

Might as well jump, as the poet David Lee Roth once said. | Image: Cath Virginia / The Verge Elon Musk cofounded OpenAI, and then flounced off in a huff when he wasn't anointed CEO, leaving Sam Altman as the last power-hungry man standing. Now, Musk is back with a lawsuit, and a trial is scheduled to start in Oakland, California, on April 27th. Theoretically, it's a legal case about whether OpenAI defrauded Musk. But that's not really what we're all doing here. This is about mess. Over the past couple of years, Musk's legal theories for punishing OpenAI have run the gamut from breach of contr...

Elizabeth Lopatto·20 days ago

TechCrunch AI· PRESS

In another wild turn for AI chips, Meta signs deal for millions of Amazon AI CPUs

Meta has commandeered a big chunk of Amazon's homegrown CPUs (not GPUs) for AI agentic workloads, signaling that a new kind of chip race has begun.

Julie Bort·20 days ago

r/LocalLLaMA· COMMUNITY

DeepSeek V4 is built different...

Reddit discussion of DeepSeek V4 capabilities; original Chinese content translated, lacks substantive technical details.

u/Alternative-Duty-532·20 days ago·170 pts / 26 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Contrastive Semantic Projection: Faithful Neuron Labeling with Contrastive Examples

Two-stage contrastive semantic projection method sharpens neuron-level interpretability labels in deep networks using contrastive examples.

Oussama Bouanani·20 days ago

r/MachineLearning· COMMUNITY

Is the ds/ml slowly being morphed into an AI engineer? [D]

Agents are amazing. Harnesses are cool. But the fundamental role of a data scientist is not to use a generalist model in an existing workflow; it's a completely different field. AI engineering is the body of the vehicle, whereas the actual brain/engine behind it is the data scientist's playground. I feel like I am not alone in this realisation that my role somehow got silently morphed into that of an AI engineer, with the engine's development becoming a complete afterthought. Based on industry requirements and ongoing research, most of the work has quietly shifted from building the engine t...

u/The-Silvervein·20 days ago·32 pts / 8 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

All Eyes on the Workflow: Automated and Efficient Event Discovery from Video Streams

SnapLog extracts event data from video streams via image embeddings and temporal segmentation for business process mining and workflow analysis.

Marco Pegoraro·20 days ago

r/singularity· COMMUNITY

Exactly 1 year ago, Anthropic said fully AI employees were just 1 year away

Reddit post recalling Anthropic's 1-year timeline claim for fully autonomous AI employees from a year prior; no new announcement.

u/Distinct-Question-16·20 days ago·251 pts / 57 comm

r/ClaudeAI· COMMUNITY

I'm somewhat of a coder myself

u/Flope·20 days ago·48 pts / 9 comm

← Front Page30 stories

← Newer Older →