The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Domain-Adapted Small Language Models for Reliable Clinical Triage

Qwen2.5-7B and open-source small models achieve ESI triage accuracy via clinical vignettes without fine-tuning.

Manar Aljohani·9 days ago

OpenAI· FRONTIER

Building the compute infrastructure for the Intelligence Age

OpenAI scales Stargate compute infrastructure to expand data center capacity for large-scale AI training and deployment.

OpenAI·9 days ago

r/Anthropic· COMMUNITY

Now Claude payments are throwing Stripe internal server errors too, what is going on?

Reddit user reports Stripe payment errors on Claude platform alongside account ban issues.

u/olorusopk·9 days ago·10 pts / 10 comm

r/ClaudeAI· COMMUNITY

The "Mother-In-Law Method" - How to get the best code reviews with Claude

Reddit user shares prompt engineering technique to elicit more critical code reviews from Claude by framing code as written by an annoying person.

u/Ancient_Perception_6·9 days ago·26 pts / 12 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework

Probabilistic Transformer framework interprets self-attention as factor graphs; investigation extends PT to time series.

Zhangzhi Xiong·9 days ago

r/MachineLearning· COMMUNITY

An interactive semantic map of the latest 10 million published papers [P]

I built a map to help navigate the complex scientific landscape through spatial exploration. How it works: Sourced the latest 10M papers from OpenAlex and generated embeddings using SPECTER 2 on titles and abstracts. Reduced dimensionality with UMAP, then applied Voronoi partitioning on density peaks to create distinct semantic neighborhoods. The floating topic labels are generated via custom labelling algorithms (definitely still a work in progress!). There is also support for both keyword and semantic queries, and there's an analytics layer for ranking institutions, authors, and topi...

u/icannotchangethename·9 days ago·35 pts / 6 comm

The Verge AI· PRESS

Tumbler Ridge families sue OpenAI for not alerting police to the suspect’s ChatGPT activity

Seven families of victims injured or killed in the Tumbler Ridge school shooting in Canada have filed lawsuits against OpenAI and CEO Sam Altman, accusing the company and its leadership of negligence after they failed to alert police to the suspected shooter's ChatGPT activity. The families allege OpenAI stayed silent after its systems flagged activity by shooting suspect Jesse Van Rootselaar in order to protect the company's reputation and upcoming initial public offering (IPO). The Wall Street Journal reports that OpenAI "considered" flagging the 18-year-old's activity to police, which repo...

Emma Roth·9 days ago

The Verge AI· PRESS

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO

ChatGPT is struggling to keep up its once-explosive growth as users uninstall the app or opt for rival chatbots instead. According to data from market intelligence firm Sensor Tower, ChatGPT experienced a 132 percent increase in uninstalls year over year in April. Its uninstall rate was even higher last month, up 413 percent year-over-year, following OpenAI's deal with the Pentagon in February. While ChatGPT is still growing its user base, Sensor Tower says that growth is slowing down - ChatGPT increased its monthly active users by 168 percent in January, but only 78 percent in April. ChatGPT...

Stevie Bonifield·9 days ago

r/Anthropic· COMMUNITY

How to ruin a company's goodwill in 4 weeks

Reddit discussion critiquing Anthropic's recent decisions and their reputational impact; lacks specifics on incidents or timeline.

u/dmd·9 days ago·24 pts / 10 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

FutureWorld: live environment for training LLM-based agents on real-world outcome prediction and continual learning.

Zhixin Han·9 days ago

r/LocalLLaMA· COMMUNITY

16x DGX Sparks - What should I run?

Reddit discussion of 16x NVIDIA DGX Spark cluster setup for home lab; seeks model recommendations.

u/Kurcide·9 days ago·81 pts / 90 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Swap distance minimization shapes the order of subject, object and verb in languages of the world

Swap distance minimization principle explains word order variation across linguistic families independent of dominant SOV/SVO.

Jairo Rios-El-Yazidi·9 days ago

r/LocalLLaMA· COMMUNITY

IK_LLAMA now supports Qwen3.5 MTP Support :O

IK_LLAMA.cpp adds Qwen3.5 MTP support with 50% throughput gain (18-20→30 tok/s) via pipeline parallelism on 27B model.

u/fragment_me·9 days ago·40 pts / 16 comm

r/OpenAI· COMMUNITY

AI Safety Researcher: I wrote about neuralese as a cautionary tale ... AI Researchers: At long last, we invented neuralese from the classic paper, Don't Let The Machines Speak In Neuralese

AI safety researcher discusses neuralese—emergent uninterpretable communication between models—as alignment risk, referencing classic adversarial example literature.

u/EchoOfOppenheimer·9 days ago·53 pts / 13 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

CurEvo: Curriculum-Guided Self-Evolution for Video Understanding

CurEvo integrates curriculum learning into self-evolution frameworks for autonomous video understanding without annotations.

Guiyi Zeng·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A self-evolving agent for explainable diagnosis of DFT-experiment band-gap mismatch

XDFT: closed-loop agent automates diagnosis of DFT band-gap mismatches via hypothesis generation and refinement.

Yue Li·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising

X-WAM unifies 4D world modeling and robotic action execution via multi-view RGB-D video prediction with diffusion priors.

Jun Guo·9 days ago

r/Anthropic· COMMUNITY

The shilling of the /schedule feature is out of control

I'm much more sympathetic towards Anthropic than most users here. Started using CC when it was barely useable and think they are the good guys dealing with a real supply crunch. But every session I get prompted a dozen times to /schedule random tasks for two weeks in advance. Even small features "Want me to /schedule a check in for 2 weeks when this is live"? I realize they are tryign to scale to $100b in a year... they should focus on the product not shilling

u/ianm818·9 days ago·10 pts / 5 comm

The Verge AI· PRESS

Larry’s risky business

Oracular spectacular? | Image: Cath Virginia / The Verge If you want to know whether the AI bubble is bursting, there's only one publicly traded company that will tell you: Oracle. That's right, the database company. Oracle has burned its boats and pivoted to AI, but not in any kind of usual way. It is not a foundation model builder like OpenAI or Anthropic, obviously. It's not quite a neocloud, though it has entered the same bare-metal business as CoreWeave. It is a software-as-a-service company that has made an audacious bet on a very specific future version of AI as Oracle's traditional bu...

Elizabeth Lopatto·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Atomic-Probe Governance for Skill Updates in Compositional Robot Policies

Cross-version skill swap analysis reveals dominant-skill effects in compositional robotic policies on manipulation tasks.

Xue Qin·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Toolkit for Detecting Spurious Correlations in Speech Datasets

Toolkit detects spurious correlations in speech datasets that inflate performance metrics, critical for clinical/health AI validation.

Lara Gauder·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Parameterized Quantum Circuits as Feature Maps: Representation Quality and Readout Effects in Multispectral Land-Cover Classification

Variational quantum classifiers evaluated on satellite land-cover classification vs classical baselines using EuroSAT-MS dataset.

Ralntion Komini·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Laplace Approximation for Bayesian Tensor Network Kernel Machines

Laplace approximation method for uncertainty quantification in tensor network kernel machines as alternative to Gaussian Processes.

Albert Saiapin·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy

Framework for trustworthy clinical AI combining deterministic logic, patient-specific reasoning, and staged autonomy rather than end-to-end black boxes.

Serhii Zabolotnii·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Differentially-Private Text Rewriting reshapes Linguistic Style

Differential privacy text rewriting via LLMs preserves privacy but alters linguistic style and register beyond lexical level.

Stefan Arnold·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Bandit's Blind Spot: The Critical Role of User State Representation in Recommender Systems

Study on user state representation impact in contextual multi-armed bandit recommender systems for personalization.

Pedro R. Pires·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

When to Retrieve During Reasoning: Adaptive Retrieval for Large Reasoning Models

ReaLM-Retrieve framework adapts RAG for reasoning models like DeepSeek-R1 and o1 via step-level uncertainty detection and dynamic retrieval.

Dongxin Guo·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SciHorizon-DataEVA: An Agentic System for AI-Readiness Evaluation of Heterogeneous Scientific Data

SciHorizon-DataEVA agentic system evaluates AI-readiness of heterogeneous scientific data using Sci-TQA2 principles for AI4Science workflows.

Dianyu Liu·9 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

When to Vote, When to Rewrite: Disagreement-Guided Strategy Routing for Test-Time Scaling

Disagreement-guided routing strategy selects test-time scaling method (voting/rewriting) for Large Reasoning Models based on output disagreement.

Zhimin Lin·9 days ago

TechCrunch AI· PRESS

Firestorm Labs raises $82M to take drone factories into the field

A defense startup just raised $82 million to put drone factories inside shipping containers and bring manufacturing to the front lines.

Kate Park·9 days ago

← Front Page30 stories

← Newer Older →

The Archive

Domain-Adapted Small Language Models for Reliable Clinical Triage

Building the compute infrastructure for the Intelligence Age

Now Claude payments are throwing Stripe internal server errors too, what is going on?

The "Mother-In-Law Method" - How to get the best code reviews with Claude

Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework

An interactive semantic map of the latest 10 million published papers [P]

Tumbler Ridge families sue OpenAI for not alerting police to the suspect’s ChatGPT activity

ChatGPT downloads are slowing — and may cause problems for OpenAI&#8217;s IPO

How to ruin a company's goodwill in 4 weeks

FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

16x DGX Sparks - What should I run?

Swap distance minimization shapes the order of subject, object and verb in languages of the world

IK_LLAMA now supports Qwen3.5 MTP Support :O

AI Safety Researcher: I wrote about neuralese as a cautionary tale ... AI Researchers: At long last, we invented neuralese from the classic paper, Don't Let The Machines Speak In Neuralese

CurEvo: Curriculum-Guided Self-Evolution for Video Understanding

A self-evolving agent for explainable diagnosis of DFT-experiment band-gap mismatch

Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising

The shilling of the /schedule feature is out of control

Larry’s risky business

Atomic-Probe Governance for Skill Updates in Compositional Robot Policies

A Toolkit for Detecting Spurious Correlations in Speech Datasets

Parameterized Quantum Circuits as Feature Maps: Representation Quality and Readout Effects in Multispectral Land-Cover Classification

Laplace Approximation for Bayesian Tensor Network Kernel Machines

From Black-Box Confidence to Measurable Trust in Clinical AI: A Framework for Evidence, Supervision, and Staged Autonomy

Differentially-Private Text Rewriting reshapes Linguistic Style

The Bandit's Blind Spot: The Critical Role of User State Representation in Recommender Systems

When to Retrieve During Reasoning: Adaptive Retrieval for Large Reasoning Models

SciHorizon-DataEVA: An Agentic System for AI-Readiness Evaluation of Heterogeneous Scientific Data

When to Vote, When to Rewrite: Disagreement-Guided Strategy Routing for Test-Time Scaling

Firestorm Labs raises $82M to take drone factories into the field

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO