The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

LLMs can personalize education, although current static-prompt tutoring systems struggle to adapt to diverse academic disciplines. We develop and test a system with subject-aware prompting, based on 14 pedagogical features (e.g., tutor scaffolding, student understanding) extracted from raw transcripts. We first train a prompt routing model in a simulation environment, and then deploy it for online adaptation with actual high-school students. The simulation benchmark shows the router outperforming two static baselines ($0.694$ vs. $0.647$ and $0.64$, $p<0.001$). A/B testing ($N=656$ conversati...

Po-Chin Chang·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

PASQA: Pitch-Accent-Focused Speech Quality Assessment Model Trained on Synthetic Speech with Accent Errors

Existing mean opinion score (MOS) prediction models typically predict utterance-level naturalness MOS and can be insensitive to localized pitch-accent errors. We propose Pitch-Accent-focused Speech Quality Assessment (PASQA), which explicitly targets pitch-accent correctness. To train our model, we construct a controlled Japanese accent-error dataset by changing accent patterns using an accent-controllable text-to-speech system, and compute a pseudo accent-quality score from the accent-error rate. PASQA builds on self-supervised representations and employs mora-conditioned fusion, ranking los...

Masaya Kawamura·4 days ago

TechCrunch AI· PRESS

Pixi’s new iOS app turns text messages into interactive AR experiences

Forget stickers, GIFs, and emoji reactions. Pixi is betting that the next evolution of messaging is interactive augmented reality (AR).

Lauren Forristal·4 days ago

OpenAI· FRONTIER

Improving health intelligence in ChatGPT

Learn how GPT-5.5 Instant improves ChatGPT’s health and wellness responses with stronger reasoning, better context, clearer communication, and physician-informed evaluations.

OpenAI·4 days ago

Stratechery· ANALYST

An Interview with Michael Morton About E-Commerce in the Age of AI

An interview with Michael Morton about e-commerce and AI, including the challenges of unfalsifiable bear cases, distribution versus referal models, grocery, and autonomous vehicles.

Ben Thompson·4 days ago

OpenAI· FRONTIER

Using AI to help physicians diagnose rare genetic diseases affecting children

Researchers used an OpenAI reasoning model to help diagnose rare diseases, identifying 18 new diagnoses in previously unsolved cases.

OpenAI·4 days ago

Latent Space· ANALYST

[AINews] Midjourney Medical: scan your organs like you step on a scale

The only bootstrapped frontier lab announces its second product and second

Latent Space·4 days ago

The Verge AI· PRESS

Midjourney Medical goes from generating ‘cat images’ to full-body ultrasound scans

“A scan of an imaging phantom, segmented to validate how cleanly structures separate under controlled conditions.“ | Image: Midjourney Medical Midjourney CEO David Holz just showed off the company's first hardware product and plans to build a San Francisco spa, which he admitted is a bit different from the "cat pictures" produced by its AI image generator. Dubbed The Midjourney Scanner, it's an ultrasound-based full-body scanner that uses a ring of sensors to capture vertical slices of the inside of your body, looking at the composition of your muscle, fat, bone, and organs to start. Holz sai...

Richard Lawler·4 days ago

TechCrunch AI· PRESS

How to turn off AI in your Google Docs

Here's what you need to do to get those pesky "write with Gemini" pop-ups to go away.

Amanda Silberling·4 days ago

Hugging Face· INFRA

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Hugging Face·4 days ago

Hugging Face· INFRA

Is it agentic enough? Benchmarking open models on your own tooling

Hugging Face·4 days ago

Simon Willison· ANALYST

GLM-5.2 is probably the most powerful text-only open weights LLM

Chinese AI lab Z.ai released GLM-5.2 to their coding plan subscribers on June 13th, and then yesterday (June 16th) released the full open weights under an MIT license. Similar in size to their previous GLM-5 and GLM-5.1 releases, this is 753B parameter, 1.51TB monster - with 40 active parameters (Mixture of Experts). GLM-5.2 is a text input only model - Z.ai have a separate vision family most recently represented by GLM-5V-Turbo , but that one isn't open weights. GLM-5.2 has a 1 million token context window, up from GLM-5.1's 200,000. The buzz around this model is strong. Artificial Analysis,...

Simon Willison·4 days ago

TechCrunch AI· PRESS

Roelof Botha joins SpaceX’s board of directors

The former Sequoia Capital leader is filling an "existing vacancy" on SpaceX's board, days after the company went public in the largest IPO ever.

Sean O'Kane·4 days ago

TechCrunch AI· PRESS

After unveiling ridiculously expensive AR glasses, Snap’s stock takes a dive

Snap's long-awaited smart glasses debut hasn't exactly done wonders for the company's stock.

Lucas Ropek·4 days ago

Anthropic· FRONTIER

Anthropic opens Seoul office and announces new partnerships across the Korean AI ecosystem

Anthropic·4 days ago

Ars Technica AI· PRESS

AI coding agents can autonomously direct robot training

NVIDIA’s self-improvement program for robots enlists teams of AI coding agents.

Jeremy Hsu ·4 days ago

TechCrunch AI· PRESS

World leaders want American AI. They just don’t want America to be able to turn it off.

French President Macron and Indian PM Modi raised alarms at the G7 summit that the U.S. could cut off access to American AI overnight — a fear the Anthropic blackout just made real.

Rebecca Bellan·4 days ago

TechCrunch AI· PRESS

Anthropic becomes first AI startup to join the Frontier carbon removal coalition

Anthropic has joined the Frontier coalition, which received another $915M in pledges to fund carbon removal projects.

Tim De Chant·4 days ago

The Verge AI· PRESS

Anthropic got hit by export rules nobody understands

Anthropic has spent much of this week fighting to get its newest AI models back online after the Trump administration abruptly ordered the company to cut access for all foreign nationals, including users inside the US and its own employees, forcing Anthropic to block access to Fable 5 and Mythos 5 for everyone. "To my knowledge, this is the first time US export controls have been used to control access to an AI model in this way." The Trump administration has not publicly explained the legal basis for the order, but in a statement on its website, Anthropic said the government cited "national ...

Robert Hart·4 days ago

TechCrunch AI· PRESS

Social media’s next evolution: user-controlled algorithms

Social media feeds are becoming more customizable as platforms like Threads, Instagram, and TikTok introduce tools that let users directly influence the algorithms powering their recommendations.

Aisha Malik·4 days ago

TechCrunch AI· PRESS

NEA’s Tiffany Luck on AI IPOs, personal agents, and the ROI reckoning

Tokenmaxxing was the hottest trend in Silicon Valley earlier this year, with CEOs encouraging employees to push AI usage as far as it would go. Then the bill came due. Uber reportedly blew through its annual AI budget in a few months, some companies cut Claude licenses for parts of their org, and Meta killed its internal leaderboard. This tension between […]

Rebecca Bellan, Theresa Loconsolo·4 days ago·+ covered by others

arXiv (cs.AI/CL/LG)· ACADEMIA

Native Active Perception as Reasoning for Omni-Modal Understanding

Passive models for long video understanding typically rely on a "watch-it-all" paradigm, processing frames uniformly regardless of query difficulty, causing computational cost to grow with video duration. Although interactive frameworks have emerged, they often rely on global pre-scanning, and their context cost still scales with video length. We propose OmniAgent, the first native omni-modal agent that formulates video understanding as a POMDP-based iterative Observation-Thought-Action cycle. OmniAgent executes on-demand actions to selectively distill audio-visual cues into a persistent text...

Zhenghao Xing·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Learning User Simulators with Turing Rewards

Learning to simulate human users in interactive settings could advance the training of agent assistants, evaluation of personalization systems, research in the social sciences, and more. Existing approaches generally do so by training a large language model (LLM) to match a single ground truth response, either by maximizing the log probability or by using a similarity reward. We instead propose {Turing-RL}: a Turing-Test-based reinforcement learning approach for training user simulator models. {Turing-RL} uses a discriminative Turing reward with an LLM judge to score how indistinguishable a g...

Yingshan Susan Wang·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Freeing the Law with LOCUS: A Local Ordinance Corpus for the United States

Progress in legal AI increasingly depends on access to authoritative legal text at scale. Yet one of the most consequential layers of American law remains largely absent from existing machine-readable corpora: local ordinances. Local codes govern zoning, housing, business licensing, public health, noise, animal control, and many other domains of everyday regulation, but they are fragmented across vendor platforms designed for human browsing rather than bulk research access. We introduce LOCUS - the Local Ordinance Corpus for the United States - a comprehensive corpus and county-harmonized acc...

Denis Peskoff·4 days ago

Latent Space· ANALYST

🔬 The Self-Driving Lab — Joseph Krause, Radical AI

Radical AI's Joseph Krause on why the moat in materials is the lab, not the model

Brandon Anderson·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Chandra-Gaia Catalog of Counterparts: Resolving ambiguous Gaia matches to X-ray sources in the Chandra Source Catalog using Machine Learning

We present a framework to cross-match sources from the Chandra Source Catalog (CSC v2.1) with optical sources from Gaia Data Release 3. Unlike purely spatial approaches, we use source properties such as magnitudes, colors, and distances to identify true counterparts, detect chance coincidences, and resolve ambiguities when multiple plausible candidates exist. We define a training set of high-confidence matches using NWAY, a Bayesian cross-matching framework that accounts for positional errors and source densities. We train a gradient-boosted classifier (LightGBM) on a variety of features from...

V. Samuel Pérez-Díaz·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

UBP2: Uncertainty-Balanced Preference Planning for Efficient Preference-based Reinforcement Learning

Preference-based RL provides an approach to learning reward models from pairwise comparisons of behaviors, bypassing the need for explicit reward design. However, existing methods typically rely on passive data collection and suffer from poor sample efficiency, especially during the early stages of learning. We introduce a model-based approach that actively directs exploration by jointly reasoning over uncertainties in the reward, dynamics, and value functions. Our method, Uncertainty-Balanced Preference Planning (UBP2), uses ensembles of reward, dynamics, and value function models to evaluat...

Mohamed Nabail·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Rethinking Reward Supervision: Rubric-Conditioned Self-Distillation

Post-training of reasoning language models is commonly driven by supervised distillation and reinforcement learning with verifiable rewards. Distillation often relies on chain-of-thought annotations that are expensive to obtain and may themselves be noisy, incomplete, or partially incorrect; even when the final solution is correct, an imperfect rationale can interfere with learning. Reinforcement learning with verified rewards, on the other hand, typically compresses evaluative feedback into a scalar signal, obscuring which aspects of a response should be improved. We propose \textbf{Rubric-C...

Siyi Gu·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Reference-Driven Multi-Speaker Audio Scene Generation from In-the-Wild Priors

Existing multi-speaker dialogue systems bind speakers to utterances through structured supervision: per-turn tags, multi-stream transcriptions, or learnable speaker embeddings. These systems operate within speech-only pipelines that produce clean vocal sequences without the ambient texture of real conversations. We take a different approach. Our method, ScenA, conditions a text-to-audio flow-matching foundation model, pretrained on large-scale in-the-wild data, directly on multiple reference voices and a free-form natural language prompt that describes an entire multi-speaker audio scene. Lev...

Michael Finkelson·4 days ago

Ars Technica AI· PRESS

"Dangerous" AI models are coming no matter what

AI models with advanced hacking capabilities will soon be the norm.

Lily Hay Newman, WIRED.com ·4 days ago

← Front Page30 stories

← Newer Older →

The Archive

Learning to Prompt: Improving Student Engagement with Adaptive LLM-based High-School Tutoring

PASQA: Pitch-Accent-Focused Speech Quality Assessment Model Trained on Synthetic Speech with Accent Errors

Pixi’s new iOS app turns text messages into interactive AR experiences

Improving health intelligence in ChatGPT

An Interview with Michael Morton About E-Commerce in the Age of AI

Using AI to help physicians diagnose rare genetic diseases affecting children

[AINews] Midjourney Medical: scan your organs like you step on a scale

Midjourney Medical goes from generating ‘cat images’ to full-body ultrasound scans

How to turn off AI in your Google Docs

Beyond LoRA: Can you beat the most popular fine-tuning technique?

Is it agentic enough? Benchmarking open models on your own tooling

GLM-5.2 is probably the most powerful text-only open weights LLM

Roelof Botha joins SpaceX’s board of directors

After unveiling ridiculously expensive AR glasses, Snap’s stock takes a dive

Anthropic opens Seoul office and announces new partnerships across the Korean AI ecosystem

AI coding agents can autonomously direct robot training

World leaders want American AI. They just don’t want America to be able to turn it off.

Anthropic becomes first AI startup to join the Frontier carbon removal coalition

Anthropic got hit by export rules nobody understands

Social media’s next evolution: user-controlled algorithms

NEA’s Tiffany Luck on AI IPOs, personal agents, and the ROI reckoning

Native Active Perception as Reasoning for Omni-Modal Understanding

Learning User Simulators with Turing Rewards

Freeing the Law with LOCUS: A Local Ordinance Corpus for the United States

🔬 The Self-Driving Lab — Joseph Krause, Radical AI

The Chandra-Gaia Catalog of Counterparts: Resolving ambiguous Gaia matches to X-ray sources in the Chandra Source Catalog using Machine Learning

UBP2: Uncertainty-Balanced Preference Planning for Efficient Preference-based Reinforcement Learning

Rethinking Reward Supervision: Rubric-Conditioned Self-Distillation

Reference-Driven Multi-Speaker Audio Scene Generation from In-the-Wild Priors

"Dangerous" AI models are coming no matter what