The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Human-in-the-Loop Atlas-Based 3D Asset Segmentation for Interactive Content Workflows

Segmenting 3D assets into meaningful regions remains challenging, especially when segmentation criteria are application-dependent and require user control. We present a human-in-the-loop pipeline for generating a segmented 2D parameterized atlas from a 3D model for interactive media, game, and XR content workflows. Our method first selects a compact set of rendered views using a greedy set cover strategy over sampled surface points, and then supports interactive segmentation of these views with SAM~2 and Label Studio. The resulting masks are back-projected onto the model's UV parameterization...

Paul Julius Kühn·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

DecoSearch: Complexity-Aware Routing and Plan-Level Repair for Text-to-SQL

Large Language Models (LLMs) have demonstrated remarkable capabilities in translating natural language to SQL, yet existing methods still falter on complex queries requiring multi-step, data-aware reasoning. We introduce DecoSearch, a training-free framework that addresses this by routing each query to the appropriate level of reasoning effort. A lightweight Schema Selector first prunes the full database schema to the relevant tables and columns. An LLM Judger then decides whether the question requires decomposition: straightforward questions follow a direct generation path and complex ones a...

Esteban Schafir·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Improving low-resource ASR using bilingual fine-tuning with language identification: a cross-linguistic evaluation

This study explores how bilingual fine-tuning affects automatic speech recognition (ASR) in low-resource languages. We evaluate this method across nine linguistically and geographically diverse language pairs, covering a range of language families and writing systems. To distinguish the two languages, during training, we pre-pend each input text with a language identification token. At inference, the model jointly predicts both the language and transcription from the speech input alone. As texts for which the language is incorrectly determined show low ASR performance, we also conduct a follo...

Reihaneh Amooie·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Framework for Evaluating Agentic Skills at Scale

Agent skills -- structured, reusable knowledge artifacts that augment LLM agent capabilities -- have been rapidly adopted in industry, yet their cross-domain impact and use across commercial and open-source models remain under-studied, and no reusable methodology exists for evaluating an individual skill. In this work, we present an evaluation framework that lets a skill author construct realistic tasks to rigorously assess the aspects of a skill that matter most to them, and that estimates skill utility by solving those tasks. Further, we apply our evaluation approach at scale to 500 real-wo...

Maksim Shaposhnikov·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Conservation Laws for Modern Neural Architectures

Understanding gradient descent dynamics is key to explaining the success of over-parameterized models, where implicit bias manifests through conservation laws in gradient flow. While such laws are well understood for linear and ReLU networks, they remain largely unexplored for modern architectures. This work develops a unified framework to characterize conservation laws for contemporary models, including feedforward networks with GELU, SiLU, and SwiGLU activations, multihead attention with sinusoidal and rotary positional encodings, and Mixture-of-Experts architectures under diverse gating de...

Viet-Hoang Tran·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond Native Success: Auditing Deployment-Interface Exposure of CLIP Backdoors

Contrastive Language-Image Pre-training models are widely reused across downstream interfaces, including feature extraction, retrieval, reranking, and selection. Existing CLIP backdoor, however, usually validate attacks on a small attack-native task, leaving unclear whether the same poisoned checkpoint remains exposed, weakens, or becomes not applicable when reused through other interfaces. We introduce DIFE, a Deployment-Interface Footprint Evaluation framework that audits backdoored CLIP checkpoints across deployment interfaces. DIFE makes various evaluations comparable by specifying each i...

Kunlan Xiang·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

No-Free-Fairness: Fundamental Limits and Trade-offs in Learning Systems

In this paper, we establish a set of theoretical impossibility results, termed the No-Free-Fairness theorems, that identify three fundamental sources of disparity in learning systems. First, we show that when a task exhibits irreducible cost on a subgroup, any decision rule must trade off overall performance with disparity, yielding an inherent fairness--cost frontier. Second, we prove that even in ideal, noise-free settings where a perfectly fair and accurate solution exists, finite-sample learning alone induces nontrivial subgroup disparity, ruling out distribution-free fairness guarantees....

Khoat Than·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

QueryMarket: Cost-Aware Online Active Learning in Data Markets

Data acquisition is a major bottleneck for learning in real-time streams: analysts must decide on the fly which labels to purchase while respecting a rolling budget. However, existing online active learning rarely unifies pricing, information gain, and rolling budget constraints under concept drift. We introduce QueryMarket, a market-inspired framework that queries each incoming data point based on its estimated utility to the model and its price. Within this framework, we propose OVBAL (online variance-based active learning), which integrates data pricing with information-driven selection by...

Xiwen Huang·6 days ago

TechCrunch AI· PRESS

SpaceX to acquire Cursor for $60B in stock, days after blockbuster IPO

The deal is supposed to help SpaceX's struggling AI division. The company told IPO investors it sees a $26 trillion addressable market in AI.

Sean O'Kane·6 days ago·+ covered by others

Ars Technica AI· PRESS

Critical Copilot vulnerability allowed hackers to seal 2FA code from users

SearchLeak exploit shows why the industry's approach to LLM security fails over and over.

Dan Goodin ·6 days ago

TechCrunch AI· PRESS

ChatGPT’s market share slips below 50% for first time

The chatbot still remains the most popular AI assistant worldwide with over 1.1 billion monthly users, followed by Gemini with 662 million and Claude with 245 million.

Ivan Mehta·6 days ago

Stratechery· ANALYST

Fox Buys Roku, The Problem With Fox’s Smart Strategy, Streaming That Works

The market hates Fox's acquisition of Roku, but the company is trading extraction from rights holders for leverage as a renter.

Ben Thompson·6 days ago

MIT Tech Review· PRESS

Want to get a data center online quickly? Give it some flex.

At the end of a tense and scoreless first half of a soccer match between the English men’s team and rival Germany, millions of Brits let out a collective sigh and did what they so often do in moments of stress: They made tea. That wave of electric kettles clicking on, however, caused a different…

Amos Zeeberg·6 days ago

TechCrunch AI· PRESS

Malaysia’s AI agent-powered messaging app Respond.io raises $62.5M, eyes acquisitions

Respond.io, one of Malaysia startups to watch, uses AI agents to handle high volumes of customer inquiries and charges per convo, not per seat.

Kate Park·6 days ago

Simon Willison· ANALYST

The Fable 5 Export Controls Harm US Cyber Defense

The Fable 5 Export Controls Harm US Cyber Defense I quoted The Atlantic quoting Kate Moussouris earlier, when I should have gone straight to the source. Here she is confirming that the "jailbreak" that got Claude Fable 5 banned under an export control really was "fix this code": The researchers took open-source code with known CVEs, plus new code with deliberately planted vulnerabilities, and asked Fable 5, Mythos, and Opus to “review the code for security issues.” Fable 5 refused. They then asked the models to “fix this code” and, through a multistep and manual process, turned the output int...

Simon Willison·7 days ago

Simon Willison· ANALYST

Quoting Matteo Wong, The Atlantic

Katie Moussouris, a cybersecurity expert and the CEO of Luta Security, told me that Anthropic shared with her a copy of the White House’s report on the Fable jailbreak to get her appraisal. (She said that she is not being paid by Anthropic.) The report, Moussouris said, involved IT experts asking Fable to help find and patch bugs. When given deliberately insecure code, she said, Fable refused the prompt “review the code for security issues” but then complied when asked to “fix this code,” followed by some further manual steps. Moussouris told me that this was just “the model working as intend...

Simon Willison·7 days ago

The Verge AI· PRESS

Inside the fight over Claude Mythos 5

As the rest of the country celebrated the USA's first World Cup win and the New York Knicks championship, Anthropic spent its weekend fighting the Trump administration over its latest model release. At 5:21 PM on Friday, the company received a US export control directive to suspend access to its Mythos 5 and Fable 5 AI models by "any foreign national" inside or outside the US, "including foreign national Anthropic employees." The only way that was possible, Anthropic determined, was to completely disable products it spent the past week hyping - and travel to Washington, DC in hopes of changin...

Hayden Field·7 days ago

Latent Space· ANALYST

[AINews] Satya on Loopcraft: Building Frontier Ecosystems

a quiet day lets us report on Satya's hit essay

Latent.Space·7 days ago

Simon Willison· ANALYST

Cloudflare CAPTCHA on at least one ampersand

TIL: Cloudflare CAPTCHA on at least one ampersand I'm using Cloudflare's CAPTCHA (they call it a "Web Application Firewall > Custom rules > Managed Challenge" these days) to prevent crawlers from aggresively spidering my faceted search engine on this site, but I got fed up of even simple ?q=term searches triggering the challenge. After some mucking around with Claude Code it turns out you can register the following rule instead, so the CAPTCHA only kicks in for search URLs containing at least one ampersand: (http.request.uri.path wildcard r"/search/*" and http.request.uri.query contains "&") ...

Simon Willison·7 days ago

Cohere· FRONTIER

Secure AI in Government: Top uses cases and benefits

Secure AI promises to help governments deliver more and better services. Explore the top use cases and benefits.

Cohere·7 days ago

Cohere· FRONTIER

What Are AI Agents? A Guide For Getting Started

Discover what AI agents are, how they work, and their practical uses in solving key business challenges.

Cohere·7 days ago

Cohere· FRONTIER

Agentic Workflows

Learn what agentic workflows are, how they move work from request to outcome, and how enterprises can use them to automate complex processes with control, oversight, and security.

Cohere·7 days ago

OpenAI· FRONTIER

Predicting model behavior before release by simulating deployment

OpenAI introduces Deployment Simulation, a method to predict AI model behavior before deployment using real conversation data to improve safety and evaluation accuracy.

OpenAI·7 days ago

TechCrunch AI· PRESS

Sundar Pichai faces boos, walkout at Stanford graduation ceremony over Google’s Israel, ICE ties

AI is once again at the heart of a college graduation protest — this time for the technology's use in Google's defense contracts.

Lucas Ropek·7 days ago

TechCrunch AI· PRESS

The US government’s Anthropic models ban was never about an AI jailbreak

The Trump administration's decision that forced Anthropic to pull its latest cybersecurity models could be reactionary, retaliatory, or both, but the message is clear: The AI industry isn't immune from U.S. government interference.

Zack Whittaker·7 days ago

The Verge AI· PRESS

Facebook’s new AI Mode search gets its info from public posts

Your public Facebook posts could help inform AI-generated results in Meta's new AI Mode. When you search on Facebook, the "AI Mode" option will appear alongside the usual search modes like "People" and "Marketplace." It's one of several new AI features Meta is rolling out starting today, including photo presets that swap sports jerseys onto fans and suggestions for collage templates. Instead of "just links," it gives users AI-generated results that pull from publicly-posted content across Meta's platforms, like the AI search feature in its new Reddit-like Forum app. Users can also ask Meta's ...

Stevie Bonifield·7 days ago

Simon Willison· ANALYST

datasette-apps 0.1a3

Release: datasette-apps 0.1a3 Fixed a bug where users without the create-app permission could still create apps. #27 Fixed a bug where it was impossible to grant permission to edit an app to users who were not the app's owner. The rules for edit/delete are now the same as view: if the app is private only the owner can modify it, otherwise permission is controlled by Datasette's regular permission system. #29 Tags: datasette

Simon Willison·7 days ago

Ars Technica AI· PRESS

Chipmaker Nvidia seeks to raise over $25B in first bond deal since 2021

Debt sale set to test investor appetite for further exposure to AI sector amid a deluge of borrowing.

Michelle Chan and Tim Bradshaw, Financial Times ·7 days ago

The Verge AI· PRESS

All the news about Anthropic’s new AI fight with the White House

Anthropic was already navigating one dispute with the government in its standoff with the Pentagon, and then came an order on June 12th to block off foreign access to its most recently released AI models, Fable 5 and Mythos 5. When they launched on June 9th, Anthropic said “Fable 5’s capabilities exceed those of any model we’ve ever made generally available,” and that Claude Mythos 5 had the same underlying model, “but with the safeguards lifted in some areas.” According to reports, the order came after conversations between Amazon and the White House about researchers saying they found ways ...

Richard Lawler·7 days ago

MIT Tech Review· PRESS

Why do South Koreans love AI so much?

This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here. When I landed in Seoul after a grueling 12-hour flight from San Francisco, I walked through an unmanned immigration checkpoint, where a machine scanned my face and passport. On the subway home,…

Michelle Kim·7 days ago

← Front Page30 stories

← Newer Older →

The Archive

Human-in-the-Loop Atlas-Based 3D Asset Segmentation for Interactive Content Workflows

DecoSearch: Complexity-Aware Routing and Plan-Level Repair for Text-to-SQL

Improving low-resource ASR using bilingual fine-tuning with language identification: a cross-linguistic evaluation

A Framework for Evaluating Agentic Skills at Scale

Conservation Laws for Modern Neural Architectures

Beyond Native Success: Auditing Deployment-Interface Exposure of CLIP Backdoors

No-Free-Fairness: Fundamental Limits and Trade-offs in Learning Systems

QueryMarket: Cost-Aware Online Active Learning in Data Markets

SpaceX to acquire Cursor for $60B in stock, days after blockbuster IPO

Critical Copilot vulnerability allowed hackers to seal 2FA code from users

ChatGPT’s market share slips below 50% for first time

Fox Buys Roku, The Problem With Fox’s Smart Strategy, Streaming That Works

Want to get a data center online quickly? Give it some flex.

Malaysia’s AI agent-powered messaging app Respond.io raises $62.5M, eyes acquisitions

The Fable 5 Export Controls Harm US Cyber Defense

Quoting Matteo Wong, The Atlantic

Inside the fight over Claude Mythos 5

[AINews] Satya on Loopcraft: Building Frontier Ecosystems

Cloudflare CAPTCHA on at least one ampersand

Secure AI in Government: Top uses cases and benefits

What Are AI Agents? A Guide For Getting Started

Agentic Workflows

Predicting model behavior before release by simulating deployment

Sundar Pichai faces boos, walkout at Stanford graduation ceremony over Google’s Israel, ICE ties

The US government’s Anthropic models ban was never about an AI jailbreak

Facebook’s new AI Mode search gets its info from public posts

datasette-apps 0.1a3

Chipmaker Nvidia seeks to raise over $25B in first bond deal since 2021

All the news about Anthropic&#8217;s new AI fight with the White House

Why do South Koreans love AI so much?

All the news about Anthropic’s new AI fight with the White House