The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

DialToM: Benchmark testing LLM Theory of Mind via dialogue trajectory forecasting from mental-state profiles, separating reasoning from correlation.

Neemesh Yadav·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills

MedSkillAudit: Domain-specific audit framework for medical research agent skills assessing scientific integrity, reproducibility, and safety.

Yingyong Hou·2 months ago

r/singularity· COMMUNITY

A Chinese startup sells a $3 companion AI device that generates interactive holograms of deceased loved by uploading their photos, voice recordings, and chat histories.

Chinese startup markets $3 AI device generating interactive holograms of deceased people from photos and voice.

u/Distinct-Question-16·2 months ago·1148 pts / 274 comm

Google DeepMind· FRONTIER

Decoupled DiLoCo: A new frontier for resilient, distributed AI training

Google DeepMind introduces Decoupled DiLoCo, a distributed training method improving resilience and efficiency across compute clusters.

Google DeepMind·2 months ago

MIT Tech Review· PRESS

AI needs a strong data fabric to deliver business value

Artificial intelligence is moving quickly in the enterprise, from experimentation to everyday use. Organizations are deploying copilots, agents, and predictive systems across finance, supply chains, human resources, and customer operations. By the end of 2025, half of companies used AI in at least three business functions, according to a recent survey. But as AI becomes…

MIT Technology Review Insights·2 months ago

Stratechery· ANALYST

John Ternus and Apple’s Hardware-Defined Future, SpaceXAI and Cursor

Commentary on Apple's John Ternus appointment and its implications for hardware-AI strategy, with tangential reference to SpaceX-Cursor partnership.

Ben Thompson·2 months ago

OpenAI· FRONTIER

Workspace agents

OpenAI releases workspace agents for ChatGPT to automate workflows and integrate enterprise tools with cloud-based execution.

OpenAI·2 months ago·+ covered by others

OpenAI· FRONTIER

Speeding up agentic workflows with WebSockets in the Responses API

OpenAI optimizes agentic workflows via WebSockets and connection-scoped caching in Responses API, reducing latency and API overhead.

OpenAI·2 months ago

The Verge AI· PRESS

Anthropic’s most dangerous AI model just fell into the wrong hands

Anthropic's Mythos AI model, a powerful cybersecurity tool that the company said could be dangerous in the wrong hands, has been accessed by a "small group of unauthorized users," Bloomberg reports. An unnamed member of the group, identified only as "a third-party contractor for Anthropic," told the publication that members of a private online forum got into Mythos via a mix of tactics, utilizing the contractor's access and "commonly used internet sleuthing tools." The Claude Mythos Preview is a new general-purpose model that's capable of identifying and exploiting vulnerabilities "in every m...

Jess Weatherbed·2 months ago

Simon Willison· ANALYST

Quoting Bobby Holley

Mozilla used Claude Mythos Preview to identify 271 vulnerabilities in Firefox 150, demonstrating practical AI security tooling in production browsers.

Simon Willison·2 months ago

Simon Willison· ANALYST

Changes to GitHub Copilot Individual plans

GitHub Copilot tightens Individual plan limits, pauses signups, restricts Claude Opus 4.7 to $39/month Pro+ tier citing agentic workflow compute demands.

Simon Willison·2 months ago

r/ClaudeAI· COMMUNITY

An open letter to Anthropic

Autistic user testimonial about using Claude Co-work for organizing 20 years of personal creative systems and documents.

u/roblenfestey·2 months ago·3770 pts / 560 comm

Simon Willison· ANALYST

Is Claude Code going to cost $100/month? Probably not - it's all very confusing

Anthropic briefly moved Claude Code from $20 Pro to $100+ Max tier, then reverted; pricing confusion around feature tiers.

Simon Willison·2 months ago

Latent Space· ANALYST

[AINews] OpenAI launches GPT-Image-2

OpenAI launches GPT-Image-2; Cursor secures $10B contract with xAI and $60B acquisition option.

Latent Space·2 months ago

OpenAI· FRONTIER

Introducing OpenAI Privacy Filter

OpenAI releases open-weight model for detecting and redacting PII in text with state-of-the-art accuracy.

OpenAI·2 months ago

TechCrunch AI· PRESS

Meta will record employees’ keystrokes and use it to train its AI models

Meta says that it has a new internal tool that is converting mouse movements and button clicks into data that can train its AI models.

Lucas Ropek·2 months ago

TechCrunch AI· PRESS

Unauthorized group has gained access to Anthropic’s exclusive cyber tool Mythos, report claims

Anthropic told TechCrunch it is investigating the claims, but maintains that there is no evidence that its systems have been impacted.

Lucas Ropek·2 months ago

TechCrunch AI· PRESS

SpaceX is working with Cursor and has an option to buy the startup for $60 billion

Only Elon would do this before an IPO.

Tim Fernholz·2 months ago

The Verge AI· PRESS

SpaceX cuts a deal to maybe buy Cursor for $60 billion

With an IPO looming for Elon Musk's SpaceX / xAI / X combo platter of companies, SpaceX has announced an odd arrangement to either acquire the automated programming platform Cursor for $60 billion or pay a fee of $10 billion. Buying this startup that's focused on AI coding could help xAI's tools compete with market leader Anthropic, as well as the other competitors. A report by The Information this week said Sergey Brin has directed Google's "strike team" to help its agentic AI tools catch up, while Sam Altman reportedly declared a "code red" at OpenAI last year before shutting down Sora to f...

Richard Lawler·2 months ago

r/ClaudeAI· COMMUNITY

Does Claude's $20 Plan No Longer Include Claude Code?

Was looking at buying the $20 Plan today after a demonstration from a friend (and wanting to switch/try my options from Codex), but saw that Claude Code was not included. I wanted to ask if this was a temporary change, or if the Pro plan truly never had Claude Code, and I was mistaken. My friend has a Max plan, so I could just be mistaken. Thanks! Edit: Link to site: [https://claude.com/pricing](https://claude.com/pricing)

u/Coolpop52·2 months ago·23 pts / 8 comm

r/MachineLearning· COMMUNITY

CVPR - How to identify if an accepted paper has ethical issues (plagiarism)? [D]

Reddit discussion: researcher reports CVPR 2026 paper reproduces their June 2025 arXiv work with identical equations but insufficient citation; seeks guidance on plagiarism.

u/sukays·2 months ago·42 pts / 25 comm

r/ClaudeAI· COMMUNITY

Anthropic’s Mythos Model Is Being Accessed by Unauthorized Users

Anthropic's unreleased Mythos model reportedly accessed by unauthorized users, raising security and access control concerns.

u/-IronMan-·2 months ago·64 pts / 30 comm

r/ClaudeAI· COMMUNITY

Just open-sourced a protocol + SDK that lets Claude drive your live app (ships as a Claude Code plugin)

https://github.com/BrainBlend-AI/tesseron Just open-sourced a protocol and TypeScript SDK I built mostly *with* Claude Code. The goal: let *Claude* (or any MCP client) drive a live application (browser tab, *Electron* / *Tauri* desktop app, Node daemon, CLI) by calling typed handlers inside your code, instead of scraping the UI with *Playwright* or *Computer Use*. It's called **Tesseron**. Ships as a Claude Code plugin, so install is one command: ``` /plugin marketplace add BrainBlend-AI/tesseron /plugin install tesseron@tesseron ``` Plugin spawns a small local MCP gateway automatically. ...

u/TheDeadlyPretzel·2 months ago·39 pts / 5 comm

Ars Technica AI· PRESS

Pentagon wants $54B for drones, more than most nations’ military budgets

The proposed Pentagon drone investment rivals Ukraine’s entire military budget.

Jeremy Hsu ·2 months ago

r/Anthropic· COMMUNITY

Claude Code gone from pro plan now?!

User reports Claude Code feature removed from Anthropic Pro plan pricing page.

u/sighlencer·2 months ago·19 pts / 19 comm

Ars Technica AI· PRESS

Mozilla: Anthropic's Mythos found 271 zero-day vulnerabilities in Firefox 150

CTO says new AI model is "every bit as capable" as world's best security researchers.

Kyle Orland ·2 months ago

r/MachineLearning· COMMUNITY

[NeurIPS 2026] Will you be submitting your code alongside your submissions? [D]

NeurIPS 2026 thread: researchers debate whether to submit code alongside papers given trade-offs between credibility and plagiarism risk.

u/undesirable_12·2 months ago·39 pts / 41 comm

r/LocalLLaMA· COMMUNITY

ibm-granite/granite-4.1-8b · Hugging Face

IBM Granite 4.1-8B instruct model release; 8B long-context model with improved tool-calling and RL alignment.

u/jacek2023·2 months ago·50 pts / 20 comm

MIT Tech Review· PRESS

10 Things That Matter in AI Right Now

Amy Nordrum·2 months ago

MIT Tech Review· PRESS

LLMs+

When ChatGPT launched as an experimental prototype in late 2022, OpenAI’s chatbot became an everyday everything app for hundreds of millions of people. LLMs like ChatGPT were the new future: The entire tech industry was consumed by the inferno, with companies racing to spin up rival products. The ashes of the old tech world still…

Will Douglas Heaven·2 months ago

← Front Page30 stories

← Newer Older →

The Archive

DialToM: A Theory of Mind Benchmark for Forecasting State-Driven Dialogue Trajectories

MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills

A Chinese startup sells a $3 companion AI device that generates interactive holograms of deceased loved by uploading their photos, voice recordings, and chat histories.

Decoupled DiLoCo: A new frontier for resilient, distributed AI training

AI needs a strong data fabric to deliver business value

John Ternus and Apple’s Hardware-Defined Future, SpaceXAI and Cursor

Workspace agents

Speeding up agentic workflows with WebSockets in the Responses API

Anthropic’s most dangerous AI model just fell into the wrong hands

Quoting Bobby Holley

Changes to GitHub Copilot Individual plans

An open letter to Anthropic

Is Claude Code going to cost $100/month? Probably not - it's all very confusing

[AINews] OpenAI launches GPT-Image-2

Introducing OpenAI Privacy Filter

Meta will record employees’ keystrokes and use it to train its AI models

Unauthorized group has gained access to Anthropic’s exclusive cyber tool Mythos, report claims

SpaceX is working with Cursor and has an option to buy the startup for $60 billion

SpaceX cuts a deal to maybe buy Cursor for $60 billion

Does Claude's $20 Plan No Longer Include Claude Code?

CVPR - How to identify if an accepted paper has ethical issues (plagiarism)? [D]

Anthropic’s Mythos Model Is Being Accessed by Unauthorized Users

Just open-sourced a protocol + SDK that lets Claude drive your live app (ships as a Claude Code plugin)

Pentagon wants $54B for drones, more than most nations’ military budgets

Claude Code gone from pro plan now?!

Mozilla: Anthropic's Mythos found 271 zero-day vulnerabilities in Firefox 150

[NeurIPS 2026] Will you be submitting your code alongside your submissions? [D]

ibm-granite/granite-4.1-8b · Hugging Face

10 Things That Matter in AI Right Now

LLMs+