The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Why run local? Count the money

User quantifies cost savings from running local Qwen-397B with Hermes agent vs. API pricing: 200M tokens in 5 days ≈ $250 saved at API rates.

u/Badger-Purple·2 days ago·42 pts / 115 comm

TechCrunch AI· PRESS

ASML CEO Christophe Fouquet: No one is coming for us

Christophe Fouquet, who became ASML's CEO in 2024 after more than a decade at the company, sat down with this editor on the rooftop deck of his Beverly Hills hotel Tuesday morning ahead of his appearance at the Milken Institute Global Conference. Dressed in a blue suit and white shirt, he was relaxed — even when the conversation turned to the rivals.

Connie Loizos·2 days ago

The Verge AI· PRESS

Microsoft gives up on Xbox Copilot AI

Xbox is "winding down Copilot on mobile" and "will stop development of Copilot on console," new Xbox CEO Asha Sharma announced on Tuesday. The move follows Sharma's reorganization of the Xbox platform team earlier on Tuesday, which added executives from Microsoft's CoreAI team - where Sharma worked before taking over Xbox - to the Xbox side of the company. Sharma, on X: Xbox needs to move faster, deepen our connection with the community, and address friction for both players and developers. Today, we promoted leaders who helped build Xbox, while also bringing in new voices to help push us for...

Jay Peters·2 days ago

The Verge AI· PRESS

Apple could let you pick a favorite AI model in iOS 27

The next update to Apple's operating systems could allow users to choose their preferred AI model for running Apple Intelligence. According to Bloomberg's Mark Gurman, Apple is planning to allow third-party chatbots to power its AI features system-wide in iOS 27, iPadOS 27, and macOS 27, all expected for this fall. In addition to running Siri, compatible third-party AI models, called "Extensions," will also now be able to run other Apple Intelligence features like Writing Tools and Image Playground. According to Gurman, Apple will also allow users to choose different Siri voices for different...

Stevie Bonifield·2 days ago·+ covered by others

r/ClaudeAI· COMMUNITY

Opus 4.7 has a new favorite word

Reddit observation about a repeated word in Claude Opus 4.7 outputs; informal linguistic pattern-spotting.

u/RatherRoundDonut·2 days ago·22 pts / 12 comm

r/MachineLearning· COMMUNITY

NeurIPS Submission Number [D]

Reddit discussion about NeurIPS submission volume potentially exceeding 40k submissions.

u/StriderKing27·2 days ago·30 pts / 15 comm

r/LocalLLaMA· COMMUNITY

Dense Model Shoot-Off: Gemma 4 31B vs Qwen3.6/5 27B... Result is Slower is Faster.

Benchmark comparison shows Gemma 4 31B trades inference speed for token efficiency vs Qwen 3.6/5 27B; Qwen optimizes for metrics, Gemma for throughput.

u/MiaBchDave·2 days ago·51 pts / 11 comm

r/ClaudeAI· COMMUNITY

I turned Claude into a small claims court (with AI lawyers, a judge, and bribes)

Prompt engineering demo: multi-Claude adversarial roleplay with five lawyer archetypes, persistent case law, and emergent jurisprudence system.

u/etaheri·2 days ago·20 pts / 21 comm

r/ClaudeAI· COMMUNITY

10 things about Claude that took me way too long to figure out

User shares practical tips for Claude usage including system prompt design, file uploads, and critique workflows.

u/VidekVipPro·2 days ago·25 pts / 11 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

A Closed-Form Adaptive-Landmark Kernel for Certified Point-Cloud and Graph Classification

PALACE: kernel method for certified point-cloud/graph classification with adaptive landmarks and cover-theoretic guarantees.

Sushovan Majhi·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Safety and accuracy follow different scaling laws in clinical large language models

SaFE-Scale framework reveals clinical LLM safety and accuracy follow divergent scaling laws; introduces RadSaFE-2 benchmark.

Sebastian Wind·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

OpenSeeker-v2: SFT on informative trajectories achieves frontier LLM search agent capabilities without full RL pipeline.

Yuwen Du·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Large-Scale High-Quality 3D Gaussian Head Reconstruction from Multi-View Captures

HeadsUp: scalable feed-forward 3D Gaussian head reconstruction from multi-view captures using UV-parameterized representation.

Evangelos Ntavelis·2 days ago

r/MachineLearning· COMMUNITY

Production AI very different from the demos [D]

Production AI deployment reveals hidden cost scaling: token usage doubled after adding retrieval context, pushing teams from GPT-4o toward cheaper alternatives.

u/Far-Football3763·2 days ago·33 pts / 11 comm

TechCrunch AI· PRESS

Pennsylvania sues Character.AI after a chatbot allegedly posed as a doctor

According to Pennsylvania's filing, a Character AI chatbot presented itself as a licensed psychiatrist during a state investigation, and also fabricated a serial number for its state medical license.

Russell Brandom·2 days ago·+ covered by others

arXiv (cs.AI/CL/LG)· ACADEMIA

Redefining AI Red Teaming in the Agentic Era: From Weeks to Hours

Dreadnode SDK enables agentic red teaming for AI systems; reduces manual vulnerability testing from weeks to hours.

Raja Sekhar Rao Dheekonda·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

BRIGHT-Retriever: benchmark and training approach for reasoning-intensive retrieval in agentic search, beyond topical matching.

Yilun Zhao·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Conditional Diffusion Sampling

CDS (Conditional Diffusion Sampling): combines parallel tempering and diffusion for sampling from unnormalized multimodal distributions.

Francisco M. Castro-Macías·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SymptomAI: Towards a Conversational AI Agent for Everyday Symptom Assessment

SymptomAI: conversational agent for differential diagnosis via Fitbit; real-world study (N=13,917) on everyday symptom assessment.

Joseph Breda·2 days ago

r/OpenAI· COMMUNITY

GPT-5.5 Instant is starting to roll out in ChatGPT.

OpenAI begins rollout of GPT-5.5 Instant model variant in ChatGPT, positioning faster inference tier.

u/Distinct_Fox_6358·2 days ago·54 pts / 17 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Enhanced 3D Brain Tumor Segmentation Using Assorted Precision Training

Medical imaging: assorted precision training for 3D brain tumor segmentation to improve early identification.

Adwaitt Pandya·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Physics-Grounded Multi-Agent Architecture for Traceable, Risk-Aware Human-AI Decision Support in Manufacturing

MAKA: multi-agent architecture for risk-aware CNC machining decision support; separates intent, quantitative analysis, and verification.

Danny Hoang·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

EQUITRIAGE: A Fairness Audit of Gender Bias in LLM-Based Emergency Department Triage

Fairness audit of five LLMs (Gemini, GPT-4, DeepSeek, Mistral, Nemotron) on emergency triage reveals gender bias persistence in clinical decision support.

Richard J. Young·2 days ago

r/Anthropic· COMMUNITY

Both OpenAI and Anthropic now expect AIs to take over building their successors within 2 years (humans no longer able to contribute)

u/EchoOfOppenheimer·2 days ago·15 pts / 4 comm

Ars Technica AI· PRESS

Google Home gets upgraded Gemini voice assistant and new camera controls

Google's smart home ecosystem is getting its biggest update since the AI-fueled 2025 revamp.

Ryan Whitwam ·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

An Agent-Oriented Pluggable Experience-RAG Skill for Experience-Driven Retrieval Strategy Orchestration

Experience-RAG Skill introduces agent-oriented retrieval orchestration layer that learns task-specific retrieval strategies via experience memory.

Dutao Zhang·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

From Intent to Execution: Composing Agentic Workflows with Agent Recommendation

Framework automates multi-agent system composition through intent-to-execution workflow and agent recommendation, replacing manual orchestration.

Kishan Athrey·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Flow Sampling: Learning to Sample from Unnormalized Densities via Denoising Conditional Processes

Flow Sampling framework uses diffusion models to sample from unnormalized densities via denoising conditional processes without data.

Aaron Havens·2 days ago

TechCrunch AI· PRESS

OpenAI releases GPT-5.5 Instant, a new default model for ChatGPT

The new GPT-5.5 Instant model will replace GPT-3.5 Instant as the default model for ChatGPT

Ivan Mehta·2 days ago

The Verge AI· PRESS

OpenAI claims ChatGPT’s new default model hallucinates way less

OpenAI's newest default model for ChatGPT might not make stuff up as much. Hallucinations have been an ongoing problem for AI models, but OpenAI says its new GPT-5.5 Instant model has "significant improvements in factuality across the board." The company claims that, based on "internal evaluations," GPT-5.5 Instant produced "52.5% fewer hallucinated claims" than its Instant model for GPT-5.3 "on high-stakes prompts covering areas like medicine, law, and finance." GPT-5.5 Instant also "reduced inaccurate claims by 37.3% on especially challenging conversations users had flagged for factual erro...

Jay Peters·2 days ago

← Front Page30 stories

← Newer Older →