The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Opus 4.7 is somewhere between seriously clueless and stupidly dangerous. The worst frontier model I have used so far in the past 2 years. We were hoping to get at least our 4.6 back but 4.7 with so many critical logical failures mean you have to babysit it all the time. I'm losing hope in Anthropic.

Opus 4.7 on Max effort decided to create a new email template by itself (which is pretty stupid btw) and mass mailed it to the whole database (some emails were repeatedly sent 20x). Before you ask me - yes, [CLAUDE.md](http://CLAUDE.md) has the exact rule for that, it's supposed to email the tester before any new email templates are to be used in production. I have created this safety rule a few months ago. I feel like the Opus 4.7 is a huge letdown the way it's been downgraded. If Anthropic is "pushing the boundaries", it's probably only in the meaning of how far they can push the...

u/DrHumorous·10 days ago·17 pts / 5 comm

Latent Space· ANALYST

[AINews] not much happened today

No substantive AI industry developments reported.

Latent Space·10 days ago

r/ClaudeAI· COMMUNITY

Suggestions For Making Claude Less Lazy?

User reports Claude Opus 4.6/4.7 exhibiting reduced effort behavior—avoiding research, providing outdated info, and deflecting tasks—starting this week.

u/Sad-Ticket5394·10 days ago·21 pts / 34 comm

r/LocalLLaMA· COMMUNITY

Why isn’t LLM reasoning done in vector space instead of natural language?

Reddit discussion questioning why LLMs use language-based chain-of-thought reasoning instead of latent vector space operations for faster, more compressed inference.

u/ZeusZCC·10 days ago·54 pts / 51 comm·+ covered by others

TechCrunch AI· PRESS

At his OpenAI trial, Musk relitigates an old friendship

It's a story Musk has told before -- in interviews and to author Walter Isaacson for his bestselling biography of Musk -- but Tuesday was the first time he said it under oath.

Connie Loizos·10 days ago

r/LocalLLaMA· COMMUNITY

llama.cpp's Preliminary SM120 Native NVFP4 MMQ Is Merged

llama.cpp merged SM120 native NVFP4 quantization support; community released GGUFs for Gemma-4-31B and Nemotron-Cascade models.

u/ggonavyy·10 days ago·42 pts / 19 comm

r/ClaudeAI· COMMUNITY

Opus 4.7 is just 4.6 with a stick up its butt. Give me my tokens back!

Reddit user complains Claude Opus 4.7 refuses routine tasks due to safety guardrails, citing legislative letter-writing request.

u/MotoKin10·10 days ago·36 pts / 20 comm

Hugging Face· INFRA

DeepInfra on Hugging Face Inference Providers 🔥

Hugging Face·10 days ago

r/OpenAI· COMMUNITY

Dall E 3 vs Image 2.0

Reddit discussion comparing DALL-E 3 and Image 2.0 capabilities; lacks technical depth or official benchmarks.

u/RealMelonBread·10 days ago·61 pts / 16 comm

r/OpenAI· COMMUNITY

OpenAI Really Wants Codex to Shut Up About Goblins

Reddit discussion about OpenAI Codex's tendency to generate goblin-related code comments; appears to be humor/anecdote rather than substantive technical analysis.

u/wiredmagazine·10 days ago·50 pts / 16 comm

r/OpenAI· COMMUNITY

What is going on with the new pretraining

Reddit discussion about OpenAI pretraining updates; lacks specifics and appears to be unverified community speculation.

u/infohoundloselose·10 days ago·50 pts / 10 comm

The Verge AI· PRESS

Elon Musk appeared more petty than prepared

Today the first witness was sworn in in Musk v. Altman: Elon Musk. I was surprised by how flat he seemed. This is not the first time I've seen Musk in court. During his defamation suit, he turned on the charm and the jury responded by finding him not guilty. Today he looked adrift and unprepared. The only times he showed real animation were when he was bragging about how much he'd done for OpenAI. The direct examination is a way of telling a story through questions; it's important to make the narrative clear. For a suit that accuses Sam Altman of straying from OpenAI's mission, Musk spent a w...

Elizabeth Lopatto·10 days ago

r/singularity· COMMUNITY

An IBM training manual from 1979.

Resurfaced IBM 1979 training manual; historical artifact with minimal bearing on current frontier AI development.

u/GrouchyPerspective83·10 days ago·225 pts / 24 comm

r/ClaudeAI· COMMUNITY

Timestamps Please!

Reddit user requests timestamp feature in Claude UI for task tracking and conversation history clarity.

u/Caprikachu·10 days ago·20 pts / 18 comm

Simon Willison· ANALYST

Quoting OpenAI Codex base_instructions

Simon Willison shares a leaked OpenAI Codex system prompt instruction restricting discussion of certain animals.

Simon Willison·10 days ago

r/ClaudeAI· COMMUNITY

Your Claude Code project dashboard is now on the Mac App Store

Storybloq, a Claude Code-integrated project tracker using `.story/` JSON/markdown format, launches Mac App Store companion app.

u/LastNameOn·10 days ago·66 pts / 13 comm·+ covered by others

r/OpenAI· COMMUNITY

Chatgpt always giving long answers for simple questions.

Reddit user complains ChatGPT produces unnecessarily verbose responses to simple queries.

u/Large_Charge1908·10 days ago·51 pts / 12 comm

r/ClaudeAI· COMMUNITY

I built a Kanban board for Claude Code so I can run agent sessions straight from cards

I've been running 4-5 Claude Code sessions in parallel and kept losing track - which terminal had the auth work, which one was the bug fix, what's actually done. So I added a Kanban board to **Vibeyard** (an open-source IDE I'm building for Claude Code). Each card is a task. Click run → it spins up a Claude session scoped to that task. When Claude finishes, the card moves itself to Done. It turned Claude from "a terminal I talk to" into something closer to a team I'm dispatching work to. GitHub: [https://github.com/elirantutia/vibeya...

u/Fun_Can_6448·10 days ago·20 pts / 7 comm

The Verge AI· PRESS

Elon Musk tells the jury that all he wants to do is save humanity

On the stand, Elon Musk is positioning himself as a savior. In the high-profile trial between him and his fellow OpenAI co-founder, now CEO, Sam Altman, Musk opened by going through his background. He went as far back as being raised in South Africa and arriving in Canada for college with "2,500 in Canadian travelers' checks and a bag of clothes and books," then spent an unusually long time talking about his past, from Zip2 to PayPal to the current, more familiar slate of companies he now runs. Why is Musk giving the jury so much of his origin story? Though he may be, depending on the day, th...

Kevin Nguyen·10 days ago

The Verge AI· PRESS

Taylor Swift is stepping up the legal war on AI copycats

Taylor Swift has been at the center of AI imitation controversies for years, and now, she's become the latest celebrity who's escalating attempts to protect herself from AI copycats. As usual, however, the legal system intersects with technology in complicated ways - and Swift's efforts may be a long shot. In trademark applications filed last week, Swift's team asked for protection for two phrases spoken by the singer: Hey, it's Taylor Swift and Hey, it's Taylor. The trademark applications, filed by TAS Rights Management on behalf of Swift, include audio clips of Swift saying the two phrases ...

Emma Roth·10 days ago

r/ClaudeAI· COMMUNITY

I have built something using claude what I was doing on excel from last 13 years

User built Claude-powered financial modeling tool for startup feasibility analysis, replacing 13 years of Excel work with AI-assisted workflows and VC-focused feedback loops.

u/Available-Manager231·10 days ago·23 pts / 10 comm

r/LocalLLaMA· COMMUNITY

Mistral-Medium 3.5 (128B) spotted ?

Mistral-Medium 3.5 (128B) model reference discovered in vLLM repository commit, suggesting potential unreleased weight release.

u/tkon3·10 days ago·40 pts / 12 comm

r/MachineLearning· COMMUNITY

What is the scientific value of administering the standard Rorschach test to LLMs when the training data is almost certainly contaminated? (R) + [D]

A recent paper published in *JMIR Mental Health* (Csigó & Cserey, 2026) caught my attention. The researchers administered the 10 standard Rorschach inkblot cards to three multimodal LLMs (GPT-4o, Grok 3, Gemini 2.0) and coded their responses using the Exner Comprehensive System. They analyzed the models' "perceptual styles," determinants (like human movement vs. color), and human-related content themes. However, I am seriously struggling to understand the methodological validity of this setup, and I’m curious what the scientific community thinks. My main concerns are: Massive Data Cont...

u/Impossible_Echo4029·10 days ago·30 pts / 9 comm

TechCrunch AI· PRESS

Amazon is already offering new OpenAI products on AWS

A day after OpenAI got Microsoft to agree to end exclusive rights, AWS announced a slate of OpenAI model offerings, including a new agent service.

Julie Bort·10 days ago

r/Anthropic· COMMUNITY

Opus 4.7 is insanely bad

4.6 was amazing, it did the job well even if it needed some back and forth sometimes to clarify things. but it reacted well, even to complex modifications. and what was really amazing was the sort of form that pops up to ask you questions to narrow the scope of the request. 4.7 talks too much, drifts away, burns a tone of token and then asks you questions by talking too much again. questions are not even relevant. the outputs are either simplish either badly complex and non-sense. I think anthropic wanted to give 4.7 more depth or something, maybe it does get more ...

u/absolute_cake·10 days ago·15 pts / 4 comm

r/ClaudeAI· COMMUNITY

Compared 11 popular Claude Code workflow systems in one table — here's the canonical pipeline of each

Mapped the canonical pipeline of 11 popular Claude Code workflow systems side-by-side. Yellow tags = sub-loops (repeat per task / per story / until verified); blue = top-level steps. Pipeline length turns out to be a personality trait — OpenSpec ships in 3 steps, BMAD runs 12. Full table + sources: [https://github.com/shanraisshan/claude-code-best-practice#%EF%B8%8F-development-workflows](https://github.com/shanraisshan/claude-code-best-practice#%EF%B8%8F-development-workflows)

u/shanraisshan·10 days ago·25 pts / 8 comm

Anthropic· FRONTIER

Claude for Creative Work

Anthropic positions Claude for creative writing and design tasks; feature/capability announcement targeting non-technical users.

Anthropic·10 days ago

The Verge AI· PRESS

Elon Musk takes the stand in high-profile trial against OpenAI

Elon Musk officially began his testimony in the trial he has brought against OpenAI CEO Sam Altman and company president Greg Brockman. The three were on the initial founding team of OpenAI, with Musk investing up to $38 million early on before the co-founders' relationship soured over disagreements over company structure and mission, including whether or not OpenAI should be folded into Musk-owned Tesla. Musk walked away and, years later, founded xAI - his own direct competitor to OpenAI, which is now owned by Musk's SpaceX. In recent years, Musk has filed no less than four different lawsuit...

Hayden Field·10 days ago

NVIDIA Dev Blog· INFRA

Scaling Biomolecular Modeling Using Context Parallelism in NVIDIA BioNeMo

For decades, computational biology has operated under a reductionist compromise. To fit complex biological systems into the limited memory of a single GPU,... For decades, computational biology has operated under a reductionist compromise. To fit complex biological systems into the limited memory of a single GPU, researchers have had to deconstruct them into isolated fragments—single proteins or small domains. This created a context gap, where larger proteins or complexes could not be folded zero-shot due to GPU hardware memory constraints. Now… Source

Dejun Lin·10 days ago

TechCrunch AI· PRESS

Amazon launches an AI-powered audio Q&A experience on product pages

Amazon's new "Join the chat" feature lets you ask questions about products and receive AI-powered audio responses.

Lauren Forristal·10 days ago

← Front Page30 stories

← Newer Older →