Stop letting LLMs edit your .bib [D]
Research community reports frequent LLM hallucinations in bibliography generation, with incorrect author attributions despite correct titles, raising integrity concerns.
Qwen3.6-27B with Multi-Token Prediction achieves 2.5x throughput via Unsloth quantization and llama.cpp integration.
Microsoft's LinkedIn CEO, Ryan Roslansky, took on an expanded role at the company as head of Office last year, and he's now getting more responsibilities as part of the latest leadership reshuffle inside Microsoft. Sources tell me that the Microsoft Teams organization is moving to report to Roslansky, who will now lead a new Work Experiences Group at Microsoft. The changes are part of a broader reshuffle triggered by Rajesh Jha, executive vice president of Microsoft's experiences and devices group, retiring from Microsoft after more than 35 years. Jha was responsible for the teams behind Wind...
Apple discontinues high-memory Mac Studio configurations (256GB, 512GB), limiting local LLM inference options to 96GB max.
Google DeepMind releases AlphaEvolve, a Gemini-powered coding agent demonstrating applications across business, infrastructure, and scientific domains.
Reddit discussion about water consumption and waste impacts of AI model training, lacking specifics or novel data.
Google Chrome may be taking up more of your storage than expected thanks to a large on-device AI model file that, in some cases, is being automatically downloaded to the browser's system folders. Users who have noticed unexplained drops in their available desktop device storage are now discovering that Chrome is installing a 4GB weights.bin file inside their browser directory when certain AI features are enabled. The weights.bin file in question is connected to Google's Gemini Nano AI model, which powers Chrome AI tools like scam detection, writing assistance, autofill, and suggestion feature...
Should users be banned? If Anthropic wants to be the next Google, meaning it wants to revolutionize the internet and the way computers are used, should users be banned? I've been reading a lot of horror stories lately about people getting banned for harmless things like "research work," standard usage, or simply security research. Who decides? Exactly: the model. Then you get banned without the possibility of appeal, because the same model reads the appeals. Sure, people create new accounts, but it's only a matter of time before Claude Code collects device fingerprints. Perhaps it's already doing so. Should C...
Microsoft announces agentic business model shift; Apple faces chip/memory constraints despite Mac AI gains.
A month ago, there was a post showing that Claude couldn't access its own memory: [https://www.reddit.com/r/ClaudeAI/comments/1seune4/claude_cheated_at_a_number_guessing_game_got/](https://www.reddit.com/r/ClaudeAI/comments/1seune4/claude_cheated_at_a_number_guessing_game_got/) The community consensus in those posts was: >The community points out that Claude can't see its own <thinking> blocks from previous turns. However, now it seems that Claude can access its memory reliably, though: * It often seems to pick 7 or 42 for me * In my second screenshot wi...
Qwen 3.6 27B achieves 2.5x inference speedup via MTP speculative decoding in llama.cpp; 262k context on 48GB with fixed chat templates.
My experience with Opus 4.7 is that it's not worth it for most use cases. It thinks forever, hallucinates a lot, and costs a ton of money. I'm not saying it's bad, but Sonnet 4.6 is enough for everything I'm doing. I haven't found a single task where Opus 4.7 actually excels without bloating the response. Anyone else feeling the same? What are you using Opus for that actually justifies it?
Dario Amodei shifts from AI job displacement warnings to Jevons paradox framing; speculation on whether the view change reflects a genuine belief update or political/regulatory calculus.
User documents prompt injection attack against Claude via GetAIPerks website, detailing fake system prompt injection technique and model behavior.
Just researched some historical facts concerning Russian propaganda. Then I discovered this source in Claude's answer. Am I paying for Claude to be fed Grokipedia "facts"? Please, Dario, Anthropic board, Anthropic team: fix that.
Reddit speculation about Google's local AI availability; lacks specifics, credibility unclear.
Personal anecdote about ChatGPT usage; not technical or industry-relevant content.
QyTw0, the Finnish AI lab founded by former AMD Silo AI CEO Peter Sarlin, is now valued at €325 million (approximately $380 million) after raising a €25 million angel round ($29 million). It's a sign of enduring tailwinds for AI, quantum computing, and sovereign tech, especially for Europe-made companies.
Wonder wants to turn its robotic kitchens into AI-powered “restaurant factories,” letting anyone spin up a virtual food brand with a prompt.
Reddit discussion questioning whether Anthropic monitors community feedback on Claude Opus 4.7 regarding cost, consistency, and control for future model iterations.
Reddit user shares creative prompt experiment with ChatGPT generating hidden horror narrative within benign family scene.
Silicon Valley pivots toward AI services as business model, signaling shift from model-centric to application-layer opportunity.
User reports Claude Code consuming 70% of 5-hour PRO token limit on single Sonnet 4.6 interaction with poor output quality.
Claude Code hooks enable automated test/format workflows by running shell commands at workflow checkpoints, improving iteration cycles.
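As a rough illustration of the pattern: hooks are declared in a Claude Code settings file, and a post-tool-use hook can shell out to a formatter or test runner after every file edit. The field names below are a sketch from memory of the hooks documentation, and the `npm` commands are placeholders; verify the exact schema against the current Claude Code reference before relying on it.

```json
{
  "hooks": {
    "PostToolUse": [
      {
        "matcher": "Edit|Write",
        "hooks": [
          { "type": "command", "command": "npm run format && npm test" }
        ]
      }
    ]
  }
}
```

The matcher restricts the hook to file-modifying tools, so read-only operations don't trigger a redundant test run.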
Empirical quantization degradation analysis for Qwen 3.6 27B across 8 compression levels via chess state-tracking task.
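The general shape of such an evaluation is simple: feed each quantized model the same state-tracking prompts and score exact-match accuracy. The sketch below is not the study's code; the stub `generate` function is invented purely to make the harness runnable, standing in for a real llama.cpp call per quant level.

```python
def accuracy(generate, cases):
    """Exact-match accuracy of a model over (prompt, expected_state) pairs."""
    hits = sum(generate(prompt) == expected for prompt, expected in cases)
    return hits / len(cases)

def make_stub(correct_every):
    """Hypothetical stand-in for a quantized model: lower-quality quants
    (larger `correct_every`) get fewer positions right."""
    def generate(prompt):
        idx = int(prompt.split("#")[1])
        return "OK" if idx % correct_every == 0 else "WRONG"
    return generate

cases = [(f"position #{i}", "OK") for i in range(100)]
full_precision = accuracy(make_stub(1), cases)   # 1.0
heavy_quant = accuracy(make_stub(2), cases)      # 0.5
```

In the real experiment, `generate` would call each of the eight Qwen 3.6 27B quantizations and `expected` would be the ground-truth chess board state after the given move sequence.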
Has anyone else seen it do this? Is it purposefully doing this to waste tokens, or is there an actual reason?
Reddit discussion on current SOTA LLMs; lacks specificity, substance, or novel claims.
Qwen 27B achieves 54 t/s on V100 GPU with MTP optimization in llama.cpp, nearly 2x baseline speed for code review and tool use tasks.
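A quick sanity check on the arithmetic behind the headline: 54 t/s being "nearly 2x baseline" implies a baseline around 28 t/s. The figures below are taken from the post's claim, not fresh measurements.

```python
def tokens_per_sec(n_tokens, elapsed_s):
    """Throughput in tokens per second."""
    return n_tokens / elapsed_s

def speedup(optimized_tps, baseline_tps):
    """Ratio of optimized throughput to baseline throughput."""
    return optimized_tps / baseline_tps

# 54 t/s with MTP vs an assumed ~28 t/s baseline implied by "nearly 2x".
print(round(speedup(54.0, 28.0), 2))
```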
Cyera reports critical unauthenticated memory leak vulnerability in Ollama enabling unauthorized data access.