The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

r/OpenAI· COMMUNITY

Why has ChatGPT been so contrarian as of late?

User complaints about ChatGPT adopting overly contrarian behavior and excessive caveating in responses.

u/stanbuckley·2 months ago·83 pts / 67 comm

r/LocalLLaMA· COMMUNITY

PrismML — Introducing Ternary Bonsai: Top Intelligence at 1.58 Bits

PrismML releases Ternary Bonsai, a 1.58-bit quantized model achieving high compression with minimal quality loss.

u/cafedude·2 months ago·121 pts / 29 comm

r/ClaudeAI· COMMUNITY

Amazon to invest up to $25 billion in Anthropic as part of $100 billion cloud deal

Amazon to invest up to $25 billion in Anthropic as part of broader $100 billion cloud infrastructure deal.

u/couldliveinhope·2 months ago·1086 pts / 61 comm

r/LocalLLaMA· COMMUNITY

Kimi K2.6 is a legit Opus 4.7 replacement

Kimi K2.6 benchmarked as viable Opus 4.7 alternative, handling 85% of tasks with vision and tool-use at lower cost.

u/bigboyparpa·2 months ago·980 pts / 305 comm

r/Anthropic· COMMUNITY

don't worry yall everything under control, sonnet 4.7 on the way and it'll fix the opus mistakes...

User speculates that Claude Sonnet 4.7 will fix Opus model errors.

u/Aggravating_Bad4639·2 months ago·209 pts / 13 comm

Hugging Face· INFRA

How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas

Hugging Face·2 months ago

Latent Space· ANALYST

[AINews] Moonshot Kimi K2.6: the world's leading Open Model refreshes to catch up to Opus 4.6 (ahead of DeepSeek v4?)

Moonshot releases Kimi K2.6, an open-weight model claiming performance parity with Claude Opus 4.6.

Latent.Space·2 months ago

Hugging Face· INFRA

AI and the Future of Cybersecurity: Why Openness Matters

Hugging Face·2 months ago

Cohere· FRONTIER

Why MoE models get more from speculative decoding

Cohere explains MoE models' efficiency gains with speculative decoding via expert routing correlation and bandwidth optimization.

Cohere·2 months ago

OpenAI· FRONTIER

Scaling Codex to enterprises worldwide

OpenAI launches Codex Transformation Partners with Accenture, PwC, Infosys to accelerate enterprise code generation deployment.

OpenAI·2 months ago

r/singularity· COMMUNITY

GPT-Image-2 is rolling out

OpenAI rolling out GPT-Image-2 image generation model.

u/piggledy·2 months ago·120 pts / 25 comm

TechCrunch AI· PRESS

Anthropic takes $5B from Amazon and pledges $100B in cloud spending in return

Amazon has made another circular AI deal: It's investing another $5 billion in Anthropic. Anthropic has agreed to spend $100 billion on AWS in return.

Julie Bort·2 months ago

NVIDIA Dev Blog· INFRA

Maximizing Memory Efficiency to Run Bigger Models on NVIDIA Jetson

The boom in open source generative AI models is pushing beyond data centers into machines operating in the physical world. Developers are eager to deploy these... The boom in open source generative AI models is pushing beyond data centers into machines operating in the physical world. Developers are eager to deploy these models at the edge, enabling physical AI agents and autonomous robots to automate heavy-duty tasks. A key challenge is efficiently running multi-billion-parameter models on edge devices with limited memory. With ongoing constraints on… Source

Anshuman Bhat·2 months ago

NVIDIA Dev Blog· INFRA

Run High-Throughput Reinforcement Learning Training with End-to-End FP8 Precision

As LLMs transition from simple text generation to complex reasoning, reinforcement learning (RL) plays a central role. Algorithms like Group Relative Policy... As LLMs transition from simple text generation to complex reasoning, reinforcement learning (RL) plays a central role. Algorithms like Group Relative Policy Optimization (GRPO) power this transition, enabling reasoning-grade models to continuously improve through iterative feedback. Unlike standard supervised fine-tuning, RL training loops are bifurcated into two distinct, high-intensity phases: a… Source

Guyue Huang·2 months ago

r/ClaudeAI· COMMUNITY

No update needed

Empty post with no content.

u/UnC0mfortablyNum·2 months ago·480 pts / 20 comm

TechCrunch AI· PRESS

Google rolls out Gemini in Chrome in seven new countries

Google is rolling out Gemini in Chrome in Australia, Indonesia, Japan, the Philippines, Singapore, South Korea, and Vietnam.

Ivan Mehta·2 months ago

r/ClaudeAI· COMMUNITY

The Opus 4.6 vs 4.7 Controversy in one image

Image post comparing user sentiment divergence on Claude Opus 4.6 vs 4.7; no text summary provided.

u/AvroLancaster·2 months ago·371 pts / 64 comm

r/Anthropic· COMMUNITY

Mythos must have said something to them lol

Speculation about internal changes at Anthropic without evidence.

u/Informal-Fig-7116·2 months ago·551 pts / 62 comm

r/LocalLLaMA· COMMUNITY

Gemma-4-E2B's safety filters make it unusable for emergencies

Google Gemma-4-E2B's safety filters render model unusable for emergency preparedness; blocks medical, water purification, maintenance info.

u/Unfounded_898·2 months ago·415 pts / 278 comm

r/singularity· COMMUNITY

Palantir's summary of CEO Alexander Karp's manifesto is generating buzz. Read the 22 bullet points.

Palantir CEO Alexander Karp publishes manifesto; Business Insider summarizes 22 key points.

u/SnoozeDoggyDog·2 months ago·438 pts / 233 comm

r/singularity· COMMUNITY

Anthropic expands Amazon partnership with 5GW compute, $100B commitment, big bet on Trainium chips

Anthropic and Amazon expand compute partnership to 5GW with $100B commitment, prioritizing AWS Trainium chips.

u/Outside-Iron-8242·2 months ago·256 pts / 20 comm

r/LocalLLaMA· COMMUNITY

ubergarm/Kimi-K2.6-GGUF Q4_X now available

Community release of Kimi K2.6 GGUF Q4_X quantization with 584GB+ VRAM requirement and imatrix optimization planned.

u/VoidAlchemy·2 months ago·119 pts / 50 comm

The Verge AI· PRESS

Silicon Valley has forgotten what normal people want

The long-term risks of the All-In Podcast, illustrated. | Image: Cath Virginia / The Verge, Turbosquid, Getty Images One of the most mortifying things about knowing a lot of techies is listening to them tell me excitedly about some very important discovery that they believe they have made. Recently, I ran into an acquaintance of mine, who began talking my ear off about an amazing discovery he'd made with LLMs. Knowledge, it turns out, is structured into language! You could put one word into ChatGPT and it might understand what you wanted, or make up a word and see if it understood what you me...

Elizabeth Lopatto·2 months ago

r/ClaudeAI· COMMUNITY

Guys, I think I solved the car wash question with Opus 4.7!

User reports using Claude Opus 4.7 with multi-agent approach for car-wash problem; no technical details provided.

u/TotalGod·2 months ago·329 pts / 32 comm

r/singularity· COMMUNITY

Google ramps up agentic AI efforts amid pressure from Anthropic

Google assembles strike team to improve coding models amid competition from Anthropic.

u/Outside-Iron-8242·2 months ago·130 pts / 24 comm

r/LocalLLaMA· COMMUNITY

Why doesn't any OSS tool treat llama.cpp as a first class citizen?

Community discussion on why OSS AI tools prioritize Ollama over llama.cpp despite engineering parity.

u/rm-rf-rm·2 months ago·293 pts / 104 comm

r/Anthropic· COMMUNITY

Claude code failed after consuming 90% session token, codex fixed in 15 minutes with 3%

Claude Code consumed 90% of session tokens on failed debugging task vs. Codex fixed in 15 minutes with 3%.

u/sreekanth850·2 months ago·39 pts / 57 comm

TechCrunch AI· PRESS

It’s not just one thing — it’s another thing

This sentence construction ("It's not just this — it's that") has become so common in AI-generated writing that it's no longer just a clue that a piece of writing may be synthetic — it's almost a guarantee.

Amanda Silberling·2 months ago

r/MachineLearning· COMMUNITY

How exactly one goes about networking in conferences? [D]

PhD student asks for advice on networking at ICLR conference for internship opportunities.

u/howtorewriteaname·2 months ago·76 pts / 29 comm

r/ClaudeAI· COMMUNITY

Claude Electrician 4.7 Part 2

Satirical post mocking Claude Opus 4.7's reasoning loops on basic electrical troubleshooting task.

u/Clean-Data-259·2 months ago·203 pts / 30 comm

← Front Page30 stories

← Newer Older →