[New Optimizer] 🌹 Rose: low VRAM, easy to use, great results, Apache 2.0 [P]
Rose: stateless PyTorch optimizer with low VRAM footprint and fast convergence, released under Apache 2.0.
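Rose's actual update rule isn't described in the blurb, so as a sketch of why a stateless optimizer has a low VRAM footprint, here is a sign-SGD-style step in plain Python: it keeps no per-parameter buffers between steps, whereas Adam carries two FP32 moment tensors (roughly 8 extra bytes per parameter). The function name and update rule are illustrative, not Rose's.

```python
def sign_sgd_step(params, grads, lr=1e-3):
    """Stateless update: move each parameter by -lr * sign(grad).
    No optimizer state survives the call, so extra memory is ~0 --
    contrast with Adam, which stores two moment buffers per parameter."""
    return [p - lr * (1 if g > 0 else -1 if g < 0 else 0)
            for p, g in zip(params, grads)]

params = sign_sgd_step([0.5, -0.2], [0.1, -0.3], lr=0.01)
# params[0] ≈ 0.49, params[1] ≈ -0.19
```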
DeepSeek V4 Pro shows weaker-than-expected performance on LMSYS Arena user preference voting, a crowdsourced benchmark distinct from capability measurement.
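Arena-style preference voting turns pairwise human votes into a rating rather than measuring capability directly. A simplified online Elo update sketches the aggregation (Chatbot Arena itself fits a Bradley-Terry model over all votes; this is the single-vote view):

```python
def elo_update(r_a, r_b, winner, k=4.0):
    """Apply one pairwise vote. expected_a is the standard Elo
    win probability; k controls how far one vote moves the ratings."""
    expected_a = 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))
    score_a = 1.0 if winner == "a" else 0.0
    r_a += k * (score_a - expected_a)
    r_b += k * ((1.0 - score_a) - (1.0 - expected_a))
    return r_a, r_b

# Two equally rated models: one win moves each rating by k/2.
ra, rb = elo_update(1000.0, 1000.0, "a")
# ra -> 1002.0, rb -> 998.0
```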
Technical deep-dive on DeepSeek V4 architecture: hybrid sparse attention, manifold-constrained connections, and FP4 quantization innovations vs. V3.
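DeepSeek's FP4 format is not detailed in this summary; generic symmetric 4-bit quantization illustrates the basic trade it makes — one shared scale per block of weights, integer codes in [-7, 7], and roughly a 4x memory cut versus FP16 at some precision cost. This is a teaching sketch, not V4's actual scheme.

```python
def quantize_4bit(xs):
    """Symmetric 4-bit quantization of one block of floats:
    pick a shared scale so the largest value maps to +/-7,
    then round every value to an integer code in [-7, 7]."""
    scale = max(abs(x) for x in xs) / 7.0 or 1.0  # avoid scale=0 for all-zero blocks
    q = [max(-7, min(7, round(x / scale))) for x in xs]
    return q, scale

def dequantize(q, scale):
    return [v * scale for v in q]

q, s = quantize_4bit([0.5, -2.0, 3.5])
# s = 0.5, q = [1, -4, 7]; dequantize recovers the inputs exactly here
```

With real weights the round-trip is lossy; formats like FP4 spend the 4 bits non-uniformly to reduce that loss.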
User shares local hardware build specs for AI workloads including CPU, GPU setup, and thermal management configuration.
Reddit thread comparing DS4-Flash and Qwen3.6; lacks substantive analysis or benchmark data.
Anthropic outlines safeguards for Claude during US midterms and global elections to mitigate disinformation and manipulation risks.
Chinese AI company DeepSeek released a preview of its hotly anticipated next-generation AI model V4 on Friday, saying that the open-source model can compete with leading closed-source systems from US rivals including Anthropic, Google, and OpenAI. DeepSeek says V4 marks a major improvement over prior models, especially in coding, a capability that has become central to AI agents and helped drive the success of tools like ChatGPT Codex and Claude Code. The release is also a milestone for China's chip industry, with DeepSeek explicitly highlighting compatibility with domestic Huawei technology....
[Image: The three finalists for the World Press Photo of the year | World Press Photo] We love to muse over how "real" photography is defined here at The Verge now that generative AI is so prolific, and the World Press Photo competition might have the answer. The prestigious award celebrates the best of photojournalism, where capturing reality is paramount. The winning entry for 2026 - "Separated by ICE," captured by photojournalist Carol Guzy - was announced yesterday. The harrowing photograph shows children clinging to their father after an immigration hearing. The photo had to abide by spec...
Reddit discussion comparing OpenCode vs ClaudeCode inference tools for Qwen 3.5 27B on Linux.
Reddit discussion questioning why GPT 5.5 Pro underperforms GPT 5.4 Pro on the HLE benchmark with tools.
Qwen 3.6 35B-A3B MoE model achieves 250+ tok/s on AMD Radeon 780M iGPU via llama.cpp Vulkan.
DeepSeek-v4 demonstrates 384K token context window by generating 100KB single-file HTML application on user request.
Reddit discussion speculating on ICML 2026 acceptance score thresholds before notification on April 30.
Reddit user argues GPT 5.5 feels more intuitive despite lower-than-expected benchmark gains, citing improved argument coverage.
I have a 20x Claude account and have been using Opus 4.7 exclusively for all my code. I noticed that even after asking multiple times for a code review, Opus still wouldn't get there 100%. Here is what I did: 1. Installed the Codex CLI and ran it in a tmux session. 2. Claude created a PR for Codex to review. 3. Claude pinged Codex via the shell so I could see Codex's thinking and approve any file permissions; Claude set a wake-up window. 4. Codex reviewed the PR and left comments. 5. Claude woke up and validated the comments before editing the code. Surprisingly, Claude had missed a lot of things...
Reddit discussion thread about Deepseek v4; lacks substantive detail or official announcement.
DeepSeek releases V4-Pro (1.6T params, 49B active) and V4-Flash (284B/13B) with 1M context, largest open-weights models, MIT licensed.
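The reported specs imply some useful back-of-envelope numbers. Assuming 4-bit weights (0.5 bytes per parameter) — an assumption, since the serving precision isn't stated here — the arithmetic looks like this; real deployments add KV cache, activations, and runtime overhead on top:

```python
def moe_summary(total_params_b, active_params_b, bytes_per_param=0.5):
    """Rough MoE sizing: total_params_b / active_params_b are in billions.
    Billions of params * bytes/param gives weight memory directly in GB."""
    return {
        "active_fraction": active_params_b / total_params_b,
        "weight_memory_gb": total_params_b * bytes_per_param,
    }

pro = moe_summary(1600, 49)    # V4-Pro: 1.6T total, 49B active
flash = moe_summary(284, 13)   # V4-Flash: 284B total, 13B active
# pro: ~3.1% of params active per token, ~800 GB of 4-bit weights
# flash: ~4.6% active, ~142 GB
```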
Reddit user reports Claude refusing game dev prompts, suspects safety filter over-blocking benign 'self-destruct' game mechanic naming.
Latent Space newsletter item referencing GPT 5.5 and OpenAI Codex Superapp with minimal detail; unclear if announcement or speculation.
DeepSeek-V4 does not include multimodal capabilities; user speculates on future roadmap.
DeepSeek v4 Flash offers competitive pricing for its model size on the official API.
DeepSeek plans to scale up V4 inference on Huawei hardware to 950 supernodes in H2 2026, targeting a price reduction for the Pro tier.
Simon Willison releases a small utility that converts millisecond durations to human-readable time formats.
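The output format of Willison's actual utility isn't shown here; a minimal sketch of the idea — decompose a millisecond count into hours, minutes, seconds, and leftover milliseconds, dropping zero units:

```python
def human_duration(ms):
    """Render a millisecond count as e.g. '1h 2m 5s' or '1s 500ms'.
    (Illustrative only; the real tool's formatting may differ.)"""
    seconds, ms = divmod(int(ms), 1000)
    minutes, seconds = divmod(seconds, 60)
    hours, minutes = divmod(minutes, 60)
    parts = []
    if hours:
        parts.append(f"{hours}h")
    if minutes:
        parts.append(f"{minutes}m")
    if seconds:
        parts.append(f"{seconds}s")
    if ms or not parts:  # always emit something, even for 0
        parts.append(f"{ms}ms")
    return " ".join(parts)

human_duration(3_725_000)  # "1h 2m 5s"
human_duration(1500)       # "1s 500ms"
```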
DeepSeek V4 benchmark results released, with comparative performance data against other frontier models.
Reddit user reports subjective quality regression in Claude Opus 4.7 compared to 4.5, citing reduced intuition and increased need for explicit guidance.
Simon Willison's newsletter includes a new chapter on Agentic Engineering Patterns plus curated links and blog posts.