Vol. I · No. 25 · THU, MAY 14, 2026

The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

[AINews] Tasteful Tokenmaxxing

Commentary on tokenization strategies as a recurring theme in AI industry discourse, without specific technical claims or announcements.

·

Grok Voice Think Fast 1.0

xAI releases Grok Voice Think Fast 1.0, a voice agent API for real-time conversational AI applications.

·

Simplify Sparse Deep Learning with Universal Sparse Tensor in nvmath-python

In a previous post, we introduced the Universal Sparse Tensor (UST), enabling developers to decouple a tensor's sparsity from its memory layout for greater flexibility and performance. We're excited to announce the integration of the UST into nvmath-python v0.9.0 to accelerate sparse scientific and deep learning applications. This post provides a walkthrough of key UST features…
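The blurb's core idea, decoupling a tensor's sparsity pattern from its memory layout, can be illustrated without the UST API itself (which the announcement doesn't show). A minimal pure-Python sketch; the helper names and data below are invented for illustration:

```python
# Illustrative sketch only: nvmath-python's actual UST API is not shown
# in the announcement. This demonstrates the underlying idea -- the same
# sparse tensor can live in different memory layouts.

def coo_to_dense(shape, coords, values):
    """Coordinate (COO) layout: parallel lists of (row, col) pairs and values."""
    rows, cols = shape
    dense = [[0.0] * cols for _ in range(rows)]
    for (r, c), v in zip(coords, values):
        dense[r][c] = v
    return dense

def csr_to_dense(shape, indptr, indices, values):
    """Compressed sparse row (CSR) layout: row pointers + column indices."""
    rows, cols = shape
    dense = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for i in range(indptr[r], indptr[r + 1]):
            dense[r][indices[i]] = values[i]
    return dense

# One logical sparse matrix, two storage layouts:
shape = (3, 3)
coo = coo_to_dense(shape, [(0, 1), (1, 0), (2, 2)], [2.0, 1.0, 3.0])
csr = csr_to_dense(shape, [0, 1, 2, 3], [1, 0, 2], [2.0, 1.0, 3.0])
assert coo == csr  # identical tensor, independent of layout
```

Decoupling these two concerns is what lets a library pick the layout best suited to the hardware while the user reasons only about the sparsity pattern.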

·

Forgive my ignorance but how is a 27B model better than 397B?

Is Qwen just incredibly good at doing dense and not so good at doing MoE? I get that dense is generally better than MoE but 27B being better than 397B just doesn’t sit right with me. What are those additional experts even doing then?
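Part of the answer to the question above is active parameters: a top-k MoE routes each token through only a few experts, so only a fraction of its total weights do work per token. A rough sketch with entirely hypothetical numbers (the expert count, routing, and shared fraction are illustrative assumptions, not the actual config of any Qwen model):

```python
def moe_active_params(total_b, n_experts, top_k, shared_frac=0.1):
    """Rough active-parameter estimate for a top-k MoE, in billions.

    All numbers are illustrative assumptions, not real model specs:
    shared_frac is the fraction of weights (attention, embeddings)
    used on every token; the rest is split evenly across experts.
    """
    shared = total_b * shared_frac
    per_expert = total_b * (1 - shared_frac) / n_experts
    return shared + top_k * per_expert

# Hypothetical 397B-total MoE, 128 experts, 8 active per token:
print(round(moe_active_params(397, 128, 8), 1))  # ~62B active per token
```

Under these made-up numbers, the 397B MoE applies only on the order of 60B parameters to each token, which is why a well-trained dense 27B competing with it is less surprising than the headline totals suggest.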

·
30 stories