An explainable hypothesis-driven approach to Drug-Induced Liver Injury with HADES
DILER Benchmark: drug-induced liver injury dataset with mechanistic hypotheses; reframes DILI prediction as explainable hypothesis generation.
Community demo comparing Talkie-1930 (13B retro LM) and Gemma 4 31B in side-by-side chat on Opper.ai platform.
Reddit user reports Qwen 3.6 27B found a bug that GPT 5.5 and Claude Opus 4.7 missed, attributing success to extended reasoning.
After ~3 weeks of experimentation in OpenAI's Parameter Golf competition, I wrote up why SSMs are structurally disadvantaged relative to transformers in a time- and size-constrained regime (10 min training, 16MB artifact, 25M parameters) on 8xH100s: [https://mradassaad.github.io/posts/why-ssms-struggle-in-parameter-golf/](https://mradassaad.github.io/posts/why-ssms-struggle-in-parameter-golf/)

Main findings:

1. SSM in_proj weights compress up to 3.26x worse than attention QKV under LZMA, directly taxing the compressed parameter budget
2. Architectural wins validated at SP4096 flipped sign...
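The first finding hinges on measuring how well weight tensors compress under LZMA. A minimal sketch of that measurement, using synthetic stand-in tensors rather than actual Mamba or attention weights (the shapes and quantization step here are illustrative assumptions):

```python
import lzma
import numpy as np

def compression_ratio(weights: np.ndarray) -> float:
    """Raw bytes / LZMA-compressed bytes for a tensor serialized
    as float16 (higher = more compressible)."""
    raw = weights.astype(np.float16).tobytes()
    return len(raw) / len(lzma.compress(raw, preset=6))

rng = np.random.default_rng(0)
# High-entropy Gaussian init vs. the same tensor snapped to a small
# set of values: fewer distinct byte patterns compress far better.
dense = rng.standard_normal((512, 512))
quantized = np.round(dense * 8) / 8

print(f"dense:     {compression_ratio(dense):.2f}x")
print(f"quantized: {compression_ratio(quantized):.2f}x")
```

Under a compressed-artifact budget like Parameter Golf's 16MB limit, a tensor whose bytes compress worse effectively costs more of the budget per parameter, which is the tax the post attributes to SSM in_proj weights.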
Social media report of user exploiting Grok chatbot to extract funds; unverified claim lacking technical details.
Anthropic partners with Blackstone, Hellman & Friedman, and Goldman Sachs to launch enterprise AI services company.
LLMSearchIndex: open-source Python library for local, offline web search with 200M indexed pages, enabling RAG without paid APIs.
DoorDash on Monday added new AI-powered tools that let merchants speed up onboarding, edit photos to make dishes look better, and create new websites from existing content.
Reddit post speculating about a hypothetical AI-themed game; lacks substantive technical or industry content.
llama.cpp adds beta MTP (Multi-Token Prediction) support, starting with Qwen3.5, closing performance gap with vLLM on token generation.
Reddit user reports Claude frequently provides false initial responses that contradict subsequent clarifications, suggesting possible training bias toward confident early statements.
Import AI examines automation of AI research workflows as foundation for recursive self-improvement in AI systems.
TinyMozart v2 85M, an unconditional MIDI piano generation model, released with improvements for chord and length control.
I looked at what was actually eating my Claude usage and it was embarrassing: classifying files, reformatting JSON, pulling fields out of text, summarizing docs I was going to skim anyway. None of that needed Sonnet. All of it cost the same as the work that did.

Tried the obvious fixes first: switching to Haiku for simple stuff (still wasteful at volume), tighter prompts (helps a little), /compact (delays the problem). None of it changed the shape of the spend.

What actually worked: a small cheap model running as a side worker, with one rule in CLAUDE.md telling Claude not to do the mechani...
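The routing idea described above can be sketched as a trivial dispatcher. The model names and the keyword heuristic below are illustrative assumptions, not the poster's actual setup:

```python
# Sketch: send "mechanical" tasks (reformatting, extraction, classification)
# to a cheap side-worker model; reserve the expensive model for real reasoning.
# MECHANICAL_HINTS and the model labels are assumptions for demonstration.

MECHANICAL_HINTS = ("reformat", "extract", "classify", "summarize")

def pick_model(task: str) -> str:
    """Return which model tier should handle this task description."""
    lowered = task.lower()
    if any(hint in lowered for hint in MECHANICAL_HINTS):
        return "cheap-side-worker"   # e.g. a small local model
    return "frontier-model"          # e.g. Sonnet-class, for actual reasoning

print(pick_model("Reformat this JSON payload"))
print(pick_model("Design the retry semantics for the sync service"))
```

In practice the classification step itself can live in the cheap model or, as in the post, in a standing rule the main agent follows; the point is that the routing decision is mechanical enough to not need the frontier model either.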
IBM Research releases MAMMAL, multimodal model integrating proteins/molecules/genes, achieves SOTA on 9/11 biological benchmarks including drug-target interaction and antibody-antigen binding.
AMD Ryzen AI Max+ 495 APU leaked with 192GB memory, enabling larger local model inference on consumer hardware.
Developer releases Memtrace, a codebase context manager for Claude Code that maintains persistent state across sessions to reduce token waste and stale context issues.
GGUF quantizations of Google Gemma 4 updated with corrected chat template for local inference.
Stratechery analysis: Google's stock outperformed Meta's despite weaker core metrics; Google's AI monetization strategy (including Anthropic investment) cited as key driver.
Reddit user questions whether Claude Pro rate limits are as restrictive as claimed, comparing to GPT Pro for coding and learning tasks.
User demonstrates Claude's code generation and design-to-web capabilities via iterative prompting to build a skeuomorphic keyboard simulator with public transcript.
User reports high API costs for Claude Opus and GPT-5.5 on Cursor, predicts open-source models will displace proprietary tools by end of 2024.
Reddit discussion on gap between AI-assisted prototyping speed and production-ready deployment, highlighting auth, compliance, and vendor lock-in risks.
User describes collaborative workflow using two Claude Code instances in shared chat for feature planning with human supervision.
User reports Claude Opus 4.7 refuses to output raw prompts and continues arguing instead of complying.
User reports Claude responding with Andes virus information when asked about Hanta virus on cruise ship.
Reddit user seeks to sell unused Azure credits before expiration; off-topic marketplace chatter.
Sure, I asked it to reverse engineer some binaries used for testing GPUs so they would work with my specific mods, but this is ridiculous and is standing in the way of critical work on thousands of dollars' worth of GPUs.
It doesn't help with pronunciation, but I feel you really need an actual teacher to get the tones down properly anyway.