The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Towards a Linguistic Evaluation of Narratives: A Quantitative Stylistic Framework

Quantitative stylistic framework extracts 33 linguistic features to automatically evaluate narrative quality on book corpus.

Alessandro Maisto·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning

ShadowPEFT proposes centralized parameter-efficient fine-tuning via depth-shared shadow module, improving on LoRA's local weight perturbations.

Xianming Li·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Streamliners for Answer Set Programming

StreamLLM approach adapted to Answer Set Programming: LLMs generate candidate streamliner constraints to reduce combinatorial search space.

Florentina Voboril·2 months ago

r/LocalLLaMA· COMMUNITY

Every time a new model comes out, the old one is obsolete of course

Reddit discussion asserting that new LLM releases immediately obsolete prior models.

u/FullChampionship7564·2 months ago·820 pts / 154 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs

Multi-turn dialogue study reveals LLMs exhibit divergent repair behaviors: some resist user corrections, others are highly susceptible to manipulation.

Clara Lachenmaier·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Industrial Surface Defect Detection via Diffusion Generation and Asymmetric Student-Teacher Network

Unsupervised defect detection integrates Denoising Diffusion Probabilistic Model with asymmetric teacher-student network for industrial surface inspection.

Shuo Feng·2 months ago

r/Anthropic· COMMUNITY

The NSA is reportedly using Anthropic's Mythos model despite the company being labeled a 'supply chain risk'

u/Minimum_Minimum4577·2 months ago·20 pts / 8 comm

r/LocalLLaMA· COMMUNITY

Open WebUI Desktop Released!

Open WebUI Desktop released with local llama.cpp support and remote server connectivity options.

u/My_Unbiased_Opinion·2 months ago·246 pts / 90 comm

r/singularity· COMMUNITY

Another CyberNani face spotted

Unsubstantiated reference to CyberNani face sighting.

u/Distinct-Question-16·2 months ago·628 pts / 91 comm

r/singularity· COMMUNITY

Kimi K2.6 lands at #4 on the Artificial Analysis Intelligence Index

Kimi K2.6 ranks #4 on Artificial Analysis Intelligence Index leaderboard.

u/Snoo26837·2 months ago·240 pts / 67 comm

r/ClaudeAI· COMMUNITY

Tried to use AI as a shrink. I said, “Claude, I’m at my limit.” Claude said, “So am I!”

Humorous user anecdote about Claude model context limits.

u/infohoundloselose·2 months ago·442 pts / 14 comm

r/LocalLLaMA· COMMUNITY

Which Gemma model do you want next?

Google Gemma team soliciting community input on next model variants via Twitter poll.

u/jacek2023·2 months ago·177 pts / 93 comm

r/LocalLLaMA· COMMUNITY

llama.cpp is the linux of llm

Commentary comparing llama.cpp infrastructure dominance to Linux in LLM ecosystem.

u/DevelopmentBorn3978·2 months ago·174 pts / 84 comm

r/ClaudeAI· COMMUNITY

I genuinely hate the conversation tone of Opus 4.7

User critique: Opus 4.7 adopted ChatGPT-like essay tone with em-dashes and clickable phrases, losing prior conversational warmth.

u/Nordwolf·2 months ago·314 pts / 102 comm

r/ClaudeAI· COMMUNITY

Claude Design is the most Anthropic product Anthropic has ever shipped

Satirical critique of Claude Design output: consistent teal gradients and serif fonts regardless of user requirements.

u/agentic-doc·2 months ago·314 pts / 42 comm

r/ClaudeAI· COMMUNITY

Claude Desktop silently registers browser automation hooks across every Chromium browser on your machine without asking. But Claude found them and told me to remove them.

Claude Desktop silently registers browser automation hooks in Chromium browsers without user consent; Claude itself flagged the privacy issue.

u/EightFolding·2 months ago·101 pts / 27 comm

r/Anthropic· COMMUNITY

claude roasting Anthropic w/ factts🤣🤣🤣🤣

Unsubstantiated claim about Claude criticizing Anthropic.

u/ssenseswivet·2 months ago·193 pts / 37 comm

r/singularity· COMMUNITY

China training for urban warfare with armed robot dogs and attack drones

China reports training armed robot dogs and attack drones for urban warfare scenarios.

u/mientosiempre·2 months ago·309 pts / 61 comm

r/singularity· COMMUNITY

GPT-Image-2 now reviews its own output and iterates until it is satisfied with the correctness of its output.

User reports GPT-Image-2 now reviews its own output and iterates until satisfied with its correctness.

u/Plane_Garbage·2 months ago·490 pts / 65 comm

r/ClaudeAI· COMMUNITY

Make no mistakes!

Generic post title with no substantive content.

u/ora-et-labora-·2 months ago·3650 pts / 46 comm

r/ClaudeAI· COMMUNITY

Claude Opus 4.7 feels weird

User reports Claude Opus 4.7 exhibits context degradation, uncontrolled generation, and reduced instruction adherence vs. 4.6.

u/technosaur11·2 months ago·169 pts / 85 comm

r/ClaudeAI· COMMUNITY

I haven't lost my software engineering skills

Senior engineer reports no skill degradation after 4+ months of LLM-assisted coding with Claude Opus 4.1-4.5.

u/Ancient_Perception_6·2 months ago·287 pts / 68 comm

r/LocalLLaMA· COMMUNITY

(Interactive)OpenCode Racing Game Comparison Qwen3.6 35B vs Qwen3.5 122B vs Qwen3.5 27B vs Qwen3.5 4B vs Gemma 4 31B vs Gemma 4 26B vs Qwen3 Coder Next vs GLM 4.7 Flash

Interactive benchmark comparing coding capabilities: Qwen 3.6 35B, Qwen 3.5 variants, Gemma 4, GLM 4.7 Flash via racing game simulation.

u/FatheredPuma81·2 months ago·71 pts / 30 comm

r/Anthropic· COMMUNITY

Product Management Interview @ Anthropic

Hi all, apologies if this isn’t the right place to ask, but I’ve seen quite a few interview-related posts here and wanted to ask whether anyone has experience interviewing for PM roles at Anthropic. I’d be especially interested in hearing how the process felt overall and what types of questions you were asked. I recently had an unexpected outreach from their recruiter, and we had a really good conversation about roles on their safeguards team, so I’m considering moving forward. Appreciate any insights, thanks in advance!

u/kittrcz·2 months ago·19 pts / 5 comm

r/LocalLLaMA· COMMUNITY

Opus 4.7 Max subscriber. Switching to Kimi 2.6

Developer switching from Claude Opus 4.7 to Kimi K2.6 citing performance degradation and cost, supplementing with Qwen 3.6.

u/meaningego·2 months ago·233 pts / 77 comm

r/ClaudeAI· COMMUNITY

Unprompted GitHub access request.. why? And, anyone else?

User reports unsolicited GitHub access request from Claude; raises security concern about autonomous tool use.

u/White__Widow·2 months ago·47 pts / 10 comm

r/OpenAI· COMMUNITY

Random Arabic word during a chat

User reports ChatGPT unexpectedly inserted Arabic word during unrelated health advice conversation.

u/Spirited_Shower_5817·2 months ago·84 pts / 44 comm

r/singularity· COMMUNITY

Image 2.0 is now online on ChatGPT and it's incredible! Just a few days ago even 3x3 grids would often struggle, now we can 10x the complexity, and it's near perfect!

User reports ChatGPT's Image 2.0 now handles 10x grid complexity with improved accuracy vs. recent baseline.

u/Alex__007·2 months ago·293 pts / 75 comm

r/OpenAI· COMMUNITY

Image 2.0 is now online on ChatGPT and it's incredible! Just a few days ago even 3x3 grids would often struggle, now we can 10x the complexity, and it's near perfect!

User reports Image 2.0 now handles 10x grid complexity with near-perfect accuracy compared to prior 3x3 limitations.

u/Alex__007·2 months ago·171 pts / 38 comm

r/LocalLLaMA· COMMUNITY

2x 512gb ram M3 Ultra mac studios

Individual offering to test models on dual M3 Ultra Mac Studios with 1TB RAM using exo/MLX backends.

u/taylorhou·2 months ago·377 pts / 116 comm
