The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

r/Anthropic· COMMUNITY

Casually beating every other deep research agent out there with a simple Claude Code harness

Open-source research agent built on Claude Code outperforms OpenAI and NVIDIA systems in deep research benchmarking.

u/heisdancingdancing·3 days ago·11 pts / 11 comm

r/ClaudeAI· COMMUNITY

I just got RickRolled by claude making a web app that can remotely control smart tvs for a client of mine...I'm not even mad. I asked him to try testing a Youtube video hahahahaha

u/Rangizingo·3 days ago·22 pts / 8 comm

Ars Technica AI· PRESS

Canadian election databases use "canary traps"—and they work

Intentional errors can be useful.

Nate Anderson ·3 days ago

r/singularity· COMMUNITY

Anthropic co-founder Jack Clark says AI is nearing the point where it can automate AI research

Jack Clark (Anthropic co-founder) estimates 30% probability AI research automation by end-2027, 60%+ by end-2028, citing rapid progress from coding to ML systems research.

u/Outside-Iron-8242·3 days ago·128 pts / 43 comm

r/ClaudeAI· COMMUNITY

The em dashes ( — ) | The unsaid AI SLOP Tax

Reddit discussion on em dashes as an unintended fingerprint of AI-generated content, creating social pressure to avoid natural writing patterns.

u/Familiar-Classroom47·3 days ago·34 pts / 22 comm

r/LocalLLaMA· COMMUNITY

White House Considers Vetting A.I. Models Before They Are Released

White House exploring pre-release vetting requirements for AI models, raising policy questions for open-weights distribution.

u/fallingdowndizzyvr·3 days ago·49 pts / 54 comm·+ covered by others

TechCrunch AI· PRESS

Image AI models now drive app growth, beating chatbot upgrades

Appfigures finds visual model launches generate 6.5x more downloads — but most don’t convert that spike into revenue.

Sarah Perez·3 days ago

Ars Technica AI· PRESS

Influential study touting ChatGPT in education retracted over red flags

The retracted study on ChatGPT in education was already cited hundreds of times.

Jeremy Hsu ·3 days ago

r/Anthropic· COMMUNITY

To People who are Having Problems with Wandering Opus 4.7

Reddit user describes subjective experience with Claude Opus 4.7 behavior and pattern-matching cognition; anecdotal observation without technical evidence.

u/Jessgitalong·3 days ago·11 pts / 34 comm

r/ClaudeAI· COMMUNITY

Top 6 Claude Skills: 15th April to 3rd May

Found some Open Source Claude skills from last 15 days. Some of them are pretty decent to use, personally liked the npm downloads one. Check out: **- brand-alchemy:** A brand strategy and naming skill that interrogates your thoughts for branding first, then applies phonosemantics, category design frameworks, and auto-checks domain availability across any TLD. **- npm-downloads-to-leads:** Give it a list of npm packages. It pulls 12 weeks of download data, scores each one by growth velocity, maps maintainers to GitHub and X, and gives you a ranked lead brief who built it, how to reach the...

u/Sam_Tech1·3 days ago·21 pts / 7 comm

r/OpenAI· COMMUNITY

Is Codex the best right now?

Reddit discussion questioning Codex's current competitive position and download trends; lacks substantive analysis or new information.

u/LeTanLoc98·3 days ago·72 pts / 33 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

SpecKV: Adaptive Speculative Decoding with Compression-Aware Gamma Selection

SpecKV adapts speculative decoding's speculation length dynamically based on target model compression, improving LLM inference throughput.

Shikhar Shukla·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Unsupervised Machine Learning for Detecting Structural Anomalies in European Regional Statistics

Unsupervised ML framework detects structural anomalies in European regional socio-economic statistics using Eurostat NUTS2 data.

Bogdan Oancea·3 days ago

Simon Willison· ANALYST

TRE Python binding — ReDoS robustness demo

Simon Willison demonstrates TRE regex engine's resistance to ReDoS attacks via experimental Python binding, comparing resilience against standard library.

Simon Willison·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Multi-fidelity surrogates for mechanics of composites: from co-kriging to multi-fidelity neural networks

Review of multi-fidelity surrogate modeling techniques for composite materials prediction combining low and high-fidelity simulation data.

Haizhou Wen·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Enhancing RL Generalizability in Robotics through SHAP Analysis of Algorithms and Hyperparameters

SHAP-based framework decomposes RL algorithm and hyperparameter contributions to generalization gaps in robotic control tasks.

Lingxiao Kong·3 days ago

r/ClaudeAI· COMMUNITY

built a plugin so my parallel Claude Code sessions can message each other instead of me alt-tabbing

I usually have two or more Claude Code sessions open at once. One in the backend repo, one in the frontend. Half the time I'd be in the frontend asking "wait, what shape did the user object end up as?", then alt-tab, ask the backend session, copy the answer, alt-tab back, paste. The other Claude was right there. It already knew. I was the bottleneck. So I wrote a plugin called Relay. In the frontend window I just say: ▎ask the backend session what the user object looks like The backend session sees the question between turns, answers it, and the reply pops up in my frontend session as a n...

u/vildanbina·3 days ago·26 pts / 15 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Standing on the Shoulders of Giants: Stabilized Knowledge Distillation for Cross--Language Code Clone Detection

Knowledge distillation from LLMs to compact open-source models for cross-language code clone detection without black-box inference costs.

Mohamad Khajezade·3 days ago

r/Anthropic· COMMUNITY

Opus 4.7 is beyond bad

Reddit user reports degraded performance in Claude Opus 4.7 compared to 4.6, speculating smaller base model or optimization tradeoffs.

u/AbsoluteRoster·3 days ago·31 pts / 11 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Trust, but Verify: Peeling Low-Bit Transformer Networks for Training Monitoring

Layer-wise peeling framework monitors transformer training dynamics by locally optimizing each layer against intermediate representations.

Arian Eamaz·3 days ago

r/OpenAI· COMMUNITY

ChatGPT started responding without thinking? Did you know this is enabled by default?

Reddit user reports ChatGPT extended thinking feature enabled by default; likely user-facing feature discussion without technical depth.

u/Moist_Emu6168·3 days ago·53 pts / 25 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

(POSTER) From Sensors to Insight: Rapid, Edge-to-Core Application Development for Sensor-Driven Applications

Pattern-based AI-assisted methodology for rapid sensor-driven application development using Pegasus workflows on FABRIC testbed.

Komal Thareja·3 days ago·+ covered by others

arXiv (cs.AI/CL/LG)· ACADEMIA

A second-order method on the Stiefel manifold via Newton$\unicode{x2013}$Schulz

Second-order retraction-free optimization method on Stiefel manifolds via Newton-Schulz iteration with quadratic convergence.

Xinhui Xiong·3 days ago

r/OpenAI· COMMUNITY

Lesson from sam altman

Reddit post recounts Sam Altman interview on talent retention at OpenAI during Meta's AI hiring competition.

u/CartographerFeisty66·3 days ago·52 pts / 10 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

A Closed-Form Persistence-Landmark Pipeline for Certified Point-Cloud and Graph Classification

PLACE: closed-form persistent-homology pipeline for point cloud and graph classification with margin-based guarantees and per-prediction certificates.

Sushovan Majhi·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition

VideoNet benchmark with 1,000 domain-specific actions revives action recognition evaluation for vision-language models.

Tanush Yadav·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

HAAS: A Policy-Aware Framework for Adaptive Task Allocation Between Humans and Artificial Intelligence Systems

HAAS framework enables adaptive task allocation between humans and AI systems in software engineering and manufacturing contexts.

Vicente Pelechanoa·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces

JACTUS unifies parameter-efficient fine-tuning and model compression into single joint optimization framework.

Jingze Ge·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

First-Order Efficiency for Probabilistic Value Estimation via A Statistical Viewpoint

Statistical approach improves Monte Carlo estimation of Shapley values and semivalues for model explainability.

Ziqi Liu·3 days ago

r/ClaudeAI· COMMUNITY

Don't like em dashes? Add this to your preferences or .md

User shares prompt injection technique to reduce em dash usage in Claude via system preferences.

u/shiftingsmith·3 days ago·26 pts / 32 comm

← Front Page30 stories

← Newer Older →