The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Opus 4.7 reminds me of Haiku. It's a bit of nostalgia using it and all of the pain.

Reddit user compares Claude Opus 4.7 unfavorably to 4.6, noting reduced capability on abstract prompts similar to Haiku.

u/Wide-Ad-1349·22 days ago·25 pts / 13 comm

NVIDIA Dev Blog· INFRA

Scaling the AI-Ready Data Center with NVIDIA RTX PRO 4500 Blackwell Server Edition and NVIDIA vGPU 20

AI integration is redefining mainstream enterprise applications, from productivity software like Microsoft Office to more complex design and engineering tools. This shift requires the modern data center to move beyond single-purpose silos. For developers, gaining access to dedicated GPU compute can often be a bottleneck. Virtual machines (VMs) solve part of this challenge by providing secure…

Phoebe Lee·22 days ago

The Verge AI· PRESS

AI failure could trigger the next financial crisis, warns Elizabeth Warren

"I know a bubble when I see one." That's what Sen. Elizabeth Warren (D-MA), who led the push to create a new consumer financial regulator in the wake of the 2008 recession, told a crowd at a Vanderbilt Policy Accelerator event in Washington, DC on Wednesday. Warren warned of what she called "striking" parallels to that crisis in the AI industry. While she believes the technology has "enormous potential," she warned that AI companies' massive spending and borrowing practices are creating a tinderbox and Congress should step in. Though the AI industry has grown rapidly, Warren said the pace isn...

Lauren Feiner·22 days ago

r/Anthropic· COMMUNITY

It’s not like Mythos solved P vs NP - let’s all chill

I don’t get what the fuss about Mythos is. From the reporting I’ve seen, Mythos found a critical vulnerability in OpenBSD, which is known for robust security, that went unnoticed by humans for 27 years. So what? Sure, maybe* it was a super obscure bug to find (*it had to have been very obscure to avoid 27 years of reviews by humans). I repeat: so what? Anthropic, the company with the models used for the majority of serious coding etc., used all the data it had access to, and presumably a lot of compute, to train a computer to be able to find bugs made by humans that humans missed when ...

u/SpecialAttention9861·22 days ago·52 pts / 66 comm

The Verge AI· PRESS

OpenAI now lets teams make custom bots that can do work on their own

OpenAI is giving users of its Business, Enterprise, Edu, and Teachers plans access to cloud-based "workspace" agents available in ChatGPT that can perform business tasks. In its blog post, OpenAI gives examples of agents like one that finds product feedback on the web and sends a report in Slack and a sales agent that can draft follow-up emails in Gmail. These new agents follow increasing interest in agents across the AI landscape, especially after OpenClaw - the AI agent formerly known as Clawdbot and Moltbot that touts itself as the "AI that actually does things" - went viral. OpenClaw foun...

Jay Peters·22 days ago

r/OpenAI· COMMUNITY

AI arms race now

u/Embarrassed-Slip8094·22 days ago·1454 pts / 93 comm

NVIDIA Dev Blog· INFRA

Advancing Emerging Optimizers for Accelerated LLM Training with NVIDIA Megatron

Higher-order optimization algorithms such as Shampoo have been effectively applied in neural network training for at least a decade. These methods have achieved significant success more recently when applied to leading LLMs. In particular, Muon (MomentUm Orthogonalized by Newton-Schulz) was used to train some of today’s best open source models, including Kimi K2 and GLM-5.

Hao Wu·22 days ago

r/ClaudeAI· COMMUNITY

I've been using Claude Cowork since launch. Here's what actually works for non-technical tasks (no code).

User guide for non-technical knowledge work using Claude Cowork, emphasizing multi-step workflows without coding.

u/geekeek123·22 days ago·186 pts / 34 comm

TechCrunch AI· PRESS

How SpaceX preempted a $2B fundraise with a $60B buyout offer

Cursor was on track to close a $2 billion funding round this week but chose to halt discussions after SpaceX offered a $10 billion "collaboration fee" and a path to a $60 billion acquisition.

Marina Temkin·22 days ago

r/LocalLLaMA· COMMUNITY

Dense vs. MoE gap is shrinking fast with the 3.6-27B release

Benchmark analysis shows Qwen3.6-27B dense vs. Qwen3.6-35B-A3B MoE gap narrowing; MoE gains +4.9pp on SWE-bench coding but dense leads overall.

u/Usual-Carrot6352·22 days ago·262 pts / 79 comm

r/Anthropic· COMMUNITY

Is this seriously the solution to rate limits? Just pay $100/mo now?

Claude Code is being moved behind a $100/mo paywall. Would you pay that for an AI coding tool?

u/Saykudan·22 days ago·86 pts / 96 comm

Latent Space· ANALYST

Shopify’s AI Phase Transition: 2026 Usage Explosion, Unlimited Opus-4.6 Token Budget, Tangle, Tangent, SimGym — with Mikhail Parakhin, Shopify CTO

Shopify CTO discusses 2026 AI adoption roadmap, unlimited Claude Opus 4.6 token budget, and internal tools (Tangle, Tangent, SimGym).

Latent Space·22 days ago

r/LocalLLaMA· COMMUNITY

Qwen3 TTS is seriously underrated - I got it running locally in real-time and it's one of the most expressive open TTS models I've tried

User reports Qwen3 TTS running locally in real-time with expressive output, integrated into Persona Engine lip-sync pipeline.

u/fagenorn·22 days ago·507 pts / 92 comm

TechCrunch AI· PRESS

Google Cloud launches two new AI chips to compete with Nvidia

Google's newest TPUs are faster and cheaper than the previous versions. But the company is still embracing Nvidia in its cloud — for now.

Julie Bort·22 days ago

Ars Technica AI· PRESS

Anthropic tested removing Claude Code from the Pro plan

Untenable demand has Anthropic exploring new approaches to rationing its service.

Samuel Axon·22 days ago

r/Anthropic· COMMUNITY

Must be magic or something

Reddit user reports Claude Opus 4.7 performs worse than 4.6 for their use case; expresses frustration with degradation.

u/Nnaz123·22 days ago·76 pts / 16 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

SpeechParaling-Bench: A Comprehensive Benchmark for Paralinguistic-Aware Speech Generation

SpeechParaling-Bench expands paralinguistic feature evaluation for Large Audio-Language Models from 50 to 100+ features across 1,000+ English-Chinese speech queries.

Ruohan Liu·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Parallel-SFT: Improving Zero-Shot Cross-Programming-Language Transfer for Code RL

Parallel-SFT improves zero-shot cross-programming-language transfer for code RL by enabling skills learned in high-resource languages like Python to transfer to lower-resource languages.

Zhaofeng Wu·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

AVISE: Framework for Evaluating the Security of AI Systems

AVISE is an open-source modular framework for identifying vulnerabilities and evaluating security of AI systems using extended Red Queen attack theory.

Mikko Lempinen·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

FedSIR: Spectral Client Identification and Relabeling for Federated Learning with Noisy Labels

FedSIR proposes spectral client identification and relabeling to mitigate label noise in federated learning across distributed clients.

Sina Gholami·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Closing the Domain Gap in Biomedical Imaging by In-Context Control Samples

CS-ARM-BN addresses batch effects in biomedical imaging via meta-learning adaptation using in-context control samples to improve domain generalization.

Ana Sanchez-Fernandez·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Global Offshore Wind Infrastructure: Deployment and Operational Dynamics from Dense Sentinel-1 Time Series

Global Sentinel-1 SAR time series dataset monitors offshore wind infrastructure deployment and operations from 2016Q1 onward at high temporal resolution.

Thorsten Hoeser·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Stream-CQSA: Avoiding Out-of-Memory in Attention Computation via Flexible Workload Scheduling

Stream-CQSA avoids out-of-memory failures in long-context LLMs by decomposing quadratic attention via cyclic quorum set theory into independent subsequence computations.

Yiming Bian·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Convergent Evolution: How Different Language Models Learn Similar Number Representations

Study shows Transformers, LSTMs, RNNs, and word embeddings converge on period-T periodic features for number representation; proves Fourier sparsity is necessary but insufficient for mod-T separability.

Deqing Fu·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control

ParetoSlider enables multi-objective RL post-training for diffusion models with inference-time control over conflicting reward criteria without early scalarization.

Shelly Golan·22 days ago

The Verge AI· PRESS

Watch Sony’s elite ping-pong robot beat top-ranked players

Ace is the first robot that can beat the best human players while following the official rules of table tennis. Humans have been building ping-pong playing robots for decades, such as Omron's FORPHEUS that challenged amateur competitors at CES 2017. What sets Ace apart from the rest is that the robot, which was developed by Sony's AI division, is the first that can hold its own against top-ranked human players and occasionally even beat them in matches that follow the official rules of the International Table Tennis Federation (ITTF). AI is already capable of besting humans a...

Andrew Liszewski·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Diagnosing CFG Interpretation in LLMs

RoboGrid framework reveals LLMs maintain surface syntax but fail semantic/behavioral adherence to context-free grammars under recursion depth and complexity stress-tests.

Hanqi Li·22 days ago

r/ClaudeAI· COMMUNITY

The most complete Claude Code cheat sheet 🧠

User shares comprehensive cheat sheet for Claude Code covering shortcuts, workflows, and MCP setup.

u/OneClimate8489·22 days ago·306 pts / 29 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Model

OMIBench evaluates vision-language models on multi-image Olympiad-level reasoning across biology, chemistry, mathematics, and physics.

Qiguang Chen·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Relative Principals, Pluralistic Alignment, and the Structural Value Alignment Problem

Principal-agent framework reconceptualizes AI alignment as structural governance problem across objectives, information, and multiple stakeholders.

Travis LaCroix·22 days ago
