The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Anthropic invests $100 million into the Claude Partner Network

Anthropic launches Claude Partner Network with $100M investment to support enterprise Claude adoption.

Anthropic·3 months ago

Introducing Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning

Agentic AI systems need models with the specialized depth to solve dense technical problems autonomously. They must excel at reasoning, coding, and long-context... Agentic AI systems need models with the specialized depth to solve dense technical problems autonomously. They must excel at reasoning, coding, and long-context analysis, while remaining efficient enough to run continuously at scale. Multi-agent systems generate up to 15x the tokens of standard chats, re-sending history, tool outputs, and reasoning steps at every turn. Over long tasks… Source

Chris Alexiuk·3 months ago

OpenAI· FRONTIER

Designing AI agents to resist prompt injection

ChatGPT agents defend against prompt injection via action constraints and sensitive data protection in agent workflows.

OpenAI·3 months ago

OpenAI· FRONTIER

From model to agent: Equipping the Responses API with a computer environment

OpenAI builds agent runtime using Responses API with shell tools and hosted containers for secure, stateful agent execution.

OpenAI·3 months ago

Anthropic· FRONTIER

Introducing The Anthropic Institute

Anthropic launches The Anthropic Institute to address societal challenges from advanced AI systems.

Anthropic·3 months ago

Meta AI· FRONTIER

Four MTIA Chips in Two Years: Scaling AI Experiences for Billions

Meta describes MTIA custom silicon strategy: four chip iterations in two years for cost-efficient AI serving.

Meta AI·3 months ago

OpenAI· FRONTIER

Rakuten fixes issues twice as fast with Codex

Rakuten reduces issue resolution time by 50% using Codex for code-based troubleshooting workflows.

OpenAI·3 months ago

OpenAI· FRONTIER

Wayfair boosts catalog accuracy and support speed with OpenAI

Wayfair uses OpenAI models for e-commerce support automation, catalog accuracy, and ticket triage at scale.

OpenAI·3 months ago

Anthropic· FRONTIER

Sydney will become Anthropic’s fourth office in Asia-Pacific

Anthropic establishes Sydney as its fourth Asia-Pacific office.

Anthropic·3 months ago

NVIDIA Dev Blog· INFRA

NVIDIA RTX Innovations Are Powering the Next Era of Game Development

NVIDIA RTX ray tracing and AI-powered neural rendering technologies are redefining how games are made, enabling a new standard for visuals and performance. At... NVIDIA RTX ray tracing and AI-powered neural rendering technologies are redefining how games are made, enabling a new standard for visuals and performance. At GDC 2026, NVIDIA unveiled the latest path tracing innovations elevating visual fidelity, on-device AI models enabling players to interact with their favorite experiences in new ways, and enterprise solutions accelerating game development… Source

Ike Nnoli·3 months ago

NVIDIA Dev Blog· INFRA

Reliable AI Coding for Unreal Engine: Improving Accuracy and Reducing Token Costs

Agentic code assistants are moving into daily game development as studios build larger worlds, ship more DLCs, and support distributed teams. These assistants... Agentic code assistants are moving into daily game development as studios build larger worlds, ship more DLCs, and support distributed teams. These assistants can accelerate development by helping with tasks like generating gameplay scaffolding, refactoring repetitive systems, and answering engine-specific questions faster. This post outlines how developers can build reliable AI coding… Source

Paul Logan·3 months ago

OpenAI· FRONTIER

Improving instruction hierarchy in frontier LLMs

OpenAI's IH-Challenge training method improves instruction hierarchy and prompt injection resistance in frontier LLMs.

OpenAI·3 months ago

OpenAI· FRONTIER

New ways to learn math and science in ChatGPT

ChatGPT adds interactive visual explanations for math and science education with real-time formula exploration.

OpenAI·3 months ago

Hugging Face· INFRA

Introducing Storage Buckets on the Hugging Face Hub

Hugging Face·3 months ago

Hugging Face· INFRA

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Hugging Face·3 months ago

Meta AI· FRONTIER

Mapping the World's Forests with Greater Precision: Introducing Canopy Height Maps v2

Meta and World Resources Institute release Canopy Height Maps v2, open-source forest monitoring model and global dataset.

Meta AI·3 months ago

NVIDIA Dev Blog· INFRA

CUDA 13.2 Introduces Enhanced CUDA Tile Support and New Python Features

CUDA 13.2 arrives with a major update: NVIDIA CUDA Tile is now supported on devices of compute capability 8.X architectures (NVIDIA Ampere and NVIDIA Ada), as... CUDA 13.2 arrives with a major update: NVIDIA CUDA Tile is now supported on devices of compute capability 8.X architectures (NVIDIA Ampere and NVIDIA Ada), as well as 10.X, 11.X and 12.X architectures (NVIDIA Blackwell). In an upcoming release of the CUDA Toolkit, all GPU architectures starting with Ampere will be fully supported. If you’re using Ampere, Ada, or Blackwell GPU architectures… Source

Jonathan Bentz·3 months ago

NVIDIA Dev Blog· INFRA

Implementing Falcon-H1 Hybrid Architecture in NVIDIA Megatron Core

In the rapidly evolving landscape of large language model (LLM) development, NVIDIA Megatron Core has emerged as the foundational framework for training massive... In the rapidly evolving landscape of large language model (LLM) development, NVIDIA Megatron Core has emerged as the foundational framework for training massive transformer models at scale. The open source library offers industry-leading parallelism and GPU-optimized performance. Now developed GitHub-first in the NVIDIA/Megatron-LM repo, Megatron Core is increasingly shaped by contributions from… Source

Mireille Fares·3 months ago

NVIDIA Dev Blog· INFRA

Enhancing Distributed Inference Performance with the NVIDIA Inference Transfer Library

Deploying large language models (LLMs) requires large-scale distributed inference, which spreads model computation and request handling across many GPUs and... Deploying large language models (LLMs) requires large-scale distributed inference, which spreads model computation and request handling across many GPUs and nodes to scale to more users while reducing latency. Distributed inference frameworks use techniques such as disaggregated serving, KV cache loading, and wide expert parallelism. In disaggregated serving environments… Source

Seonghee Lee·3 months ago

NVIDIA Dev Blog· INFRA

Removing the Guesswork from Disaggregated Serving

Deploying and optimizing large language models (LLMs) for high-performance, cost-effective serving can be an overwhelming engineering problem. The ideal... Deploying and optimizing large language models (LLMs) for high-performance, cost-effective serving can be an overwhelming engineering problem. The ideal configuration for any given workload (such as hardware, parallelism, and prefill/decode split) resides in a massive, multi-dimensional search space that is impossible to explore manually or through exhaustive testing. AIConfigurator… Source

Tianhao Xu·3 months ago

Google DeepMind· FRONTIER

From games to biology and beyond: 10 years of AlphaGo’s impact

Google DeepMind retrospective on AlphaGo's 10-year impact on scientific discovery and trajectory toward AGI.

Google DeepMind·3 months ago

Import AI· ANALYST

Import AI 448: AI R&D; Bytedance's CUDA-writing agent; on-device satellite AI

Import AI 448 discusses AI R&D productivity, ByteDance's CUDA code generation agent, and edge satellite inference.

Jack Clark·3 months ago

OpenAI· FRONTIER

OpenAI to acquire Promptfoo

OpenAI acquires Promptfoo, an AI security platform for identifying and remediating vulnerabilities in AI systems.

OpenAI·3 months ago

Hugging Face· INFRA

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Hugging Face·3 months ago

Hugging Face· INFRA

LeRobot v0.5.0: Scaling Every Dimension

Hugging Face·3 months ago

Anthropic· FRONTIER

Partnering with Mozilla to improve Firefox’s security

Anthropic partners with Mozilla to enhance Firefox security features.

Anthropic·3 months ago

OpenAI· FRONTIER

Codex Security: now in research preview

Codex Security, an AI security agent in research preview, detects and patches vulnerabilities with higher confidence and reduced false positives.

OpenAI·3 months ago

OpenAI· FRONTIER

How Balyasny Asset Management built an AI research engine

Balyasny Asset Management uses OpenAI models and agent workflows to build an AI-driven investment research engine.

OpenAI·3 months ago

OpenAI· FRONTIER

How Descript engineers multilingual video dubbing at scale

Descript uses OpenAI reasoning models to automate multilingual video dubbing at scale while preserving timing and meaning.

OpenAI·3 months ago

Anthropic· FRONTIER

Where things stand with the Department of War

Dario Amodei outlines Anthropic's engagement with U.S. Department of War on national security AI applications.

Anthropic·3 months ago

← Front Page30 stories

← Newer Older →