The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Optimizing Communication for Mixture-of-Experts Training with Hybrid Expert Parallel

In LLM training, Expert Parallel (EP) communication for hyperscale mixture-of-experts (MoE) models is challenging. EP communication is essentially all-to-all, but due to its dynamics and sparseness (only topk experts per AI token instead of all experts), it’s challenging to implement and optimize. This post details an efficient MoE EP communication solution, Hybrid-EP, and its use in the…

Fan Yu·5 months ago

Anthropic· FRONTIER

Anthropic partners with Allen Institute and Howard Hughes Medical Institute to accelerate scientific discovery

Anthropic partners with Allen Institute and Howard Hughes Medical Institute to accelerate scientific discovery.

Anthropic·5 months ago

Import AI· ANALYST

Import AI 443: Into the mist: Moltbook, agent ecologies, and the internet in transition

Import AI 443 examines agent ecology systems, Moltbook framework, and adversarial agent corruption risks.

Jack Clark·5 months ago

OpenAI· FRONTIER

Snowflake and OpenAI partner to bring frontier intelligence to enterprise data

OpenAI and Snowflake announce $200M partnership embedding frontier models and agents directly in Snowflake's data platform.

OpenAI·5 months ago

xAI· FRONTIER

xAI joins SpaceX

SpaceX acquires xAI, consolidating AI development with rocket/hardware infrastructure.

xAI·5 months ago

OpenAI· FRONTIER

Introducing the Codex app

OpenAI releases Codex app for macOS enabling parallel multi-agent coding workflows with long-running task support.

OpenAI·5 months ago

NVIDIA Dev Blog· INFRA

Advancing GPU Programming with the CUDA Tile IR Backend for OpenAI Triton

NVIDIA CUDA Tile is a GPU-based programming model that targets portability for NVIDIA Tensor Cores, unlocking peak GPU performance. One of the great things about CUDA Tile is that you can build your own DSL on top of it. This post shares the work NVIDIA is doing to integrate CUDA Tile as a backend for OpenAI Triton, an open source Python DSL designed to write DL kernels for GPUs.

Jie Xin·5 months ago

NVIDIA Dev Blog· INFRA

Establishing a Scalable Sparse Ecosystem with the Universal Sparse Tensor

Sparse tensors are vectors, matrices, and higher-dimensional generalizations with many zeros. They are crucial in various fields such as scientific computing, signal processing, and deep learning due to their efficiency in storage, computation, and power. Despite their benefits, handling sparse tensors manually or through existing libraries is often cumbersome, error-prone, nonportable…

Aart J.C. Bik·5 months ago

NVIDIA Dev Blog· INFRA

Practical Security Guidance for Sandboxing Agentic Workflows and Managing Execution Risk

AI coding agents enable developers to work faster by streamlining tasks and driving automated, test-driven development. However, they also introduce a significant, often overlooked, attack surface by running tools from the command line with the same permissions and entitlements as the user, making them computer use agents, with all the risks those entail. The primary threat to these tools is…

Rich Harang·5 months ago

Google DeepMind· FRONTIER

Project Genie: Experimenting with infinite, interactive worlds

Project Genie lets Google AI Ultra subscribers create and explore infinite interactive worlds via experimental prototype.

Google DeepMind·5 months ago

OpenAI· FRONTIER

Inside OpenAI’s in-house data agent

OpenAI describes internal data agent using GPT-5 and Codex with memory for reasoning over large datasets.

OpenAI·5 months ago

Hugging Face· INFRA

Introducing Daggr: Chain apps programmatically, inspect visually

Hugging Face·5 months ago

OpenAI· FRONTIER

Retiring GPT-4o, GPT-4.1, GPT-4.1 mini, and OpenAI o4-mini in ChatGPT

OpenAI announces retirement of GPT-4o, GPT-4.1, GPT-4.1 mini, and o4-mini from ChatGPT effective February 13, 2026; API unaffected.

OpenAI·5 months ago

OpenAI· FRONTIER

Taisei Corporation shapes the next generation of talent with AI

Taisei Corporation deploys ChatGPT Enterprise for internal HR and talent development workflows.

OpenAI·5 months ago

Anthropic· FRONTIER

ServiceNow chooses Claude to power customer apps and increase internal productivity

ServiceNow adopts Claude to power customer-facing apps and boost internal productivity.

Anthropic·5 months ago

NVIDIA Dev Blog· INFRA

Ensuring Balanced GPU Allocation in Kubernetes Clusters with Time-Based Fairshare

NVIDIA Run:ai v2.24 introduces time-based fairshare, a new scheduling mode that brings fair-share scheduling with time awareness for over-quota resources to Kubernetes clusters. This capability, built on the open source KAI Scheduler that powers NVIDIA Run:ai, addresses a long-standing challenge in shared GPU infrastructure. Consider two teams with equal priority sharing a cluster.

Ekin Karabulut·5 months ago

NVIDIA Dev Blog· INFRA

Speeding Up Variable-Length Training with Dynamic Context Parallelism and NVIDIA Megatron Core

This post introduces Dynamic Context Parallelism (Dynamic-CP), a scheduling approach in NVIDIA Megatron Core used for LLM post-training or DiT pre-training. It dynamically selects the CP size per microbatch to efficiently handle variable-length sequences, achieving up to 1.48x speedup on real-world datasets. In large-scale model training, an often-overlooked bottleneck arises from the…

Kunlun Li·5 months ago

NVIDIA Dev Blog· INFRA

Updating Classifier Evasion for Vision Language Models

Advances in AI architectures have unlocked multimodal functionality, enabling transformer models to process multiple forms of data in the same context. For instance, vision language models (VLMs) can generate output from combined image and text input, enabling developers to build systems that interpret graphs, process camera feeds, or operate with traditionally human interfaces like desktop…

Joseph Lucas·5 months ago

OpenAI· FRONTIER

EMEA Youth & Wellbeing Grant

OpenAI launches €500K EMEA youth safety and wellbeing grant program for NGOs and researchers.

OpenAI·5 months ago

OpenAI· FRONTIER

The next chapter for AI in the EU

OpenAI publishes EU Economic Blueprint 2.0 highlighting AI adoption initiatives and partnerships across Europe.

OpenAI·5 months ago

Hugging Face· INFRA

We Got Claude to Build CUDA Kernels and teach open models!

Hugging Face·5 months ago

xAI· FRONTIER

Grok Imagine API

Grok Imagine API offers video generation with stated advances in quality, cost, latency.

xAI·5 months ago

OpenAI· FRONTIER

Keeping your data safe when an AI agent clicks a link

OpenAI details safeguards that keep user data safe when an AI agent clicks a link, guarding against URL-based data exfiltration and prompt injection.

OpenAI·5 months ago

NVIDIA Dev Blog· INFRA

Accelerating Diffusion Models with an Open, Plug-and-Play Offering

Recent advances in large-scale diffusion models have revolutionized generative AI across multiple domains, from image synthesis to audio generation, 3D asset creation, molecular design, and beyond. These models have demonstrated unprecedented capabilities in producing high-quality, diverse outputs across various conditional generation tasks. Despite these successes…

Weili Nie·5 months ago

Hugging Face· INFRA

OpenAI· FRONTIER

PVH reimagines the future of fashion with OpenAI

PVH Corp adopts ChatGPT Enterprise for fashion design, supply chain, and consumer engagement.

OpenAI·5 months ago

Hugging Face· INFRA

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Hugging Face·5 months ago

OpenAI· FRONTIER

TRUSTBANK uses AI agents to personalize Furusato Nozei gifts

TRUSTBANK and Recursive deploy Choice AI with OpenAI models for personalized Furusato Nozei gift recommendations.

OpenAI·5 months ago

30 stories

Optimizing Communication for Mixture-of-Experts Training with Hybrid Expert Parallel

Anthropic partners with Allen Institute and Howard Hughes Medical Institute to accelerate scientific discovery

Import AI 443: Into the mist: Moltbook, agent ecologies, and the internet in transition

Snowflake and OpenAI partner to bring frontier intelligence to enterprise data

xAI joins SpaceX

Introducing the Codex app

Advancing GPU Programming with the CUDA Tile IR Backend for OpenAI Triton

Establishing a Scalable Sparse Ecosystem with the Universal Sparse Tensor

Practical Security Guidance for Sandboxing Agentic Workflows and Managing Execution Risk

Project Genie: Experimenting with infinite, interactive worlds

Inside OpenAI’s in-house data agent

Introducing Daggr: Chain apps programmatically, inspect visually

Retiring GPT-4o, GPT-4.1, GPT-4.1 mini, and OpenAI o4-mini in ChatGPT

Taisei Corporation shapes the next generation of talent with AI

ServiceNow chooses Claude to power customer apps and increase internal productivity

Ensuring Balanced GPU Allocation in Kubernetes Clusters with Time-Based Fairshare

Speeding Up Variable-Length Training with Dynamic Context Parallelism and NVIDIA Megatron Core

Updating Classifier Evasion for Vision Language Models

EMEA Youth & Wellbeing Grant

The next chapter for AI in the EU

We Got Claude to Build CUDA Kernels and teach open models!

Grok Imagine API

Keeping your data safe when an AI agent clicks a link

Accelerating Diffusion Models with an Open, Plug-and-Play Offering

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs

Anthropic partners with the UK Government to bring AI assistance to GOV.UK services

PVH reimagines the future of fashion with OpenAI

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

TRUSTBANK uses AI agents to personalize Furusato Nozei gifts