The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


OpenAI· FRONTIER

An update on our mental health-related work

OpenAI updates mental health safety work: parental controls, distress detection, trusted contacts.

OpenAI·4 months ago

Anthropic· FRONTIER

Statement from Dario Amodei on our discussions with the Department of War

Dario Amodei discusses Anthropic's Department of War coordination regarding national security uses of Claude.

Anthropic·4 months ago

Google DeepMind· FRONTIER

Nano Banana 2: Combining Pro capabilities with lightning-fast speed

Google releases Nano Banana 2 image generation model with advanced world knowledge and subject consistency at fast inference speeds.

Google DeepMind·4 months ago

OpenAI· FRONTIER

Pacific Northwest National Laboratory and OpenAI partner to accelerate federal permitting

DraftNEPABench benchmark evaluates AI coding agents on federal permitting tasks; results suggest a potential 15% reduction in NEPA drafting time.

OpenAI·4 months ago

OpenAI· FRONTIER

OpenAI Codex and Figma launch seamless code-to-design experience

OpenAI Codex and Figma integrate to enable seamless code-to-design iteration.

OpenAI·4 months ago

Cohere· FRONTIER

The AI Advantage: How Financial Institutions Win With AI

Cohere analysis of AI adoption in financial services: productivity gains, operational efficiency, and implementation pathways.

Cohere·4 months ago

Cohere· FRONTIER

AI for Financial Institutions: Watch the webinar

Cohere webinar on AI applications in financial services; generic promotional content.

Cohere·4 months ago

Cohere· FRONTIER

AI In Banking: Transforming Finance For The Digital Age

Overview of AI use cases in banking sector covering automation and risk management; commentary without new findings.

Cohere·4 months ago

Cohere· FRONTIER

Generative AI in Finance | Use Cases, Benefits & the Future

Generative AI applications in finance sector with implementation guidance; marketing content without technical depth.

Cohere·4 months ago

Hugging Face· INFRA

Mixture of Experts (MoEs) in Transformers

Hugging Face·4 months ago

Anthropic· FRONTIER

Anthropic acquires Vercept to advance Claude's computer use capabilities

Anthropic acquires Vercept to enhance Claude's computer use and agentic capabilities.

Anthropic·4 months ago

NVIDIA Dev Blog· INFRA

Making Softmax More Efficient with NVIDIA Blackwell Ultra

LLM context lengths are exploding, and architectures are moving toward complex attention schemes like Multi-Head Latent Attention (MLA) and Grouped Query Attention (GQA). As a result, AI “speed of thought” is increasingly governed not by the massive throughput of matrix multiplications, but by the transcendental math of the softmax function. Transcendentals refer to functions that cannot be…

Jamie Li·4 months ago

OpenAI· FRONTIER

Disrupting malicious uses of AI | February 2026

OpenAI threat report examines malicious AI use combining models with websites and social platforms.

OpenAI·4 months ago

Anthropic· FRONTIER

Anthropic’s Responsible Scaling Policy: Version 3.0

Anthropic releases Responsible Scaling Policy v3.0, detailing AI safety protocols for model development.

Anthropic·4 months ago

OpenAI· FRONTIER

Arvind KC appointed Chief People Officer

OpenAI appoints Arvind KC as Chief People Officer to lead organizational scaling and culture.

OpenAI·4 months ago

Anthropic· FRONTIER

Detecting and preventing distillation attacks

Anthropic publishes research on detecting and preventing model distillation attacks.

Anthropic·4 months ago

NVIDIA Dev Blog· INFRA

Using NVFP4 Low-Precision Model Training for Higher Throughput Without Losing Accuracy

As the sizes of AI models and datasets continue to increase, relying only on higher-precision BF16 training is no longer sufficient. Key challenges such as training throughput expectations, memory limits, and rising costs are becoming the primary barriers to scaling transformer models. Using lower-precision training can address these challenges. By reducing the numeric precision used during…

Aditya Vavre·4 months ago

Import AI· ANALYST

Import AI 446: Nuclear LLMs; China's big AI benchmark; measurement and AI policy

Import AI 446 covers nuclear power LLMs, China's comprehensive AI benchmark suite, and measurement for policy.

Jack Clark·4 months ago

OpenAI· FRONTIER

Why we no longer evaluate SWE-bench Verified

OpenAI stops evaluating on SWE-bench Verified, citing contamination and training leakage; recommends SWE-bench Pro instead.

OpenAI·4 months ago

OpenAI· FRONTIER

OpenAI announces Frontier Alliance Partners

OpenAI launches Frontier Alliance Partners program to help enterprises deploy AI agents from pilots to production with secure, scalable infrastructure.

OpenAI·4 months ago

Anthropic· FRONTIER

Making frontier cybersecurity capabilities available to defenders

Claude Code Security, integrated into Claude Code, scans codebases for vulnerabilities and proposes patches; available in limited preview.

Anthropic·4 months ago

OpenAI· FRONTIER

Our First Proof submissions

OpenAI submits proof attempts to First Proof math challenge, demonstrating research-grade reasoning on expert-level problems.

OpenAI·4 months ago

Hugging Face· INFRA

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Hugging Face·4 months ago

Hugging Face· INFRA

Train AI models with Unsloth and Hugging Face Jobs for FREE

Hugging Face·4 months ago

NVIDIA Dev Blog· INFRA

Accelerating Data Processing with NVIDIA Multi-Instance GPU and Locality Domains

NVIDIA flagship data center GPUs in the NVIDIA Ampere, NVIDIA Hopper, and NVIDIA Blackwell families all feature non-uniform memory access (NUMA) behaviors, but expose a single memory space. Most programs therefore do not have an issue with memory non-uniformity. However, as bandwidth increases in newer generation GPUs, there are significant performance and power gains to be had when taking into…

Mukul Joshi·4 months ago

Google DeepMind· FRONTIER

Gemini 3.1 Pro: A smarter model for your most complex tasks

Google DeepMind releases Gemini 3.1 Pro, an upgraded model for complex reasoning and multi-step problem solving.

Google DeepMind·4 months ago

OpenAI· FRONTIER

Advancing independent research on AI alignment

OpenAI commits $7.5M to The Alignment Project for independent AI alignment research addressing AGI safety and security.

OpenAI·4 months ago

OpenAI· FRONTIER

Introducing OpenAI for India

OpenAI expands into India with local infrastructure, enterprise partnerships, and workforce development initiatives.

OpenAI·4 months ago

NVIDIA Dev Blog· INFRA

Unlock Massive Token Throughput with GPU Fractioning in NVIDIA Run:ai

As AI workloads scale, achieving high throughput, efficient resource usage, and predictable latency becomes essential. NVIDIA Run:ai addresses these challenges through intelligent scheduling and dynamic GPU fractioning. GPU fractioning is wholly delivered by NVIDIA Run:ai in any environment: cloud, NCP, and on-premises. This post presents the joint benchmarking effort between NVIDIA and AI…

Boskey Savla·4 months ago

NVIDIA Dev Blog· INFRA

Topping the GPU MODE Kernel Leaderboard with NVIDIA cuda.compute

Python dominates machine learning for its ergonomics, but writing truly fast GPU code has historically meant dropping into C++ to write custom kernels and to maintain bindings back to Python. For most Python developers and researchers, this is a significant barrier to entry. Frameworks like PyTorch address this by implementing kernels in CUDA C++, either handwritten or by leveraging libraries…

Daniel Rodriguez·4 months ago

30 stories