The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


Search Live is expanding globally

Google expands Search Live globally across all supported languages and locations with AI Mode.

Liza Ma·3 months ago

Meta AI· FRONTIER

Introducing TRIBE v2: A Predictive Foundation Model Trained to Understand How the Human Brain Processes Complex Stimuli

Meta releases TRIBE v2, a foundation model predicting human brain responses to complex visual stimuli.

Meta AI·3 months ago

Google DeepMind· FRONTIER

Protecting people from harmful manipulation

Google DeepMind researches AI manipulation risks in finance and health domains, proposing new safety countermeasures.

Google DeepMind·3 months ago

NVIDIA Dev Blog· INFRA

Maximize AI Infrastructure Throughput by Consolidating Underutilized GPU Workloads

In production Kubernetes environments, the difference between model requirements and GPU size creates inefficiencies. Lightweight automatic speech recognition (ASR) or text-to-speech (TTS) models may require only 10 GB of VRAM, yet occupy an entire GPU in standard Kubernetes deployments. Because the scheduler maps a model to one or more GPUs and can’t easily share across GPUs across models…

Sagar Desai·3 months ago

Google DeepMind· FRONTIER

Lyria 3 Pro: Create longer tracks in more Google products

Google releases Lyria 3 Pro for longer music generation with structural awareness, expanding availability across Google products.

Google DeepMind·3 months ago

Google AI (Gemma)· FRONTIER

Lyria 3 Pro: Create longer tracks in more Google products

Google brings Lyria 3 Pro music generation to professional creative tools and products.

Myriam Hamed Torres·3 months ago

Google AI (Gemma)· FRONTIER

Build with Lyria 3, our newest music generation model

Lyria 3 music generation model launches in paid preview via Gemini API and AI Studio.

Alisa Fortin·3 months ago

NVIDIA Dev Blog· INFRA

How Centralized Radar Processing on NVIDIA DRIVE Enables Safer, Smarter Level 4 Autonomy

In the current state of automotive radar, machine learning engineers can’t work with camera-equivalent raw RGB images. Instead, they work with the output of radar constant false alarm rate (CFAR), which is similar to computer vision (CV) edge detections. The communications and compute architectures haven’t kept pace with trends in AI and the needs of Level 4 autonomy, despite radar being a staple…

Lachlan Dowling·3 months ago

NVIDIA Dev Blog· INFRA

Designing Protein Binders Using the Generative Model Proteina-Complexa

Developing new protein-based therapies and catalysts involves the challenging task of designing protein binders, or proteins that bind to a target protein or small molecule. The search space for possible amino acid sequence permutations and resulting 3D protein structures for a designed binder is vast, and achieving strong, specific binding requires careful optimization of the interactions between…

Kyle Gion·3 months ago

NVIDIA Dev Blog· INFRA

Scaling Token Factory Revenue and AI Efficiency by Maximizing Performance per Watt

In the AI era, power is the ultimate constraint, and every AI factory operates within a hard limit. This makes performance per watt, the rate at which power is converted into revenue-generating intelligence, the defining metric for modern AI infrastructure. AI data centers now operate as token factories tied directly to the energy ecosystem, where access to land, power…

Kibibi Moseley·3 months ago

OpenAI· FRONTIER

Inside our approach to the Model Spec

OpenAI publishes Model Spec: public framework defining model behavior policies balancing safety, user freedom, and transparency.

OpenAI·3 months ago

OpenAI· FRONTIER

Introducing the OpenAI Safety Bug Bounty program

OpenAI launches Safety Bug Bounty program targeting agentic vulnerabilities, prompt injection, and data exfiltration risks.

OpenAI·3 months ago

NVIDIA Dev Blog· INFRA

Building NVIDIA Nemotron 3 Agents for Reasoning, Multimodal RAG, Voice, and Safety

Agentic AI is an ecosystem where specialized models work together to handle planning, reasoning, retrieval, and safety guardrailing. As these systems scale, developers need models that can understand real-world multimodal data, converse naturally with users globally, and operate safely across languages and modalities. At GTC 2026, NVIDIA introduced a new generation of NVIDIA Nemotron models…

Chintan Patel·3 months ago

OpenAI· FRONTIER

Helping developers build safer AI experiences for teens

OpenAI releases gpt-oss-safeguard teen safety policies enabling prompt-based age-specific content moderation for developers.

OpenAI·3 months ago

OpenAI· FRONTIER

Powering product discovery in ChatGPT

ChatGPT introduces Agentic Commerce Protocol for visually immersive shopping with product discovery, comparisons, and merchant APIs.

OpenAI·3 months ago

OpenAI· FRONTIER

Update on the OpenAI Foundation

OpenAI Foundation commits $1B to curing diseases, economic opportunity, AI resilience, and community programs.

OpenAI·3 months ago

Hugging Face· INFRA

A New Framework for Evaluating Voice Agents (EVA)

Hugging Face·3 months ago

NVIDIA Dev Blog· INFRA

NVIDIA IGX Thor Powers Industrial, Medical, and Robotics Edge AI Applications

Industrial and medical systems are rapidly increasing the use of high-performance AI to improve worker productivity, human-machine interaction, and downtime management. From factory automation cells to autonomous mobile platforms to surgical rooms, operators are deploying increasingly complex generative AI models, more sensors, and higher-fidelity data streams at the edge.

Suhas Hariharapura Sheshadri·3 months ago

Mistral AI· FRONTIER

Speaking of Voxtral

Mistral open-sources Voxtral, a fast, adaptable TTS model for voice agents with real-time synthesis.

Mistral AI·3 months ago

Import AI· ANALYST

Import AI 450: China's electronic warfare model; traumatized LLMs; and a scaling law for cyberattacks

Import AI 450 covers Chinese electronic warfare model, LLM psychological effects research, and cyberattack scaling laws.

Jack Clark·3 months ago

NVIDIA Dev Blog· INFRA

Building a Zero-Trust Architecture for Confidential AI Factories

AI is moving from experimentation to production. However, most data enterprises need exists outside the public cloud. This includes sensitive information like patient records, market research, and legacy systems containing enterprise knowledge. There’s also a risk of using private data with AI models, and adoption is often slowed or blocked by privacy and trust concerns.

Hema Bontha·3 months ago

NVIDIA Dev Blog· INFRA

Deploying Disaggregated LLM Inference Workloads on Kubernetes

As large language model (LLM) inference workloads grow in complexity, a single monolithic serving process starts to hit its limits. Prefill and decode stages have fundamentally different compute profiles, yet traditional deployments force them onto the same hardware, leaving GPUs underutilized and scaling inflexible. Disaggregated serving addresses this by splitting the inference pipeline…

Anish Maddipoti·3 months ago

Cohere· FRONTIER

Introducing Command R7B: Fast and efficient generative AI

Cohere releases Command R7B, compact generative model optimized for speed/efficiency on commodity GPUs and edge devices.

Cohere·3 months ago

OpenAI· FRONTIER

Creating with Sora Safely

Sora 2 and Sora app integrate concrete safety protections addressing video synthesis abuse and creation platform risks.

OpenAI·3 months ago

Hugging Face· INFRA

Build a Domain-Specific Embedding Model in Under a Day

Hugging Face·3 months ago

OpenAI· FRONTIER

How we monitor internal coding agents for misalignment

OpenAI uses chain-of-thought monitoring to detect misalignment risks in internal coding agents via real-world deployment analysis.

OpenAI·3 months ago

OpenAI· FRONTIER

OpenAI to acquire Astral

OpenAI acquires Astral to accelerate Python developer tools and expand Codex capabilities.

OpenAI·3 months ago

NVIDIA Dev Blog· INFRA

How to Build Deep Agents for Enterprise Search with NVIDIA AI-Q and LangChain

While consumer AI offers powerful capabilities, workplace tools often suffer from disjointed data and limited context. Built with LangChain, the NVIDIA AI-Q blueprint is an open source template that bridges this gap. LangChain recently introduced an enterprise agent platform built with NVIDIA AI to support scalable, production-ready agent development. This tutorial, available as an NVIDIA…

Sean Lopp·3 months ago

NVIDIA Dev Blog· INFRA

Building the AI Grid with NVIDIA: Orchestrating Intelligence Everywhere

AI-native services are exposing a new bottleneck in AI infrastructure: As millions of users, agents, and devices demand access to intelligence, the challenge is shifting from peak training throughput to delivering deterministic inference at scale (predictable latency, jitter, and sustainable token economics). NVIDIA announced at GTC 2026 that telcos and distributed cloud providers are…

Sree Sankar·3 months ago

Hugging Face· INFRA

State of Open Source on Hugging Face: Spring 2026

Hugging Face·3 months ago

30 stories