The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

Hugging Face·4 months ago

A new way to express yourself: Gemini can now create music

Gemini app integrates Lyria 3 music generation model, enabling 30-second track creation from text or image prompts.

Google DeepMind·4 months ago

How NVIDIA Extreme Hardware-Software Co-Design Delivered a Large Inference Boost for Sarvam AI’s Sovereign Models

As global AI adoption accelerates, developers face a growing challenge: delivering large language model (LLM) performance that meets real-world latency and cost requirements. Running models with tens of billions of parameters in production, especially for conversational or voice-based AI agents, demands high throughput, low latency, and predictable service-level performance.

Utkarsh Uppal·4 months ago

OpenAI· FRONTIER

Introducing EVMbench

OpenAI and Paradigm introduce EVMbench, a benchmark for evaluating AI agents on smart contract vulnerability detection and exploitation.

OpenAI·4 months ago

Hugging Face· INFRA

One-Shot Any Web App with Gradio's gr.HTML

Hugging Face·4 months ago

Anthropic· FRONTIER

Anthropic and the Government of Rwanda sign MOU for AI in health and education

Anthropic and the Government of Rwanda sign an MOU to deploy AI in the health and education sectors.

Anthropic·4 months ago

Anthropic· FRONTIER

Introducing Claude Sonnet 4.6

Anthropic releases Claude Sonnet 4.6 with frontier performance in coding, agents, and professional applications.

Anthropic·4 months ago

NVIDIA Dev Blog· INFRA

Build AI-Ready Knowledge Systems Using 5 Essential Multimodal RAG Capabilities

Enterprise data is inherently complex: real-world documents are multimodal, spanning text, tables, charts and graphs, images, diagrams, scanned pages, forms, and embedded metadata. Financial reports carry critical insights in tables, engineering manuals rely on diagrams, and legal documents often include annotated or scanned content. Retrieval-augmented generation (RAG) was created to ground…

Shruthii Sathyanarayanan·4 months ago

Google DeepMind· FRONTIER

Accelerating discovery in India through AI-powered science and education

Google DeepMind launches National Partnerships for AI initiative in India to scale AI applications in science and education.

Google DeepMind·4 months ago

Anthropic· FRONTIER

Anthropic and Infosys collaborate to build AI agents for telecommunications and other regulated industries

Anthropic partners with Infosys to develop AI agents for telecommunications and regulated industries.

Anthropic·4 months ago

Import AI· ANALYST

Import AI 445: Timing superintelligence; AIs solve frontier math proofs; a new ML research benchmark

Import AI 445 examines superintelligence timeline predictions, frontier math theorem solving, and new ML research benchmark.

Jack Clark·4 months ago

Anthropic· FRONTIER

Anthropic opens Bengaluru office and announces new partnerships across India

Anthropic opens Bengaluru office and announces partnerships across India to expand regional presence.

Anthropic·4 months ago

Anthropic· FRONTIER

Anthropic partners with CodePath to bring Claude to the US’s largest collegiate computer science program

Anthropic partners with CodePath to integrate Claude into the US's largest collegiate computer science program.

Anthropic·4 months ago

Anthropic· FRONTIER

Chris Liddell appointed to Anthropic’s board of directors

Chris Liddell appointed to Anthropic's board of directors.

Anthropic·4 months ago

OpenAI· FRONTIER

GPT-5.2 derives a new result in theoretical physics

GPT-5.2 proposes novel gluon amplitude formula in theoretical physics, later formally proved by OpenAI and academic collaborators.

OpenAI·4 months ago

OpenAI· FRONTIER

Introducing Lockdown Mode and Elevated Risk labels in ChatGPT

OpenAI introduces Lockdown Mode and Elevated Risk labels in ChatGPT to defend against prompt injection and data exfiltration attacks.

OpenAI·4 months ago

OpenAI· FRONTIER

Scaling social science research

OpenAI releases GABRIEL, an open-source toolkit using GPT to convert qualitative text and images into quantitative data for social science research.

OpenAI·4 months ago

OpenAI· FRONTIER

Beyond rate limits: scaling access to Codex and Sora

OpenAI describes real-time access infrastructure combining rate limits, usage tracking, and credits for Sora and Codex.

OpenAI·4 months ago

Cohere· FRONTIER

Cohere expands partnership with SAP to provide Europe sovereign AI solutions

Cohere and SAP expand partnership to deploy sovereign AI solutions for European enterprises through SAP Sovereign Cloud.

Cohere·4 months ago

Hugging Face· INFRA

Custom Kernels for All from Codex and Claude

Hugging Face·4 months ago

Anthropic· FRONTIER

Anthropic raises $30 billion in Series G funding at $380 billion post-money valuation

Anthropic raises $30B Series G at $380B valuation; $14B run-rate revenue growing 10x annually.

Anthropic·4 months ago

Google DeepMind· FRONTIER

Gemini 3 Deep Think: Advancing science, research and engineering

Gemini 3 Deep Think updated for specialized reasoning in science, research, and engineering problem-solving.

Google DeepMind·4 months ago

Anthropic· FRONTIER

Anthropic is donating $20 million to Public First Action

Anthropic donates $20 million to Public First Action.

Anthropic·4 months ago

OpenAI· FRONTIER

Introducing GPT-5.3-Codex-Spark

OpenAI releases GPT-5.3-Codex-Spark, a real-time coding model with 15x faster generation and 128k context, in research preview.

OpenAI·4 months ago

Hugging Face· INFRA

OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments

Hugging Face·4 months ago

Anthropic· FRONTIER

Covering electricity price increases from our data centers

Anthropic covers electricity cost increases from its data centers.

Anthropic·4 months ago

OpenAI· FRONTIER

Harness engineering: leveraging Codex in an agent-first world

OpenAI technical staff discuss engineering patterns for building agent systems with Codex as the foundation model.

OpenAI·4 months ago

NVIDIA Dev Blog· INFRA

R²D²: Scaling Multimodal Robot Learning with NVIDIA Isaac Lab

Building robust, intelligent robots requires testing them in complex environments. However, gathering data in the physical world is expensive, slow, and often dangerous. It is nearly impossible to safely train for real-world critical risks, such as high-speed collisions or hardware failures. Worse, real-world data is usually biased toward “normal” conditions, leaving robots unprepared for the…

Oyindamola Omotuyi·4 months ago

NVIDIA Dev Blog· INFRA

Using Accelerated Computing to Live-Steer Scientific Experiments at Massive Research Facilities

Scientists and engineers who design and build unique scientific research facilities face similar challenges. These include managing massive data rates that exceed current computational infrastructure capacity to extract scientific insights and driving the experiments in real time. These challenges are obstacles to maximizing the impact of scientific discoveries and significantly slow the pace of…

Quynh L. Nguyen·4 months ago

NVIDIA Dev Blog· INFRA

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy

NVIDIA TensorRT LLM enables developers to build high-performance inference engines for large language models (LLMs), but deploying a new architecture traditionally requires significant manual effort. To address this challenge, today we are announcing the availability of AutoDeploy as a beta feature in TensorRT LLM. AutoDeploy compiles off-the-shelf PyTorch models into inference-optimized…

Lucas Liebenwein·4 months ago

30 stories

