The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Jupyter Agents: training LLMs to reason with notebooks

Hugging Face·8 months ago

OpenAI· FRONTIER

SafetyKit scales risk agents with OpenAI’s most capable models

SafetyKit's product uses GPT-5 for content moderation and compliance enforcement, with improved accuracy over legacy systems.

OpenAI·8 months ago

OpenAI· FRONTIER

Scaling accounting capacity with OpenAI

Basis built AI agents using o3, o3-Pro, GPT-4.1, and GPT-5, delivering 30% time savings for accounting firms.

OpenAI·9 months ago

Anthropic· FRONTIER

Our framework for developing safe and trustworthy agents

Anthropic publishes a framework for developing safe and trustworthy autonomous agents, with specified governance principles.

Anthropic·9 months ago

OpenAI· FRONTIER

Resolving digital threats 100x faster with OpenAI

Outtake uses GPT-4.1 and OpenAI o3 agents to detect security threats 100x faster.

OpenAI·10 months ago

OpenAI· FRONTIER

Model ML is helping financial firms rebuild with AI from the ground up

Model ML's CEO discusses AI-native infrastructure and autonomous agents for transforming financial services.

OpenAI·10 months ago

Hugging Face· INFRA

Back to The Future: Evaluating AI Agents on Predicting Future Events

Hugging Face·10 months ago

OpenAI· FRONTIER

No-code personal agents, powered by GPT-4.1 and Realtime API

Genspark built a $36M ARR no-code agent product in 45 days using GPT-4.1 and the OpenAI Realtime API.

OpenAI·10 months ago

Hugging Face· INFRA

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

Hugging Face·11 months ago

Hugging Face· INFRA

CodeAgents + Structure: A Better Way to Execute Actions

Hugging Face·11 months ago

Hugging Face· INFRA

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

Hugging Face·1 year ago

Mistral AI· FRONTIER

Devstral

Devstral: Mistral AI's open-source model optimized for autonomous coding agents and software development.

Mistral AI·1 year ago

Hugging Face· INFRA

Tiny Agents: an MCP-powered agent in 50 lines of code

Hugging Face·1 year ago

OpenAI· FRONTIER

BrowseComp: a benchmark for browsing agents

OpenAI introduces BrowseComp, a benchmark for evaluating web-browsing agent capabilities.

OpenAI·1 year ago

OpenAI· FRONTIER

PaperBench: Evaluating AI’s Ability to Replicate AI Research

PaperBench: a new benchmark measuring AI agents' ability to replicate state-of-the-art research papers.

OpenAI·1 year ago

OpenAI· FRONTIER

Moving from intent-based bots to proactive AI agents

OpenAI shifts from intent-based bots to proactive AI agents architecture.

OpenAI·1 year ago

OpenAI· FRONTIER

Automating 90% of finance and legal work with agents

Hebbia's AI platform claims to automate 90% of finance and legal work using OpenAI models.

OpenAI·1 year ago

OpenAI· FRONTIER

Introducing next-generation audio models in the API

OpenAI released advanced text-to-speech and speech-to-text APIs with customizable voice instructions for voice agents.

OpenAI·1 year ago

OpenAI· FRONTIER

New tools for building agents

OpenAI releases new tools for building and deploying AI agents.

OpenAI·1 year ago

xAI· FRONTIER

Grok 3 Beta — The Age of Reasoning Agents

xAI unveils an early preview of Grok 3, emphasizing advanced reasoning and agentic capabilities.

xAI·1 year ago

Hugging Face· INFRA

OpenAI· FRONTIER

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

MLE-bench is a benchmark for evaluating AI agents on machine learning engineering tasks.

OpenAI·2 years ago

OpenAI· FRONTIER

Automating customer support agents

MavenAGI launches a GPT-4-powered customer service agent; Tripadvisor, ClickUp, and Rho deploy it for support automation.

OpenAI·2 years ago

Hugging Face· INFRA

License to Call: Introducing Transformers Agents 2.0

Hugging Face·2 years ago

OpenAI· FRONTIER

Klarna's AI assistant does the work of 700 full-time agents

Klarna is using AI to revolutionize personal shopping, customer service, and employee productivity.

OpenAI·2 years ago

Hugging Face· INFRA

Open-source LLMs as LangChain Agents

Hugging Face·2 years ago

30 matches

Open-source DeepResearch – Freeing our search agents

We now support VLMs in smolagents!

AI Agents Are Here. What Now?

Introducing smolagents: simple agents that write actions in code.

Google DeepMind at NeurIPS 2024
