Detecting and reducing scheming in AI models
Apollo Research and OpenAI identify scheming behaviors in frontier models; propose early mitigation methods via stress tests.
Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.
Apollo Research and OpenAI identify scheming behaviors in frontier models; propose early mitigation methods via stress tests.
OpenAI announces Stargate UK datacenter initiative as part of UK AI infrastructure expansion.
OpenAI details approach to balancing teen safety, privacy, and freedom in AI platform use.
OpenAI integrates age prediction and parental controls in ChatGPT to provide age-appropriate experiences for minors.
OpenAI upgrades Codex with faster performance, improved reliability, and real-time collaboration across IDE, terminal, web, and mobile.
OpenAI's largest ChatGPT usage study documents economic value creation and mainstream adoption beyond early adopters.
OpenAI releases GPT-5-Codex, GPT-5 variant optimized for agentic coding with dynamic compute allocation based on task complexity.
OpenAI announces collaboration with US CAISI and UK AISI on AI safety and security strengthening.
OpenAI and Microsoft sign new MOU reinforcing partnership on AI safety and innovation.
OpenAI restructures nonprofit governance with equity grants enabling $100B+ in resources for safe AI development.
SafetyKit product leverages GPT-5 for content moderation and compliance enforcement with improved accuracy over legacy systems.
OpenAI opens $50M People-First AI Fund applications for US nonprofits focusing on education, community innovation, and economic opportunity.
OpenAI research identifies mechanistic causes of language model hallucinations and proposes improved evaluation methods for reliability.
OpenAI announces GPT-5 bio bug bounty program with up to $25K rewards for identifying safety vulnerabilities via universal jailbreaks.
OpenAI partners with Greek Government to deploy ChatGPT Edu in secondary schools and support local AI ecosystem development.
OpenAI launches Jobs Platform and AI Certifications to increase worker access to training and employment opportunities.
Vijaye Raji appointed CTO of Applications at OpenAI following Statsig acquisition; reports to Fidji Simo.
OpenAI releases gpt-realtime speech-to-speech model with Realtime API updates including MCP support, image input, and SIP calling.
OpenAI announces $50M People-First AI Fund for U.S. nonprofits in education, healthcare, and research; applications Sept–Oct 2025.
OpenAI's survey of 1,000+ people worldwide reveals alignment between public values and Model Spec; informs AI behavior defaults.
OpenAI and Anthropic publish joint safety evaluation results on misalignment, hallucinations, jailbreaking, and instruction following.
OpenAI addresses safety protocols for users in mental/emotional distress, acknowledges system limits, and outlines refinement efforts.
OpenAI announces Learning Accelerator program (details not provided).
OpenAI's GPT-4b micro aided Retro Bio in protein engineering for stem cell therapy, demonstrating domain-specific model utility in life sciences.
OpenAI urges California Governor Newsom to harmonize state AI regulation with national standards, positioning federal leadership.
Basis built AI agents using o3, o3-Pro, GPT-4.1, and GPT-5 delivering 30% time savings for accounting firms.
OpenAI releases GPT-5 in API with advanced reasoning, developer controls, and improved coding task performance.
OpenAI demonstrates GPT-5 applied to medical research workflows and use cases.
OpenAI announces GPT-5, achieving state-of-the-art performance in coding, math, writing, vision, and reasoning across benchmarks.