A new initiative for developing third-party model evaluations
Anthropic launches third-party model evaluation initiative to establish independent benchmarking standards.
Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.
Anthropic launches third-party model evaluation initiative to establish independent benchmarking standards.
CriticGPT: GPT-4-based critique model assists human trainers in identifying ChatGPT errors during RLHF training.
OpenAI partners with TIME media for archival content integration and sourced article links in AI responses.
Anthropic expands Claude access for U.S. government agencies with compliance and security features.
Anthropic releases Projects feature enabling persistent, multi-turn collaboration workflows in Claude interface.
OpenAI acquires Rockset, a data infrastructure company, to strengthen internal systems.
Anthropic releases Claude 3.5 Sonnet with improved performance across reasoning, code, and multimodal tasks.
OpenAI announces cybersecurity grant program to fund research integrating AI into threat detection and defense.
OpenAI proposes consistency models for diffusion-based generation, enabling single-step sampling instead of iterative processes.
OpenAI details holistic NLP approach to content moderation classification for real-world deployment.
OpenAI publishes improved training techniques for consistency models, advancing single-step generative model performance.
Paf case study: 70% employee adoption of ChatGPT Enterprise for coding and business tasks via custom GPTs.
OpenAI highlights agentic sales prospecting use case claiming 10x growth attribution.
Color Health launches Cancer Copilot using GPT-4o to automate diagnostic workup and accelerate cancer treatment access.
Retired U.S. Army General Paul Nakasone joins OpenAI Board, adding cybersecurity expertise to Safety and Security Committee.
Anthropic outlines red teaming methodology challenges and scalability limits in adversarial AI evaluation.
OpenAI and Apple announce partnership to integrate ChatGPT into Apple devices and services.
OpenAI appoints Sarah Friar as CFO and Kevin Weil as CPO.
OpenAI details Voice Engine text-to-speech technology and safety research protocols for synthetic voice deployment.