The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

OpenAI· FRONTIER

Roboschool

We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.

OpenAI·9 years ago

OpenAI· FRONTIER

Equivalence between policy gradients and soft Q-learning

OpenAI·9 years ago

OpenAI· FRONTIER

Stochastic Neural Networks for hierarchical reinforcement learning

OpenAI·9 years ago

OpenAI· FRONTIER

Unsupervised sentiment neuron

We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.

OpenAI·9 years ago

OpenAI· FRONTIER

Spam detection in the physical world

We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.

OpenAI·9 years ago

OpenAI· FRONTIER

Evolution strategies as a scalable alternative to reinforcement learning

We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL’s inconveniences.

OpenAI·9 years ago

OpenAI· FRONTIER

One-shot imitation learning

OpenAI·9 years ago

OpenAI· FRONTIER

Distill

We’re excited to support today’s launch of Distill, a new kind of journal aimed at excellent communication of machine learning results (novel or existing).

OpenAI·9 years ago

OpenAI· FRONTIER

Learning to communicate

In this post we’ll outline new OpenAI research in which agents develop their own language.

OpenAI·9 years ago

OpenAI· FRONTIER

Emergence of grounded compositional language in multi-agent populations

OpenAI·9 years ago

OpenAI· FRONTIER

Prediction and control with temporal segment models

OpenAI·9 years ago

OpenAI· FRONTIER

Third-person imitation learning

OpenAI·9 years ago

OpenAI· FRONTIER

Attacking machine learning with adversarial examples

Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they’re like optical illusions for machines. In this post we’ll show how adversarial examples work across different mediums, and will discuss why securing systems against them can be difficult.

OpenAI·9 years ago

OpenAI· FRONTIER

Adversarial attacks on neural network policies

OpenAI·9 years ago

OpenAI· FRONTIER

Team update

The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities—whether by validating novel ideas, creating new software systems, or deploying machine learning on robots.

OpenAI·9 years ago

OpenAI· FRONTIER

PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications

OpenAI·9 years ago

OpenAI· FRONTIER

Faulty reward functions in the wild

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.

OpenAI·10 years ago

OpenAI· FRONTIER

Universe

We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.

OpenAI·10 years ago

OpenAI· FRONTIER

OpenAI and Microsoft

We’re working with Microsoft to start running most of our large-scale experiments on Azure.

OpenAI·10 years ago

OpenAI· FRONTIER

#Exploration: A study of count-based exploration for deep reinforcement learning

OpenAI·10 years ago

OpenAI· FRONTIER

On the quantitative analysis of decoder-based generative models

OpenAI·10 years ago

OpenAI· FRONTIER

A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models

OpenAI·10 years ago

OpenAI· FRONTIER

RL²: Fast reinforcement learning via slow reinforcement learning

OpenAI·10 years ago

OpenAI· FRONTIER

Variational lossy autoencoder

OpenAI·10 years ago

OpenAI· FRONTIER

Extensions and limitations of the neural GPU

OpenAI·10 years ago

OpenAI· FRONTIER

Semi-supervised knowledge transfer for deep learning from private training data

OpenAI·10 years ago

OpenAI· FRONTIER

Report from the self-organizing conference

Last week we hosted over a hundred and fifty AI practitioners in our offices for our first self-organizing conference on machine learning.

OpenAI·10 years ago

OpenAI· FRONTIER

Transfer from simulation to real world through learning deep inverse dynamics model

OpenAI·10 years ago

OpenAI· FRONTIER

Infrastructure for deep learning

Deep learning is an empirical science, and the quality of a group’s infrastructure is a multiplier on progress. Fortunately, today’s open-source ecosystem makes it possible for anyone to build great deep learning infrastructure.

OpenAI·10 years ago

OpenAI· FRONTIER

Machine Learning Unconference

The latest information about the Unconference is now available at the Unconference wiki, which will be periodically updated with more information for attendees.

OpenAI·10 years ago

← Front Page30 stories

← Newer Older →