Roboschool
We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.
We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon reviews.
We’ve created the world’s first Spam-detecting AI trained entirely in simulation and deployed on a physical robot.
We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL’s inconveniences.
In this post we’ll outline new OpenAI research in which agents develop their own language.
Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they’re like optical illusions for machines. In this post we’ll show how adversarial examples work across different mediums, and will discuss why securing systems against them can be difficult.
The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities—whether by validating novel ideas, creating new software systems, or deploying machine learning on robots.
Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode: misspecifying your reward function.
We’re working with Microsoft to start running most of our large-scale experiments on Azure.
Last week we hosted over a hundred and fifty AI practitioners in our offices for our first self-organizing conference on machine learning.
Deep learning is an empirical science, and the quality of a group’s infrastructure is a multiplier on progress. Fortunately, today’s open-source ecosystem makes it possible for anyone to build great deep learning infrastructure.
The latest information about the Unconference is now available at the Unconference wiki, which will be periodically updated with more information for attendees.