Hindsight experience replay

A motivating example for hindsight experience replay is the bit-flipping problem. Start with a binary number as the initial state; each action flips any single bit, and the goal is to reach a particular target state. It is, of course, a toy problem that can be solved easily. However, say if we…
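The setup described so far (a binary state, one-bit-flip actions, and a target state) can be sketched in a few lines of Python. This is only an illustrative sketch: the names `BitFlipEnv` and `solve_directly`, the default of 8 bits, and the random choice of start and goal are assumptions made for the example, and no reward signal is modeled because the text above does not specify one.

```python
import random


class BitFlipEnv:
    """Hypothetical toy environment: flip one bit per step to reach a goal."""

    def __init__(self, n_bits=8, seed=None):
        self.n_bits = n_bits
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        # Draw a random initial state and a random target state.
        self.state = [self.rng.randint(0, 1) for _ in range(self.n_bits)]
        self.goal = [self.rng.randint(0, 1) for _ in range(self.n_bits)]
        return tuple(self.state), tuple(self.goal)

    def step(self, action):
        # An action is the index of the bit to flip.
        self.state[action] ^= 1
        done = self.state == self.goal
        return tuple(self.state), done


def solve_directly(env):
    """Flip every bit that differs from the goal (possible when the goal is known)."""
    steps = 0
    done = env.state == env.goal
    for i in range(env.n_bits):
        if env.state[i] != env.goal[i]:
            _, done = env.step(i)
            steps += 1
    return steps, done


env = BitFlipEnv(n_bits=8, seed=0)
steps, done = solve_directly(env)
print(f"reached goal: {done} after {steps} flips")
```

With the goal in plain view, the problem takes at most one flip per bit, which is why it counts as a toy problem.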