A method to detect:

Approach

  1. Demonstrate each subcomponent in isolation
  2. Stat with explicit prompting/training to do the treacherous turn, then spoodfeed less and less to make demonstrations realistic
  3. Build understanding that we are able to get demonstrations with little to no spoonfeeding and failures emerge in a natural training setup

Catching Behaviors

Training Setups