The task of inferring true objectives based on a designed reward.