Jointly learning rewards and policies: an iterative Inverse Reinforcement Learning framework with…