This work aims to improve the zero-shot generalization performance in navigation on novel layouts.
We propose a parameterized action reinforcement learning algorithm to improve the performance of match plan generation in Bing search.
This work proposes a reinforcement learning framework for compositional object-oriented environments.