In new environments, training a Reinforcement Learning (RL) agent from scratch can prove to be inefficient. The computational and temporal costs can be significantly reduced if the agent is capable of learning across diverse environments and effective...
In new environments, training a Reinforcement Learning (RL) agent from scratch can prove to be inefficient. The computational and temporal costs can be significantly reduced if the agent is capable of learning across diverse environments and effectively engaging in transfer learning. However, achieving learning across multiple environments is challenging due to the varying state and action spaces inherent in different RL problems. A naive parameter sharing with environment-specific layers for different state-action spaces does not effectively facilitate transfer learning. In this work, we present a flexible and environment-agnostic architecture designed to facilitate learning across multiple environments simultaneously, while enabling efficient transfer learning for new environments. We also develop training algorithms within the proposed architecture to facilitate both online and offline RL. Our experiments demonstrate that multi-environment training with one agent is possible in heterogeneous environments and parameter sharing is not effective in transfer learning.