Provably Improved Context-Based Offline Meta-RL with Attention and Contrastive Learning

Lanqing Li; Yuanhao Huang; Mingzhe Chen; Siteng Luo; Dijun Luo; Junzhou Huang

doi:10.48550/arxiv.2102.10774

Back

Preprint

Provably Improved Context-Based Offline Meta-RL with Attention and Contrastive Learning

Lanqing Li, Yuanhao Huang, Mingzhe Chen, Siteng Luo, Dijun Luo and Junzhou Huang

arXiv (Cornell University)

2021-02-22

DOI: https://doi.org/10.48550/arxiv.2102.10774

Abstract

Meta-learning for offline reinforcement learning (OMRL) is an understudied problem with tremendous potential impact by enabling RL algorithms in many real-world applications. A popular solution to the problem is to infer task identity as augmented state using a context-based encoder, for which efficient learning of robust task representations remains an open challenge. In this work, we provably improve upon one of the SOTA OMRL algorithms, FOCAL, by incorporating intra-task attention mechanism and inter-task contrastive learning objectives, to robustify task representation learning against sparse reward and distribution shift. Theoretical analysis and experiments are presented to demonstrate the superior performance and robustness of our end-to-end and model-free framework compared to prior algorithms across multiple meta-RL benchmarks.

Metrics

9 Record Views

Details

Title: Provably Improved Context-Based Offline Meta-RL with Attention and Contrastive Learning
Creators: Lanqing Li
Yuanhao Huang
Mingzhe Chen
Siteng Luo
Dijun Luo
Junzhou Huang
Publication Details: arXiv (Cornell University)
Academic Unit: CoE - Electrical & Computer Engineering; College of Engineering
Language: English
Resource Type: Preprint
Record Identifier: 991031934107002976