Code

Improving and Accelerating Offline RL in Large Discrete Action Spaces with Structured Policy Initialization

(Paper | Code)

SAINT: Attention-Based Policies for Discrete Combinatorial Action Spaces

(Paper | Code)

BraVE: Offline Reinforcement Learning for Discrete Combinatorial Action Spaces

(Paper | Code)