|
Code
- Improving and Accelerating Offline RL in Large Discrete Action Spaces with Structured Policy Initialization (Paper | Code)
- SAINT: Attention-Based Policies for Discrete Combinatorial Action Spaces (Paper | Code)
- BraVE: Offline Reinforcement Learning for Discrete Combinatorial Action Spaces (Paper | Code)
|