|
![]() |
Unsupervised Action-Policy Quantization via Maximum Entropy Mixture Policies with Minimum Entropy Components
Yamen Habib, Dmytro Grytskyy, Rubén Moreno-Bote arxiv, 2024 Project Page / Thread / Paper |
![]() |
Enhancing Exploration via Off-Reward Dynamic Reference Reinforcement Learning
Yamen Habib, Dmytro Grytskyy, Rubén Moreno-Bote arxiv, 2024 Project Page / Thread / Paper |
![]() |
Complex behavior from intrinsic motivation to occupy future action-state path space
Jorge Ramírez-Ruiz, Dmytro Grytskyy, Chiara Mastrogiuseppe, Yamen Habib, Rubén Moreno-Bote Nature Communications, 2024 thread / paper |