Unsupervised Action-Policy Quantization via Maximum Entropy Mixture Policies with Minimum Entropy Components Yamen Habib, Dmytro Grytskyy, Rubén Moreno-Bote EWRL, 2025 Project Page
Enhancing Exploration via Off-Reward Dynamic Reference Reinforcement Learning Yamen Habib, Dmytro Grytskyy, Rubén Moreno-Bote EWRL, 2024 Project Page
Complex behavior from intrinsic motivation to occupy future action-state path space Jorge Ramírez-Ruiz, Dmytro Grytskyy, Chiara Mastrogiuseppe, Yamen Habib, Rubén Moreno-Bote Nature Communications, 2024 Thread / Paper