| Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint).   | AAAI | 2024 | 0 | 
        
        
            | Averaging n-step Returns Reduces Variance in Reinforcement Learning.   | ICML | 2024 | 0 | 
        
        
            | Proper Laplacian Representation Learning.   | ICLR | 2024 | 0 | 
        
        
            | GVFs in the real world: making predictions online for water treatment.   | MLJ | 2024 | 0 | 
        
        
            | Investigating the properties of neural network representations in reinforcement learning.   | Artificial Intelligence | 2024 | 0 | 
        
        
            | Deep Laplacian-based Options for Temporally-Extended Exploration.   | ICML | 2023 | 0 | 
        
        
            | Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning.   | ICML | 2023 | 0 | 
        
        
            | Temporal Abstraction in Reinforcement Learning with the Successor Representation.   | JMLR | 2023 | 0 | 
        
        
            | Reward-respecting subtasks for model-based reinforcement learning.   | Artificial Intelligence | 2023 | 0 | 
        
        
            | Temporal abstractions-augmented temporally contrastive learning: An alternative to the Laplacian in RL.   | UAI | 2022 | 5 | 
        
        
            | A general class of surrogate functions for stable and efficient reinforcement learning.   | AISTATS | 2022 | 0 | 
        
        
            | Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning.   | ICLR | 2021 | 84 | 
        
        
            | Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization.   | ICML | 2021 | 0 | 
        
        
            | Exploration in Reinforcement Learning with Deep Covering Options.   | ICLR | 2020 | 34 | 
        
        
            | On Bonus Based Exploration Methods In The Arcade Learning Environment.   | ICLR | 2020 | 36 | 
        
        
            | An operator view of policy gradient methods.   | NIPS/NeurIPS | 2020 | 15 | 
        
        
            | Count-Based Exploration with the Successor Representation.   | AAAI | 2020 | 0 | 
        
        
            | Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation.   | IROS | 2018 | 9 | 
        
        
            | Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents (Extended Abstract).   | IJCAI | 2018 | 0 | 
        
        
            | Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents.   | JAIR | 2018 | 0 | 
        
        
            | A Laplacian Framework for Option Discovery in Reinforcement Learning.   | ICML | 2017 | 209 | 
        
        
            | State of the Art Control of Atari Games Using Shallow Reinforcement Learning.   | AAMAS | 2016 | 0 | 
        
        
            | True Online Temporal-Difference Learning.   | JMLR | 2016 | 0 |