From Past to Future: Rethinking Eligibility Traces. | AAAI | 2024 | 0 |
Position: Benchmarking is Limited in Reinforcement Learning Research. | ICML | 2024 | 0 |
Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation. | NIPS/NeurIPS | 2024 | 0 |
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization. | AAMAS | 2023 | 0 |
Multi-Step Generalized Policy Improvement by Leveraging Approximate Models. | NIPS/NeurIPS | 2023 | 0 |
Behavior Alignment via Reward Function Optimization. | NIPS/NeurIPS | 2023 | 0 |
A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning. | NIPS/NeurIPS | 2023 | 0 |
Fairness Guarantees under Demographic Shift. | ICLR | 2022 | 9 |
Constrained Offline Policy Optimization. | ICML | 2022 | 4 |
Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer. | ICML | 2022 | 2 |
Off-Policy Evaluation for Action-Dependent Non-stationary Environments. | NIPS/NeurIPS | 2022 | 0 |
Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection. | AAMAS | 2021 | 8 |
Universal Off-Policy Evaluation. | NIPS/NeurIPS | 2021 | 26 |
Posterior Value Functions: Hindsight Baselines for Policy Gradient Methods. | ICML | 2021 | 4 |
A Methodology for Neural Network Architectural Tuning Using Activation Occurrence Maps. | IJCNN | 2019 | 5 |
A Compression-Inspired Framework for Macro Discovery. | AAMAS | 2019 | 0 |
Towards Designing Optimal Reward Functions in Multi-Agent Reinforcement Learning Problems. | IJCNN | 2018 | 5 |
Comparing Multi-Armed Bandit Algorithms and Q-learning for Multiagent Action Selection: a Case Study in Route Choice. | IJCNN | 2018 | 6 |
Learning to Minimise Regret in Route Choice. | AAMAS | 2017 | 15 |
Task-based behavior generalization via manifold clustering. | IROS | 2017 | 0 |
Context-Based Concurrent Experience Sharing in Multiagent Systems. | AAMAS | 2017 | 3 |
A Flexible Approach for Designing Optimal Reward Functions. | AAMAS | 2017 | 3 |
Energetic Natural Gradient Descent. | ICML | 2016 | 16 |
Learning parameterized motor skills on a humanoid robot. | ICRA | 2014 | 48 |
Active Learning of Parameterized Skills. | ICML | 2014 | 23 |
Biasing the behavior of organizationally adept agents: (extended abstract). | AAMAS | 2013 | 3 |
TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration. | AAAI | 2012 | 0 |
Learning Parameterized Skills. | ICML | 2012 | 195 |
Improving reinforcement learning with context detection. | AAMAS | 2006 | 24 |
Dealing with non-stationary environments using context detection. | ICML | 2006 | 157 |
RL-CD: Dealing with Non-Stationarity in Reinforcement Learning. | AAAI | 2006 | 0 |
ITSUMO: an Intelligent Transportation System for Urban Mobility. | AAMAS | 2006 | 0 |