Bruno C. Da Silva 0001

32 publications

8 venues

H Index 9

Affiliation

University of Massachusetts, Amherst, MA, USA
Federal University of Rio Grande do Sul (UFRGS), Institute of Informatics, Porto Alegre, Brazil

Links

Name Venue Year citations
From Past to Future: Rethinking Eligibility Traces. AAAI 2024 0
Position: Benchmarking is Limited in Reinforcement Learning Research. ICML 2024 0
Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation. NIPS/NeurIPS 2024 0
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization. AAMAS 2023 0
Multi-Step Generalized Policy Improvement by Leveraging Approximate Models. NIPS/NeurIPS 2023 0
Behavior Alignment via Reward Function Optimization. NIPS/NeurIPS 2023 0
A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning. NIPS/NeurIPS 2023 0
Fairness Guarantees under Demographic Shift. ICLR 2022 9
Constrained Offline Policy Optimization. ICML 2022 4
Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer. ICML 2022 2
Off-Policy Evaluation for Action-Dependent Non-stationary Environments. NIPS/NeurIPS 2022 0
Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection. AAMAS 2021 8
Universal Off-Policy Evaluation. NIPS/NeurIPS 2021 26
Posterior Value Functions: Hindsight Baselines for Policy Gradient Methods. ICML 2021 4
A Methodology for Neural Network Architectural Tuning Using Activation Occurrence Maps. IJCNN 2019 5
A Compression-Inspired Framework for Macro Discovery. AAMAS 2019 0
Towards Designing Optimal Reward Functions in Multi-Agent Reinforcement Learning Problems. IJCNN 2018 5
Comparing Multi-Armed Bandit Algorithms and Q-learning for Multiagent Action Selection: a Case Study in Route Choice. IJCNN 2018 6
Learning to Minimise Regret in Route Choice. AAMAS 2017 15
Task-based behavior generalization via manifold clustering. IROS 2017 0
Context-Based Concurrent Experience Sharing in Multiagent Systems. AAMAS 2017 3
A Flexible Approach for Designing Optimal Reward Functions. AAMAS 2017 3
Energetic Natural Gradient Descent. ICML 2016 16
Learning parameterized motor skills on a humanoid robot. ICRA 2014 48
Active Learning of Parameterized Skills. ICML 2014 23
Biasing the behavior of organizationally adept agents: (extended abstract). AAMAS 2013 3
TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration. AAAI 2012 0
Learning Parameterized Skills. ICML 2012 195
Improving reinforcement learning with context detection. AAMAS 2006 24
Dealing with non-stationary environments using context detection. ICML 2006 157
RL-CD: Dealing with Non-Stationarity in Reinforcement Learning. AAAI 2006 0
ITSUMO: an Intelligent Transportation System for Urban Mobility. AAMAS 2006 0
Copyright ©2019 Universität Würzburg

Impressum | Privacy | FAQ