Jan Leike

17 publications

9 venues

H Index 9

Affiliation

Anthropic PBC, San Francisco, CA, USA
OpenAI, San Francisco, CA, USA
Australian National University, Canberra, ACT, Australia
University of Freiburg, Germany

Links

Name Venue Year citations
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision. ICML 2024 0
Let's Verify Step by Step. ICLR 2024 0
Training language models to follow instructions with human feedback. NIPS/NeurIPS 2022 0
Quantifying Differences in Reward Functions. ICLR 2021 0
Pitfalls of Learning a Reward Function Online. IJCAI 2020 14
Learning Human Objectives by Evaluating Hypothetical Behavior. ICML 2020 0
Reward learning from human preferences and demonstrations in Atari. NIPS/NeurIPS 2018 2
On Thompson Sampling and Asymptotic Optimality. IJCAI 2017 50
Universal Reinforcement Learning Algorithms: Survey and Experiments. IJCAI 2017 17
Generalised Discount Functions applied to a Monte-Carlo AI u Implementation. AAMAS 2017 4
Deep Reinforcement Learning from Human Preferences. NIPS/NeurIPS 2017 676
A Formal Solution to the Grain of Truth Problem. UAI 2016 14
Thompson Sampling is Asymptotically Optimal in General Environments. UAI 2016 37
Loss Bounds and Time Complexity for Speed Priors. AISTATS 2016 7
On the Computability of AIXI. UAI 2015 10
Sequential Extensions of Causal and Evidential Decision Theory. ADT 2015 14
Bad Universal Priors and Notions of Optimality. COLT 2015 36
Copyright ©2019 Universität Würzburg

Impressum | Privacy | FAQ